FOREWORD


Parks Canada is pleased to make this material
available to the research community. The contents of this
volume are truly as the subtitle suggests - technical.
Whereas Volume I in this series is intended primarily for
administrative and policy personnel, this volume is intended
primarily for researchers. The contents include a wealth of
new and exciting approaches to the study of outdoor
recreation and recreation research in general.

In 1967, when Knetsch produced an exploratory CORD
Study research program, the state of recreation research was
embryonic. At that time no one could have predicted which
available methodological techniques essential to the CORD
Study needed to be revised and which new techniques would
need to be developed. Work on park-use survey methods was
released in 1968. The only discussions of motivation factors
affecting participation rates that were being produced at
that time were theoretical rather than practical. Needless
to say, each of us is aware of the rapid development of
computer usage and methodology that has had a revolutionary
effect upon large-scale data processing and analysis since
1967. Demand estimation methodologies were published in
1969. All this occurred while CORD Study researchers were
in the field collecting data.

As this volume indicates, new analysis techniques had
to be developed throughout the Study. Some of these
techniques are still undergoing testing and refinement. In
many ways, Canada has assumed a leadership role in this
effort. As late as 1974, the United States Bureau of Outdoor
Recreation still perceived the field of outdoor recreation
to be in such a state that it held a National Seminar to
define recreation research needs.

Considering all of the factors cited, it would have
been easy to abandon the CORD Study in the face of
developments which suggested possibly better ways to achieve
the aims of the Study. However, by taking a flexible view of
the original design, and solving methodological problems as
they arose, the Study has produced much that can be
considered a major contribution to research methodology.
This volume, then, in the years to come, will provide
researchers and others with a text that can serve as a
methodological reference and as an instructional source.


John I. Nicol,
Director General,
Parks Canada.


A KEY TO READING THE EQUATIONS IN THE TEXT


Because of the limitations imposed by the computer
print train used to produce this text, certain substitutions
of standard printed mathematical notations were necessary.
The following is a guide to these:


TEXT NOTATION     STANDARD NOTATION
[illegible]       Σ    (summation)
PROD              Π    (product of several terms)
PI                π    (3.1416...)
x**y              xʸ   (x raised to the power of y)
x**(1/2)          √x   (square root of x)
expY              eʸ   (e raised to the power of y)
&                 ε    (error term)
d(x)              dx   (differential)
[illegible]       ∫    (integration)
L                 λ    (lambda)
 2                 2
X                 χ    (chi-squared)
approx =          ≈    (approximately equal to)

Some symbols, e.g. "EL", have been used in place of
certain arbitrary notations (e.g. [symbol illegible]). The
meaning of these symbols has been explained in the text. In
most instances, when modifications to an equation were
necessary because of the length of a printed line or spacing
of text, FORTRAN notation was used.

CHAPTER I 


INTRODUCTION TO THE 
CANADIAN OUTDOOR RECREATION DEMAND STUDY TECHNICAL NOTES 


GENERALITIES 


The Canadian Outdoor Recreation Demand (CORD) Study
Technical Notes are a tangible product of the CORD Study.
Following the acceptance of the design by J.L. Knetsch (see
DESIGN FOR ASSESSING OUTDOOR RECREATION DEMAND IN CANADA
below), which contained plans for data collection, it was
necessary to specify the analyses needed. Technical Note
(TN) 34, the "Working Paper", was prepared as a first step
in meeting the need for specific analysis strategies. In TN
34, Knetsch stated:


If demand relationships can be understood with
even a modicum of success, various kinds of
productive analyses are possible that provide at
least partial answers to many difficult problems.
This is not to suggest, as was pointed out in the
original design paper, that demand analyses can
provide clear-cut answers to planning problems.
They cannot. Rather, it is to suggest that demand
analyses can provide important information useful
for planning and policy decisions.


An integral aspect of research must be to anticipate
policy and planning needs; however, it is necessary to
remember that every Canadian Parks organization has its own
particular policies and priorities. No one research effort
can concentrate on all needs of all organizations; rather,
all organizations involved in recreation research will have
only certain basic concerns in common. While policies and
objectives may vary, there will be large areas in which
important exchanges can take place. These can include
exchanges of quantitative information, of conceptual
development, and of research findings related to methods and
ways of evaluating policy. It is within the context of such
exchange that a CORD Study type of cooperative effort has
the greatest promise of success.

The CORD Study could never provide specific answers to
questions about how much outdoor recreation land should be
acquired in Canada, and where. The "why" behind this is very
simply that the "shoulds" in relation to need for recreation
land can be defined only when comprehensive recreation
policies are specified. Land policy, manpower or program
needs should only be specified in a plan when details of
budgetary priorities and the issues of trade-offs between
having parks and historic sites rather than hospitals and
schools (or at least the trade-offs between competing
recreation activities) have been worked out. Part of
negotiating such trade-offs (as indicated in the "Working
Paper"), from an economist's perspective, should include
learning what value is put on park or historic site use, in
comparison to other consumption.

In 1976 (as in 1967), when one starts to consider using
"research" to aid policy makers and planners, one finds a
rather "shabby" collection of tools. Even though the CORD
Study made major progress in many areas, more methodological
development remains to be done. Often, what staff
researchers and consultants do for organizations using
accepted methods is methodologically unsound, and unbiased
"intuition" might offer better results (and cost much less).

In preparing this volume CORD Study researchers have
had to confront two sets of circumstances: 1) a set of
circumstances suggesting the necessity of a volume such as
this, and 2) situations that have pointed out the fallacy
of, and problems with, conclusions that show how much
hunting, fishing, driving for pleasure, tent camping, etc.,
there will or should be in the future. Although many
researchers and consultants estimate "demand", and conduct
"sophisticated modelling", such work is scratching the
surface of a very complex problem. Generally, in a
conceptual, structural, and statistical sense, good models
and methods have not been defined. For example, debate on
the use of consumer surplus to measure park value was at its
peak in the mid-sixties. Critiques of work in these areas
were needed to give researchers guidance so they could know
which work by others should be used and which should not.
CORD Study Technical Notes to a large extent are intended to
offer researchers this type of critique.

TN 34, the "Working Paper", is presented to indicate how
the CORD Study analysis developed along lines laid down in
[date illegible]. It provides a framework in which to
illustrate how certain Technical Notes relate to the efforts
made to understand the demand for outdoor recreation. It also
indicates the rationale for dividing this volume into
various collections of Technical Notes.

Given the methodological thrust of the CORD Study and
given that the "Working Paper" specified certain research
areas and related methodological problems, the Technical
Notes came to be classified under a number of specific
headings. The classification scheme used in this volume is
only one possible classification scheme for organizing the
Notes (see Table 2). Part of the problem in classifying the
Notes arises because a single Note can be placed under
several headings. The non-mutually exclusive classification
of some of the Notes given in Table 2 illustrates this
problem. Thus the inclusion of an article in a particular
chapter is somewhat arbitrary.

Introductory material to the Notes in each chapter
gives the reader some information about the events leading
to the preparation of specific Technical Notes and provides
other background that the technical editor believes the
reader will find interesting and useful.

The reviews at the end of each chapter differ
substantially from each other in terms of 1) their thrust or
objective and 2) the individual author's style. The reviews
do not point out detailed relationships between Notes in one
chapter and Notes in a later chapter, nor do they make
extensive, detailed or subtle reference to Notes in earlier
chapters. The approach taken was to make each review
primarily a review of the Notes in one chapter. Discussion
of relationships and implications between different Notes
is found in Chapters X and XI. However, considerable
cross-referencing does take place.

Table 1 indicates, in numerical order, short titles for
the forty-two CORD Study Technical Notes. This table is
provided to facilitate quick reference to the Technical
Notes cited only by number in most references.

There has been no attempt to make the volume
self-contained by providing lengthy explanations or
descriptions of other research. This volume is intended for
recreation research personnel who keep up with the demand
analysis literature. Nevertheless, some of the references
are obscure, and to prevent problems in locating such
material, Parks Canada has made copies available of all
listed references which are not in journals or otherwise
readily available. These documents are on file in the
LEISURE STUDIES DATA BANK of The Waterloo Research
Institute, and copies can be obtained at cost.

Finally, this volume does not provide general
information on the Canadian Outdoor Recreation Demand Study,
nor are there details on collection or processing of CORD
Study data. Persons interested in generalities about the
CORD Study may refer to Volume I: OVERVIEW AND ASSESSMENT.
Those who wish to have more details on CORD Study data
collection and processing, or who wish to replicate analyses
or carry out new analyses, may see Volume III: DATA
COLLECTION AND DOCUMENTATION. References to both of these
volumes in this work are given using abbreviated titles.


A DESIGN FOR ASSESSING
OUTDOOR RECREATION DEMANDS IN CANADA (1967)


J.L. Knetsch


This report presents the design of a series of
investigations to assess the demand for outdoor recreation
in Canada. The purpose is to set out the major conclusions
of this design effort, and to focus attention on the more
critical points on which decisions will need to be made in
the near future. The intent is to discuss only major
inferences and concerns, and to point out those to which the
earliest attention should be given. Extended discussion of
the many methodological points and findings reached in the
course of the design study can better be brought out in
other communications and reports. See especially the CORD
Study Data Collection and Documentation Volumes.


There are six major areas of conclusions: 


1. A critical need exists to have a far more complete
understanding and measurement of outdoor recreation
demands in Canada, to guide investment and management
planning, to identify and evaluate policy choices, and to
forecast recreation use of resources in Canada as it
relates to alternative development proposals.


2. Demand studies are fully feasible at the current time
within reasonable cost limits. Results of the
contemplated studies would be directly applicable to
outdoor recreation program planning at the several levels
of responsibility in Canada. Demand studies do not
directly supply policy and planning solutions, but they
do provide an important basis for making better
decisions.


3. The necessary investigations should be done as a series
of complementing tasks which combine into a strategy of
studies rather than as a single, more limiting study of
outdoor recreation demand. None by itself is a demand
study, but the whole series will provide needed demand
estimates.


4. The major components of the overall study would center
first on tasks primarily to acquire data from:
a. household surveys,
b. park user surveys,
c. supply inventories.

The study would then utilize the results of these surveys
in a series of analytical steps to assess demands and
needs, and to formulate planning and policy implications.
The result of the entire effort is heavily dependent on
the data program. Work on each of the three data programs
should be initiated early to supply the necessary
information in sequence and to provide a more
comprehensive data base. There is a need for
compatibility and uniformity in the outdoor recreation
data necessary for the demand study, which are collected
by the different agencies and jurisdictions. While a focus
may be on the National Parks, data and exchanges of data
from other sources, especially the Provinces, are
necessary to assess the demand for a wider spectrum of
outdoor recreation and to appraise meaningfully the
demand for use of the federal areas. Further, as the use
of each recreation area, regardless of jurisdictional
responsibility, bears relation to the use of others in a
region, data associated with a full range of recreation
opportunities will need to be obtained.


5. The several studies which are to be undertaken should be
done in part by responsible agencies and organizations
outside of Parks Canada and in part by staff members of
the Department. The capabilities of the different groups
differ widely, as do the requirements for individual
tasks in the demand studies. Responsibility for
coordination of the studies should best remain within
Parks Canada.


6. The studies should be structured to provide for a
continuing process of demand assessment. Demand responds
to alterations in the population and to changes in the
facilities and opportunities provided to that population.
Therefore, there can be no "once and for all"
determination of "the outdoor recreation demand." To
provide the useful guide to planning and policy for which
it is designed, the assessments will need to be
continuing. The major cost of the demand studies will be
in the early stages, with continuing programs of demand
studies becoming far more routine and, subsequently, less
expensive.


OUTDOOR RECREATION AND DEMAND 


A result of the accelerating effect of technological,
economic, political and social changes in our society and
in our environment has been a rapidly increasing awareness
of environmental amenities, their destruction, their lack of
availability, and increases in the demands for them. This is
especially true, and seems to find the sharpest focus in,
the case of parks and outdoor recreation. It has been amply
demonstrated that the demand for outdoor recreation is large
and is growing. If the physical opportunities exist - and
that is important - many people choose to spend time and
money on outdoor recreation. This is reflected in all series
of statistics or indices of magnitude - attendance counts,
license fees collected, equipment sales, or recreation
travel.

It is not enough to know that demand is increasing.
Public bodies - at national, provincial, and local levels -
and private firms need to have the nature of demand for
activities and facilities defined in such a way that they
can make rational policy and investment decisions to meet
the expressed desires of the various user publics.

The chief need is a better definition and examination
of the demand for recreation activities and facilities that
vary by type, location, and management. Information is
required on the future demand for present facilities and for
alternative areas and facilities that might be added.

Demand studies have a very practical purpose. Their
objective is to yield estimates or forecasts for improving
or adjusting the supply of recreation opportunities and to
estimate the probable effects of alternative programs and
policies. Recreation planners at any jurisdictional level
remain at a loss without some notion of how recreation use
will respond to alternative recreation opportunities that
might be provided.

What is needed is not a collection of miscellaneous
facts, but an understanding of the relationships inherent in
recreation behavior and ability to forecast the effects of
proposed alternative actions. A more efficient and
equitable provision of recreation opportunities is dependent
upon the recognition of the wide spectrum of outdoor
recreation possibilities in all regions of the country, and
an assessment of the relative demands for different segments
of this spectrum. Demand statements are tangible expressions
of personal values that are most significant as guides to
what people actually want.

Park and recreation area attendance statistics are
collected by nearly all public park and recreation agencies.
While there are difficulties associated with the accuracy
and comparability of these figures, the major weakness is
that far more data and analysis are required before an
explanation can be offered for the patterns of use that
occur. If the statistics from different areas are to be
meaningful, and if much is to be learned from them, they
must be related to economic, social, and physical
environments.

Many past studies of recreation demand have yielded far
less than would have been possible with similar resources. A
major factor has been a misunderstanding of the quantitative
results. This in large part stems from confusion of
recreation demand terminology, given wide currency by the
U.S. Outdoor Recreation Resources Review Commission studies.
Many have unfortunately and uncritically read into this
early work, and subsequently propagated, erroneous concepts
of demand. The trouble arises from a confusion over the
difference between demand and consumption. Use or attendance
figures are incorrectly called demand, instead of being
interpreted as consumption or the interaction of both
demand, which certainly exists, and supply of opportunities,
which also exists. The amount of recreation demand varies
with the number of facilities available, thereby making some
quantitative estimate of this dependence necessary for
sensible planning.

This is more than semantics. It can direct planning
efforts to wrong conclusions or to irrelevancies and blunt
plans and investment policy in outdoor recreation. As
Professor Wantrup warned years ago:


"existing projections of land and water use are
neither conceptually nor empirically identical
with projections of land and water demand. In the
first place, use projections do not separate
demand/supply conceptually nor statistically. If
demand is to serve as a principal orientation for
public land and water policy - that is to help in
planning on the supply side - problems of demand
and supply need to be separated conceptually and
in empirical investigation, variables pertaining
to demand must be differentiated from those
pertaining to supply."


The warning has been repeated in the technical
literature, and empirical evidence supports the importance
of distinguishing between consumption and demand.
Consumption data refer to and are measures of participation.
Participation is dependent upon demand but it is also
dependent upon supply. In fact, participation rates seem
more likely to vary with the supply of opportunities than
with demand factors. For example, people make greater use
of water recreation facilities per capita in the Maritimes
than they do in the Prairies, the differences having far
more to do with the availability of water than with
differences in incomes, education, or age distribution
between the two populations. The point is that observation
of what occurs will not alone permit judgments of relative
demands.

The extent to which the standardized methods and an
improper notion and interpretation of demand can go astray
was illustrated by public discussion of the Wisconsin [...]
Council of Ontario's Conference on Parks and Outdoor
Recreation in April of 1967. The evaluation of the user
statistics as demand followed the prevalent "demand"
determination methods. This led to the conclusion that some
$33,000,000 was needed in further recreation facilities to
catch up with demand. If these added facilities were put in
place and the same study repeated, it would unfortunately
show that instead of having met the demand with the
expenditure, as would have been expected, the gap between
supply and demand would be even greater! The added facility
supply would have brought about increased visitor numbers
which would then be mistakenly read as even greater
"demand".

A methodology which mistakes consumption, or in the
case of recreation, visits or use, for demand will indicate
more demand for the same type of facilities in those areas
where more of the facilities are located. This can have the
important consequence of perpetuating the kind of facilities
already in place in the areas already best served. Thus, as
facilities are developed and used, new studies would report
that more of the same should be built in these places.
Nearly any manner or amount of investment can be "justified"
and investment decisions may be severely warped.
Furthermore, some of the most important recreation demands
of the population are never brought to light. It says, in
effect, that if only white bread is available for purchase,
the subsequent buying of white bread by people shows that
there exists a demand for white bread but none for brown and
only more white bread should be supplied.
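
The circularity described above can be made concrete with a
small numerical sketch. It is an illustration only and is not
drawn from any of the studies discussed: it assumes that each
facility attracts a fixed number of visits (the hypothetical
visits_per_facility) and that the flawed method declares
"demand" to be a fixed multiple of whatever use it observes
(the hypothetical shortfall_ratio). Under those assumptions
the reported shortfall grows every time the recommended
facilities are built and the study is repeated.

```python
def study_gap(facilities, visits_per_facility=1000.0, shortfall_ratio=1.2):
    """Gap (in facilities) reported by a study that reads use as demand.

    Both parameters are hypothetical: each facility is assumed to attract a
    fixed number of visits, and the study is assumed to declare "demand" to
    be a fixed multiple of whatever use it observes.
    """
    observed_use = facilities * visits_per_facility   # consumption, not demand
    reported_demand = observed_use * shortfall_ratio  # use mislabelled as demand
    facilities_needed = reported_demand / visits_per_facility
    return facilities_needed - facilities


def repeat_study(rounds=4, facilities=100.0):
    """Build whatever each study recommends, then repeat the study."""
    gaps = []
    for _ in range(rounds):
        gap = study_gap(facilities)
        gaps.append(gap)
        facilities += gap  # the recommended facilities are put in place
    return gaps


if __name__ == "__main__":
    for number, gap in enumerate(repeat_study(), start=1):
        print(f"study {number}: reported shortfall of {gap:.1f} facilities")
```

Because the sketch contains no independent measure of demand,
no amount of building can ever close the gap it reports, which
is the weakness the text attributes to such methods.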

The standardization of methodology, the bulk of the
reports, the mass of machine-derived numbers, and the dollar
cost involved do not lessen the inherent weakness of these
procedures. The concepts underlying the methods are simply
inadequate for the intended purpose, making the subsequent
collection of facts and figures almost worthless.

The single most serious and most fundamental deficiency
in most demand surveys and studies is that they do not
provide any means of determining how recreation use will
respond to changes in supply - and that, after all, is the
portion on which guidance is needed. The studies,
consequently, are of little value as an aid to planning or
to policy decisions.

Recreation demand studies, to be useful for planning
purposes, must consider the effect of both supply and demand
factors on recreation use or participation. Use data in the
form of participation rates of population segments or visits
to recreation areas must be obtained, but the interpretation
must consider that both demand and supply variables explain
or determine these rates. That is, the emphasis should be
placed on determining and explaining patterns of use which
emerge given an availability of opportunities and the
characteristics of the using populations. Data should be
collected and analyzed in such a way that predictions can
be made of how the use patterns would be expected to change
given changes in supply, that is, changes in the
availability of recreation opportunity. This would allow
estimates to be made of the consequences of varying
recreation investments or varying recreation policies. For
example, it would then be possible to make a forecast of the
use that might be expected of a proposed provincial
recreation area, taking explicitly into account such things
as its location, size, facilities, and the existence of
other recreation areas in proximity to it.
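
One way to picture the kind of analysis called for here is a
small least-squares sketch. Everything specific in it is an
assumption made for illustration and is not part of the study
design: the log-linear (gravity-type) form, the choice of one
demand-side variable (population) and two supply-side
variables (distance and capacity), the function names, and
the synthetic observations, which are generated from a known
rule so that the fit can be checked.

```python
import math


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        known = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - known) / m[r][r]
    return x


def fit_log_linear(rows):
    """Least-squares fit of log(visits) on log(population, distance, capacity).

    Each row is (population, distance_km, capacity, visits).
    Returns [intercept, b_population, b_distance, b_capacity].
    """
    xs = [[1.0] + [math.log(v) for v in row[:3]] for row in rows]
    ys = [math.log(row[3]) for row in rows]
    k = len(xs[0])
    xtx = [[sum(x[i] * x[j] for x in xs) for j in range(k)] for i in range(k)]
    xty = [sum(x[i] * y for x, y in zip(xs, ys)) for i in range(k)]
    return solve(xtx, xty)  # normal equations (X'X) b = X'y


def forecast(coefs, population, distance_km, capacity):
    """Predicted visits for a proposed area under the fitted relationship."""
    logs = [1.0, math.log(population), math.log(distance_km), math.log(capacity)]
    return math.exp(sum(c * v for c, v in zip(coefs, logs)))


def synthetic_rows():
    """Hypothetical observations generated from a known rule (no real data)."""
    areas = [(50000, 20, 100), (200000, 80, 250), (120000, 150, 400),
             (30000, 45, 60), (500000, 300, 800), (80000, 10, 150)]
    return [(p, d, c, 0.05 * p * d ** -1.5 * c ** 0.5) for p, d, c in areas]


if __name__ == "__main__":
    coefs = fit_log_linear(synthetic_rows())
    print("fitted elasticities:", [round(b, 3) for b in coefs[1:]])
    print("forecast visits:", round(forecast(coefs, 100000, 60, 200)))
```

The supply-side terms are what allow the fitted relationship
to answer "what if" questions about a proposed area; a
tabulation of participation alone could not do so.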

The important implication of this for the design of
recreation demand investigations is that it is quite
impossible to carry out studies in a meaningful way by only
asking people how many times they participated in various
kinds of outdoor recreation activities. The supply of
opportunities is ignored; consequently, very little of the
difference in participation rates among different
individuals or even groups of individuals can be explained.
A far more critical part of the explanation of recreation
behavior is the supply side of the equation.

Statements can be made of the demand in Canada for any
commodity - automobiles, houses, or beef - but these must be
taken to mean the quantity demanded at the prevailing
prices, incomes and given levels of other factors. If these
prices or other factors were different, the amount demanded
would differ. The demand for outdoor recreation similarly
depends on various factors. One of the most important of
these is the availability of opportunities; there can be no
set quantity of recreation demanded by a population. Any
prediction of recreation demand must, therefore, consider
supply elements.

Listings of recreation areas, with their
characteristics and other inventory data, are compiled in
increasingly complete fashion. However, owing to lack of
standardization in facility measurement, the difficulty of
assessing general resource availability for activities such
as hunting and fishing which take place outside organized
recreation areas, and the complexity of coping with quality
differences, measures of supply are not easily quantified.
Whatever explicit measures of supply are available must be
made an integral part of any studies to be undertaken.

The demand study should be designed to yield data for
making estimates and gaining insights into the recreation
demand phenomena throughout Canada for a very broad spectrum
of outdoor activities. The spectrum of outdoor recreation
should be as wide as possible. Remote recreation
opportunities such as many of the National Parks, various
types of intermediate areas such as many Provincial Parks,
and user-oriented recreation areas typified by urban parks
should all be included. Studies should point out the demands
for outdoor recreation in Canada, and the roles played both
by the National Parks and by other park and recreation areas
in meeting this demand. In setting forth a broad view of
recreation as a point of departure, insight can be gained
into the varied demands for recreation and the
interrelationships between these demands.

The data should be collected and analyzed in such a way
that estimates of the complementarities and competitive
aspects between recreation areas and facilities can be
made, however crudely. It should be possible to say,
for example, that if the facilities or opportunities are
increased in an area by a certain amount, the
visitation to existing areas should go up, down, or remain
unchanged. This would have very important implications for
investment decisions and policy formulation. For instance,
it would be of great interest to know the effect of
development of Provincial parks and recreation areas on the
use which is made of National Parks and - probably even more
important and of greater magnitude - the effect of
development of urban-oriented outdoor recreation areas on
Provincial parks.

Another aspect which the demand studies must consider
is the pent-up demands not currently being expressed by
the population because of the non-availability of facilities
or areas. The appraisal of this aspect of demand can prove
most difficult. It is one thing to estimate how use would
vary depending upon the proximity of population to a park or
recreation area of a given type and quality, but it is quite
another problem to estimate the use which would take place
in an area or at a facility which does not currently exist.
Some estimate or feeling beyond sheer conjecture of such
demands is needed. The demands can perhaps be approximated
by observing close parallels and extrapolating from this
experience, or they may be partially assessed through
household surveys.

Another general problem which is also difficult is to
establish demand parameters in such a way that useful
statements can be made about changes in demands over time,
particularly over the long periods of time which are
implicitly contemplated in recreation investments.
Recreation in its current dimension is a rather recent
phenomenon, certainly in large part stemming from the post
World War II period. As such, the relationships have not
been as stable as relationships existing for many
commodities in which demand studies are often made, such as
food products, apparel, automobiles, or appliances. There
are within outdoor recreation activities degrees of faddism
and rapid changes in taste, sometimes owing to changes in
technology as, for example, in motor boating. That the
general demand for outdoor recreation will be increasing
throughout the foreseeable future is undeniable, but
estimating the changes in the components of the demand is
going to pose continuing difficulty. The study should be
designed to yield some insight into these problems.

The study should also proceed so that further
investigations can be carried on which will complement the
initial or early studies. That is, the recreation demand
study cannot be a "once and for all" study but should be a
continuing program of research and planning with a series of
new studies updating data, incorporating new and
continuing analysis, and exploring different spectrums of
the recreation demand question.

The simple collection of facts and figures is not
enough. The end should be an orderly classification and
explanation of the interrelationships inherent in the
figures. This explanation should be the central goal of a
continuing program of recreation demand studies.


THE STUDY PROGRAM


The demand study should be performed as a sequence of
individual studies or tasks. These should be coordinated and
scheduled in such a way that the information needed for one
task is generated by the preceding work, and should build
from work already performed or underway.

The major data generation phases will need to be
initiated first. This will involve work on:
1. household surveys, in which recreation habits
of Canadians will be determined;
2. park and recreation user surveys, to ascertain
the nature of use of recreation areas of all
types in all regions, together with
characteristics of the users; and
3. recreation supply inventories, which will
assess the extent and nature of recreation
opportunities available to Canadians.


The subsequent phases will deal with analysis of the
data and the determination of demand parameters. The program
should lead to an assessment of relative demands and related
needs and to the implications for planning recreation
programs, policy formulation, and development strategy not
only for the National Parks, but also for other recreation
areas provided by all levels of government and private
agencies.


Task 1: Recreation Participation Screening* 


This portion of the study will focus on 
participation rates among various population 
segments in all regions of Canada. It will yield 
information on the proportions of the population 
which take part in recreation activities and 
visit parks of different kinds. This should 
provide initial useful data and be invaluable for 
structuring later tasks. 


This study should be performed early. It can, in 
view of the lack of availability of Statistics 
Canada surveys, such as the Labour Force Survey, 
be most expediently done by existing market 
research firms who have well established 
population probability samples covering all 
segments and all regions of the country. These 
surveys, which are commonly done twice a year, are 
designed to include a range of questions serving 
various clients and can be augmented to include a 
small number of questions relating to outdoor 
recreation participation. It is of the highest 
priority, in terms of timing; the scheduling of 
later studies and the necessary sequencing require 
that it be done in the fall of 1967. 


* The flow chart for these tasks, prepared by Knetsch, 
along with relevant information on which tasks were 
carried out when, is found in Volume I. One may wish to 
look at Figure 1 and Tables 1 to 5 of Volume I. 


Task 2: A Study of Motivational Factors 


DESIGN 


This study would attempt to isolate various 
motivational factors which are associated with 
participation in outdoor recreation activities 
among people in different socio-economic and 
locational situations. Activity participation 
rates vary enormously between different 
individuals. Part can be accounted for by 
differences in the opportunities available. Part 
can be explained by socio-economic variations. Yet 
major differences remain. The task study would 
seek to discern the importance of personality and 
other motivating factors which are linked to 
preferences for outdoor recreation. This can also 
best be done under contract with existing market 
research firms. It would call for depth interviews 
with recreation activity participants and 
non-participants. It would be expected to provide 
direct input as guidance for structuring the 
household survey (Task 3) and to yield significant 
insight into the meaning of responses obtained in 
this and other segments of the overall study. It 
may well also have important planning and policy 
implications of its own. 


Task 3: Household Survey 


This study would be designed not to yield 
estimates of demands as such, but to give 
information on the habits and preferences of 
Canadians and on comparative participation rates 
among the population for various activities. It 
would be used to gain far more insight into the 
proportions of participants and non-participants 
among population segments than the initial 
screening study and would explore preferences of 
what people like to do, isolate factors which are 
associated with engaging in outdoor recreation 
activities, and investigate reasons for 
nonparticipation. Variations in participation 
among very broad regions will be investigated, 
although the sample size will necessarily be too 
small to make meaningful interplace comparisons 
among small regions. A fundamental and major 
weakness of this technique is its inability to 
link use or participation rate differences 
adequately with differences in recreation 
opportunity. In this study, insight would also be 
gained into the demand for activities for which 
there is little or no opportunity. 


There are many problems associated with these 
household surveys. The cost can be excessive if 
large samples and personal interviews are used. 
There are problems associated with mail surveys 
and with phone interviews as well. Furthermore, 
recall of number of days of participation or the 
degree of participation poses an additional 
problem in single interview surveys. 


The use of survey panels of large numbers of 
individuals in all population strata in all 
regions of Canada, which are maintained by market 
research organizations, offers a possible 
opportunity to minimize some of the disadvantages. 
For the immediate purpose, such panels, which can 
be resurveyed and are fairly representative of the 
entire population, may provide the most feasible 
alternative. If such a panel interview study is 
conducted, or if any other method is chosen, 
consultation with Statistics Canada should be made 
an integral part of this phase of the work. 
Further work of this nature should also be 
coordinated with the Canadian Government Travel 
Bureau and the Canadian Wildlife Service, each of 
which has active program plans in the area. The 
design of the household survey should be carefully 
done and reflect the results of the initial 
screening and the motivational factor studies. 


Task 4: ARDA Land Inventory 


The next series of individual tasks relates to 
supply identification and structuring. 


The demands for recreation opportunities need to 
be assessed against the background of existing 
areas and facilities. Details will be required on 
the type, extent of facilities and development, 
location, size and degree of access of recreation 
areas in Canada. The operation must have a high 
degree of uniformity of view and of classification 
across the whole country. The extent of the data 
collection can vary more than the classification 
scheme to be used and the kinds of data to be 
collected. A useful initial guide to classifying 
areas may be the existing Federal Provincial Park 
Classification Scheme. 


Inventories are difficult to structure owing to 
the lack of standardization, the imperfection of 
quality determinants, the many individual 
attributes and dimensions of park and recreation 
areas, and the existence of a wide spectrum of 
types of areas and facilities to be included. 
However, it is necessary to have available in a 
manageable data system information on existing 
recreation supply. The demand study, by 
making explicit the supply data requirements, 
should aid materially in defining which inventory 
data should best be collected and its most 
advantageous form. The inventory work should 
proceed so that it continuously and successively 
builds on the initial data collected and 
assembled. The more complete the inventory, the 
more utility it has; but an incomplete inventory 
is still highly useful. 


This task is concerned with the utilization of the 
Canada Land Inventory information system and with 
structuring of the supply determination effort. 
The initial task is to utilize and adjust the data 
to the Canada Land Inventory computer mapping 
program. This work is in advanced stages of 
completion and should be utilized as the basic 
framework for the inventory data. This program 
should provide a most useful base, first because 
of the classification data that may be a useful 
part of the inventory itself, and second, because 
of the associated development of map and computer 
techniques which appear to be invaluable to the 
organization of inventory and supply data. All of 
the inventory information should be keyed to the 
computer mapping unit by the utilization of map 
coordinate data. 


Task 5: Inventory of National Areas 


This task can be well done within the Federal 
agency. It involves an inventory of existing 
national areas relating to outdoor recreation 
supply. These data should be classified by type, 
characteristics, and locations. The initially 
required information should be readily available. 
It should be tabulated for each area with 
appropriate linkage for assembling into the data 
bank of the Canada Land Inventory computer mapping 
operation. 




Task 6: Inventory of Provincial Areas 


This task will need to be done by appropriate 
agencies of the individual provinces. The 
inventory task is similar to that for the national 
areas and each should maintain a high degree of 
complementarity to the others. While the 
inventories will need to be continually appended, 
the initial phases of the necessary assessment of 
supply should be compiled within the first year. 


Task 7: Inventory of Local Areas 


The work contemplated in this task would 
supplement the other inventory work by providing 
information on many of the remaining recreation 
areas and facilities not included by national or 
provincial agencies. Many of the areas to be 
included are urban and regionally oriented. 
Private areas, to the extent that areas provided 
are of a nature that they directly and 
significantly affect the use of public areas, by 
their effect on the relevant supply, should also 
be inventoried. 


Much of the work can be done by local agencies, 
with the possibility that some of it may be more 
suitable for outside contract. In no sense would 
the inventory be a complete one, particularly in 
the initial phases of the program, but even 
partial information on many of the areas would add 
materially to the necessary assessment of outdoor 
recreation supply in Canada. 


Task 8: Coordinating Supply Data 




This work, to be largely done internally, is one 
of coordinating the data collection and increasing 
the compatibility of the data collected from 
various sources. The various surveys or 
inventories may require somewhat different 
techniques and operations to acquire the desired 
data. The major objective of this task is to 
reduce the disparities to a standardized format so 
that the data can be related to other survey 
results in the demand study. 


Task 9: Review of Existing User Surveys and Data 


This task would consist of an assessment of 
ongoing data collection programs involving 
recreation use of various areas and facilities and 
recreation travel. To be included, besides the 
considerable efforts of park and recreation 
agencies in all parts of Canada, is the work of 
the Roy Wolf Associates - DBS study of recreation 
statistics. Also of primary interest would be the 
highway origin and destination surveys, which are 
a continuing part of the highway and road 
development programs. 


The applicability of the relevant U.S. surveys 
should be reviewed at this juncture. These would 
include any information that would supplement the 
Canadian surveys in the 1960 National Recreation 
Survey of ORRRC, the later one sponsored by the 
U.S. Bureau of Outdoor Recreation, and any surveys 
conducted in border states. The usefulness of the 
data and surveys should be explored but will 
likely be limited. One study, currently underway 
at Rutgers University, may prove of benefit 
because of the attempt to relate the participation 
data to supply characteristics. (See TN 34 for a 
discussion of their study.) 


Task 10: National Park User Survey 




This task, and the three to follow, focus on the 
collection of primary data on use made of existing 
areas and facilities. The purpose of this work is 
to collect data on the use and users of the full 
range of types of recreation facilities in all 
areas of the country; to assess the amount, timing 
and patterns of use of these areas and the 
characteristics of the users. This series of 
surveys will provide a major portion of the basic 
data for establishing recreation use 
relationships. This will enable predictions to be 
made of the consequences of varying the types or 
quantities of recreation activities or facilities. 
The work will consist of isolating recreation 
areas and facilities throughout the country in 
which a full range of the array of important 
dimensions of characteristics of recreation areas 
are covered, including types of areas, locations 
of areas, and administration of areas. 


It is important that reliable total visit 
statistics be obtained for each of the areas 
investigated. Out of the totals a very small 
sample of visitors is interviewed using a short 
questionnaire. The samples need only be drawn on 
perhaps less than eight or ten days throughout the 
year, with only a fraction of the visitors on 
those days actually questioned. The questions 
asked are similarly limited, with average 
interviews lasting one or two minutes. The major 
concerns are with the trip origin, purpose, and 
limited information about the party. Efforts of 
this level in other studies have proven very 
satisfactory. Fully adequate methods are available 
which are relatively inexpensive, entail a minimum 
of difficulty at the site and to the park user, 
and yield meaningful results. The purpose of this 
task is to implement a system of gathering the 
relevant data on visits and visitors to the 
national areas. 
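

The combination described above, a reliable total visit 
count plus a very small interview sample, can be sketched 
as a simple proportional expansion. This is only an 
illustration: the text does not specify an estimator, and 
the function name, origin labels, and figures below are 
hypothetical. 

```python
from collections import Counter


def expand_sample_to_totals(total_visits, sampled_origins):
    """Scale the origin shares seen in a small interview sample
    up to the independently counted total visits for one area.

    Assumption: simple proportional expansion; the source text
    names the inputs but not the estimator.
    """
    counts = Counter(sampled_origins)
    n_interviews = sum(counts.values())
    if n_interviews == 0:
        raise ValueError("no interviews in sample")
    return {
        origin: total_visits * count / n_interviews
        for origin, count in counts.items()
    }


# Hypothetical figures for illustration only.
sample = ["Calgary"] * 30 + ["Edmonton"] * 15 + ["U.S."] * 5
estimates = expand_sample_to_totals(200_000, sample)
for origin, visits in estimates.items():
    print(f"{origin}: {visits:,.0f} estimated visits")
```

The sketch shows why the total count matters: the short 
interviews supply only shares, and the estimates are no 
better than the total to which they are scaled. 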


Task 11: Provincial Recreation Area User Surveys 


The work to be accomplished in this task is to 
collect data on use of provincial areas parallel 
to that described for national areas. As the 
areas are more dissimilar and the data are 
collected by different agencies, it is necessary 
that attention be given to careful design to assure 
that the results obtained from the different areas 
are fully compatible. 


Task 12: Local Recreation Area User Surveys 


The objectives sought in this task parallel those 
of the preceding two series of surveys. The 
problems of uniformity are more severe and other 
problems will be more acute, but the requirement 
to obtain user data on local recreation areas 
remains. A representative range of conditions of 
areas, facilities, and locational proximities 
should at the least be surveyed in each major 
region. These areas may well be among the most 
important in the total spectrum of outdoor 
recreation and have a direct bearing on the use of 
other areas. 


Task 13: Border Exit Survey 




An important segment of use of many areas, 
particularly in certain regions of the country, 
originates with visitors from outside Canada - 
primarily the United States. Among the surveys of 
use made of facilities should be included a border 
exit survey of a sample of nonresidents leaving 
the country. The information to be obtained would 
include the use which non-residents made of 
recreation areas within Canada, the type and 
location of areas visited, the number of times, 
and the purpose. This would afford a means of 
adding the non-resident dimension to the total 
demand picture. The border survey should be done 
in complete cooperation with the Canadian 
Government Travel Bureau. It can probably be most 
efficiently done with contract services. 


Task 14: Participation and Supply Analysis 


This and the following segments of the recreation 
demand study consist of data and program analysis 
sections. In each, the major efforts of data 
processing and analysis should be centered in one 
location, although parts may well be contracted 
out or otherwise separated. In each case duplicate 
data tapes should be made available. 


In many ways the problems become more complicated 
after the data has been collected. While there is 
a good deal of theoretical work to draw on that is 
directly applicable, the attempts to quantify 
recreation demand have been of far more limited 
scope than those envisioned here. Attempts would 
be made in this task to gain insight into 
participation in outdoor recreation activities by 
different segments of the whole population. Data 
from the motivational study, the household 
surveys, and the inventory data, with some input 
from the user surveys, would be used to formulate 
broad relationships between participation rates 
and regional supply. Socio-economic factors 
measure to some extent the physical capacity to 
engage in recreation pursuits and the propensity 
to participate, but this is not enough for 
rational planning. The supply dimension, to be 
examined in this task, needs to be taken into 
account in explaining the observed use rates. 


Task 15: Use Prediction Analysis 


This task would use as primary data the results of 
the user surveys, plus the supply and 
participation analysis task. It is a major element 
in the entire study. 


Use prediction models would be formulated, 
empirically relating recreation area use to the 
various factors contributing to or detracting from 
this use, including such things as the influence 
of proximity to population centers, the size of 
the area, the facilities developed, and the 
alternatives available in the same area. Also, 
degrees of substitutability between recreation 
areas would be ascertained utilizing this data. 
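

A use prediction model of the kind described might take a 
multiplicative form, sketched below. The text names the 
factors (proximity to population centers, area size, 
facilities, and alternatives) but not a functional form, so 
the form, the function name, and every coefficient here are 
assumptions made for illustration only; in practice the 
coefficients would be estimated from the user survey data. 

```python
def predicted_visits(population, distance_km, area_size_ha,
                     n_facilities, n_alternatives,
                     k=0.05, b_dist=1.5, b_size=0.3,
                     b_fac=0.4, b_alt=0.5):
    """Illustrative multiplicative use model.

    All coefficients (k, b_*) are hypothetical placeholders,
    not estimates from the CORD Study.
    """
    if distance_km <= 0 or area_size_ha <= 0:
        raise ValueError("distance and area size must be positive")
    # Factors contributing to use: size and developed facilities.
    attraction = area_size_ha ** b_size * (1 + n_facilities) ** b_fac
    # Factors detracting from use: distance and nearby alternatives.
    friction = distance_km ** b_dist * (1 + n_alternatives) ** b_alt
    return k * population * attraction / friction


# The same hypothetical area seen from a near and a far population
# center, and with more competing areas available nearby.
near = predicted_visits(500_000, 50, 1_000, 10, 2)
far = predicted_visits(500_000, 200, 1_000, 10, 2)
more_alternatives = predicted_visits(500_000, 50, 1_000, 10, 8)
print(round(near), round(far), round(more_alternatives))
```

In such a form, the exponent on the number of alternatives 
is one crude way to express the substitutability between 
areas that the task seeks to measure. 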


Estimates of how much use may be expected to change 
over time, given changes in the population 
characteristics, and given changes in the supply 
configuration, would be formulated. The 
relationships established in the data analysis 
work of this task would form a primary means of 
assessing the outdoor recreation demands in 
Canada. 


Task 16: Assessment of Relative Demands and Relative Needs 


The next analytical section of the overall study 
is concerned primarily with the assessment of 
relative demands and relative needs. Demand is not 
an absolute, but is a functional relationship 
between population characteristics and recreation 
opportunity. The whole outdoor recreation demand 
study will focus importantly on the assessment of 
relative priorities and investments and policies 
established on the basis of relative deficiency of 
supply to demands. The chief data inputs would 
come from the preceding analysis studies and the 
inventory data in the initial surveys. 


Task 17: Development of Policy Programs and Strategy 


The purpose of this task is to pull together the 
data which has been generated in the prior tasks, 
and to formulate the recreation policy 
implications, the recreation development program 
and the broad strategy for coping with the 
indicated recreation demands. There will be a 
continuous flow of information useful for planning 
from many of the earlier steps, but the largest 
payoff will occur at this stage of the demand 
study. 


It is anticipated that the program of studies 
outlined will provide the necessary data input for 
outdoor recreation planning processes to meet the 
multiple needs of the various agencies in Canada 
charged with recreation responsibilities. 


Task 18: Continuing Data and Analysis Program 




Planning for the provision of outdoor recreation 
opportunities is a continuous process. Similarly 
the data program and analysis should continue. 


The initial round of surveys, data processing, and 
analysis can be fully expected to yield immensely 
useful planning, investment, management, and 
policy guides. The utility of this will increase, 
however, as the data becomes more complete, the 
survey operations and data analysis more routine, 
and the estimates more extensive and precise. 


PROGRAM TIME, COST, AND STAFF 


The initial round of studies should be initiated in the 
current fiscal year and programmed over the three fiscal 
years of 1967, 1968, and 1969. The efforts required will be 
substantial in terms of direct funding, staff of Parks 
Canada and that of cooperating agencies. There are various 
trade-offs that can be accomplished among these, but there 
are significant economies in having the work proceed in the 
manner described. 


The demand study outlined can best be programmed in the 
following manner, with the professional staff requirements 
of Parks Canada and the direct outlay of funds which will be 
needed to carry out the individual tasks noted for each 
segment. These estimates of staff time and funds can vary 
greatly and should be regarded as very approximate at this 
stage of program planning. 


Tasks to be initiated in 1967 include the following: 


Task 1 - work on the preliminary recreation 
participation screening should begin in the fall 
of fiscal year 1967. It is anticipated that staff 
time required would be approximately one half 
month, the cost in this year, $15,000. 


Task 2 - the motivational study, should be 
initiated during this fiscal year. It is 
anticipated that this would involve one half 
month, and $15,000. 


Initial inventory coordination work should be 
begun. This would be in connection with Tasks 4, 
5, 6, and 7. The staff requirements here may be 
expected to be approximately two months, excluding 
work by the personnel at ARDA, and funds would be 
approximately $5,000. 


The review of user data and statistics, Task 9, 
should be carried out during this fiscal year. The 
time and cost requirements are anticipated to be 
three months and $2,000. 


The initial planning for user surveys should also 
be initiated during fiscal 1967. It is anticipated 
that this would take three man months of staff 
time, and approximately $9,000 in expenditures. 


The total staff time requirements for the projects 
which will need to be initiated during the current 
fiscal year are approximately nine man months; the 
total fund requirements are anticipated to be 
$46,000. 


The work to be undertaken in fiscal year 1968 includes the 
following: 


The motivational studies, Task 2, should be 
completed very early in fiscal 1968. This will 
involve one man month of staff time and $10,000. 


A household survey should be initiated during 
fiscal 1968. It is anticipated that the Canadian 
Wildlife Service, the Canadian Government Travel 
Bureau, Statistics Canada and perhaps others, 
would be involved in this effort. Staff time 
required of Parks Canada is anticipated to be four 
man months; expenditure of funds is expected to be 
approximately $50,000. 


The inventories should be carried along. Staff 
time required during this year is four man months; 
the expenditures needed may total $8,000. 


The user surveys, Tasks 10, 11, and 12, should be 
carried on in an extensive manner during fiscal 
1968. This will necessitate substantial staff time 
and includes expenditures for computer time, road 
counters, and supplemental field personnel. The 
staff time involvement is anticipated to be 15 
months, and the expenditure $60,000. 


The border exit survey should be carried out 
during fiscal 1968. This work, Task 13, will 
involve, in addition to the Canadian Government 
Travel Bureau people, one man month of time of 
Parks Canada, and perhaps $5,000 in funds. 


The analysis of participation data and user data, 
Task 14, should be well underway in fiscal 1968. 
This will require 13 man months of staff time and 
$25,000 in expenditures. 


Task 15, the formulation of use relationships, 
should be initiated during this period. This 
effort will require 2 man months and $3,000. 


The total staff time required in fiscal 1968 is 
approximately 40 man months; the total 
expenditures of funds, $161,000. 

The work in fiscal 1969 should include the following: 


The user surveys, including border crossing 
survey, should be continued. This will involve ten 
man months and $30,000 in funds. 


Inventory accumulations and refinements should be 
continued in 1969. This will involve three man 
months and $10,000 in expenditures. 


The analysis tasks should be well underway in 
fiscal 1969. These would include the analysis of 
participation supply data, Task 14, the analysis 
of visitor use data and formulation of use models, 
Task 15, and the assessment of the relative needs 
and demands, Task 16. This work will involve 22 
man months, and $30,000 in funds. 


The development of program strategies and policy 
implications will be carried on in fiscal 1969. 
This will involve five man months, and an 
expenditure of $20,000. 


The total for 1969 for the work anticipated would 
involve approximately 40 man months and 
expenditures of $90,000. 


The total expenditures anticipated in the initial three 
year effort in the series of demand studies would be $46,000 
in fiscal 1967, $161,000 in fiscal 1968, and $90,000 in 
fiscal 1969, or a total over the three year period of 
$297,000. 
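
The yearly and three year totals can be checked against the 
itemized figures given above. The short tabulation below is 
only a verification aid; the item labels are abbreviated 
paraphrases, while the staff months and dollar amounts are 
those stated in the text. 

```python
# Each entry: (item, staff months, dollars), as stated in the text.
PROGRAM = {
    1967: [
        ("Task 1 participation screening", 0.5, 15_000),
        ("Task 2 motivational study", 0.5, 15_000),
        ("Inventory coordination, Tasks 4-7", 2, 5_000),
        ("Task 9 review of user data", 3, 2_000),
        ("Planning for user surveys", 3, 9_000),
    ],
    1968: [
        ("Task 2 motivational study", 1, 10_000),
        ("Household survey", 4, 50_000),
        ("Inventories", 4, 8_000),
        ("User surveys, Tasks 10-12", 15, 60_000),
        ("Task 13 border exit survey", 1, 5_000),
        ("Task 14 analysis", 13, 25_000),
        ("Task 15 use relationships", 2, 3_000),
    ],
    1969: [
        ("User surveys incl. border survey", 10, 30_000),
        ("Inventory refinements", 3, 10_000),
        ("Analysis, Tasks 14-16", 22, 30_000),
        ("Program strategies and policy", 5, 20_000),
    ],
}

# Totals as stated in the text: (staff months, dollars).
STATED = {1967: (9, 46_000), 1968: (40, 161_000), 1969: (40, 90_000)}


def totals(items):
    """Return (total staff months, total dollars) for one fiscal year."""
    return (sum(m for _, m, _ in items), sum(d for _, _, d in items))


for year, items in PROGRAM.items():
    months, dollars = totals(items)
    agrees = (months, dollars) == STATED[year]
    print(year, months, dollars, "agrees" if agrees else "differs")

grand_total = sum(totals(items)[1] for items in PROGRAM.values())
print("three year total:", grand_total)
```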

The demand studies are designed for a continuing 
program of surveys, and continuing development of the 
analysis is anticipated. The subsequent cost might run 
approximately $30,000 for the surveys, and another $30,000 
for the analysis and development work, or something in the 
order of $60,000 in fiscal 1970. 

The program which is envisioned will require that Parks 
Canada acquire additional staff whose major responsibility 
is the coordination of the demand studies and major 
participation in the analytical work. It is highly 
desirable that the personnel undertaking this work have 
qualifications in analytical methods. This would include 
econometrics, statistics, and familiarity with computer 
techniques. It is anticipated that four people should have 
major assignments over this period of time to work on the 
demand study. 


CORD STUDY TECHNICAL NOTE 34 


INTRODUCTION 


This paper was prepared by Knetsch in 1969-1970. At 
that time, the Canadian Outdoor Recreation Demand Study 
Analysis Group within Parks Canada was being formed and such 
a paper was needed to give guidance in the analyses of the 
CORD Study data. Presenting this Working Paper in a 
slightly edited form affords researchers and research 
analysts opportunity to develop insights into how ideas have 
evolved and how quantitative advancements have been made. 

In the view of major contributors to the CORD Study the 
Working Paper was (and still is) a useful and provocative 
document for understanding the thrust of continuing research 
efforts. This edited version includes references to a 
number of Technical Notes. These Notes concern analysis 
efforts carried out long after the original paper was 
prepared. The references are offered to the reader as 
evidence of Knetsch's insight into the direction that 
analysis had to go. Certainly, if the reader accepts the 
evidence, the paper provides a fitting introduction to the 
rest of this volume. 


WORKING PAPER: OUTLINE OF DATA ANALYSIS 
ASSESSING OUTDOOR RECREATION DEMANDS 


J.L. Knetsch 
(with editing and updating by J. Beaman) 


ABSTRACT 


This paper was originally a series of notes on how the 
CORD Study research should proceed. It expands on an 
earlier paper by suggesting how to carry out tasks which 
were called: (14) Participation and Supply Analysis and 
(15) Use Prediction Analysis. 

Initial discussion of what is meant by outdoor 
recreation demands is presented to provide a rationale for 
further discussion. Some of the major analyses that are 
proposed are then outlined, using examples to illustrate the 
methods of analysis. As well the presentation contains: 
(1) a set of points on the use that can be made of the 
results of the investigations proposed, and (2) some 
indications of future research directions that might be 
considered when the main approaches suggested have been 
followed. 


THE NATURE OF DEMAND 


There is but little doubt that one of the most 
important and productive tools of economic analysis is the 
notion of demand. It can be of immense usefulness in 
determining consistent and rational outdoor recreation 
policies and in determining how we can best utilize and 
manage recreation resources. If demand relationships can be 
understood with even a modicum of success, various kinds of 
productive analyses are possible that provide at least 
partial answers to many difficult problems. This is not to 
suggest, as was pointed out in the original design paper, 
that demand analyses can provide clear-cut answers to 
planning problems. They cannot. Rather, it is to suggest 
that demand analyses can provide important information 
useful for planning and policy decisions. 

There has been much concern with the trends of 
recreation use of various facilities. In part, these reflect 
increases in the demand for outdoor recreation facilities 
and areas. There has been considerable ambiguity connected 
with the usual statements used in explanation of these. 
Further, knowledge of these trends does not solve the 
problem of how best to accommodate the demands for outdoor 
recreation. In fact, parochial recitation of the number of 
recreation visits accommodated each year clouds the hard 
issues of how much recreation, and of what sort, should be 
provided. (For trends, see TN 13 and TN 22.) 

There is further confusion between the statements of 
recreation consumption (which are the number of visitors we 
observe) and the demand for a commodity. In a similar vein 
there is often confusion between the need for commodities or 
services such as outdoor recreation and the demand for it. 
Need, as often expressed, designates a capacity of some sort 
judged to be adequate to accommodate some forecast for a 
given target year. Often funds are appropriated to 
individual jurisdictions in accordance with such estimated 
needs. However, this can have a perverse influence on the 
rational provision of outdoor recreation facilities. Demand 
for, or more correctly the use of, a facility is often far 
more a function of its availability. Increasing the 
opportunities, therefore, may merely stimulate more of the 
same type of facility use. 

This type of difficulty pervades not only recreation 
planning but other public expenditure categories as well. An 
example of equating need with use forecasts is that of the 
highway program in the U.S., where a highway travel report 
estimated that there is a "need" to devote $320 billion 
to road construction over the next 15 years. It 
would have been more useful to ask what parts of this total 
highway "need" offer a return sufficient to justify 
expenditures of public funds. In other words, with a clear 
determination of the demand, one could make a more rational 
estimate of the different consequences of alternative 
actions or levels of spending - far more useful information 
on which to base highway plans. 

The most important characteristic of demand, and how it 
is to be used in recreation planning, is that it is a 
conditional or functional statement. That is, it is not an 
absolute number but a magnitude dependent upon the levels of 
other factors. In general, demand for any commodity springs 
from the notion that a consumer making a purchase does so 
because he feels that it will serve some real or fancied 
use. Consumers express their wants and desires by expressing 
different demands for different commodities. The consumer 
often desires more of many goods and services, but is 
constrained by income levels. Consequently, his demands for 
different commodities express relative preferences among 
commodities, given income constraints. In an economic system 
based on market forces, knowledge of these demands serves to 
inform producers of the relative quantities of different 
commodities that can be profitably marketed. 

The demand for commodities is the amount which 
purchasers choose to buy. However, this amount is determined 
by factors that limit demand. For most goods and services, 
the lower the price the more will be demanded by any single 
individual and by the market - the collection of all 
individuals. 

The income of individuals is also a factor. Normally, 
and there are exceptions to this, the higher an individual's 
income the greater the quantity of individual goods and 
services he will demand. A third factor is the price of 
closely related commodities, the substitutes and complements 
of the goods in question. The demand for some goods can be 
highly sensitive to changes in the price or the availability 
of other goods. In the case of others, the demand is nearly 
invariant. Necessities characterize the latter group. 

A fourth factor that must be considered is the tastes 
and preferences of the individuals, in other words how they 
make up their minds, their attitudes, their adherence to 
fashions, customs and so forth. Tastes and preferences are 
usually taken as "given" in economic analyses with a focus 
on the consequences of economic factors that influence the 
demand for any given commodity. The concern is with HOW 
consumers make up their minds in terms of the quantities 
which they demand of different goods and services, not with 
WHY and how they make decisions. With most analyses there is 
no particular interest in explaining choices, but merely in 
determining what the reaction is and how it will change, 
depending upon change in economic factors of price, the 
availability or price of substitutes, and incomes. Consumers 
make choices among different commodities and when 
circumstances change, their choices reflect this. The 
patterns of choice are particularly apparent where consumers 
are viewed as groups, minimizing the influence of individual 
deviations. 

In the case of outdoor recreation, the principles just 
outlined still apply. The difference has to do with the 
individual conditions. Tastes and preferences of 
individuals are a factor with regard to recreation just as 
they are with regard to the demand for any commodity. The 
income of individuals is similarly a factor. Higher incomes 
usually result in greater demands for most recreation 
services, although here again there are exceptions and these 
may be important to know about. Skiing is undoubtedly highly 
responsive to income changes; hunting is likely to be far 
less so, and its response may even be negative. Regarding 
alternative commodities, the relative availability and 
accessibility of substitute recreation areas or other 
substitute demands for time and money have an effect on the 
demand for any given recreation site or any recreation 
opportunity. 

It is in the area of price that the demands for outdoor 
recreation and the demand formulations for most commodities 
differ. However, it is a difference in particulars and not a 
difference in principles. The role of price vis-a-vis 
recreation is usually covert. For a variety of good reasons 
we generally make outdoor recreation opportunities available 
publicly, outside the market conditions. Although the 
private market does exist, it usually does so for a fairly 
small range of recreation facilities and services, with the 
bulk being provided publicly. 

Fees and charges, even though used to a certain extent 
in public areas, are not normally prices which serve the 
usual market function of allocations. Instead, in recreation 
the price variable (in terms of a demand analysis) largely 
pertains to the relative availability or accessibility of 
recreation opportunities. This availability is often 
measured in terms of locational proximity although other 
measures or determinants of availability exist as well. (See 
TIN SPY TNT s2TN 268) 

Thus when one speaks of the demand for outdoor 
recreation, the statement is one of "condition" and is, in 
principle, similar to statements about scarce commodities 
generally. The factors determining them remain the same in 
principle. It is only in the particulars that the 
formulations differ. The role of proximity to "need" is far 
more important than for most consumables. This partially 
stems from the non-market nature of the demands. In part, 
however, it also stems from the immobility of most resources 
used for recreation purposes. This is a characteristic 
somewhat peculiar to recreation. Recreation activities are 
consumed at the site rather than purchased and consumed more 
ubiquitously. 


AGGREGATE DEMAND ANALYSIS 

Given the general nature of demand, and in the specific 
context of outdoor recreation, a series of what might be 
termed aggregate demand studies can provide important 
insight into the characteristics of the demand for various 
kinds of outdoor recreation services. The emphasis of such 
studies is upon the determination and explanation of the 
variation in the amount of outdoor recreation demanded by 
different segments of the population. (See TN 6, TN 12, 
Loy TNyv + 2045. 2N 7295 TN 33, and TN J6.) Such studies differ 
from investigations concerned more with the demand for a 
certain outdoor recreation facility, or a given recreation 
area such as a park. (See TN 1, TN 4, TN 7, TN 8, TN 11, TN 
14, TN 18, TN 19, TN 30, TN 31, TN 33, TN 34, TN 35, and TN 
38.) Studies also focus on allocation or setting up economic 
accounts. (See TN 5, TN 17, TN 25, TN 26, TN 32, TN 39, TN 
40, TN 41.) 

The major intent of aggregate studies is to relate 
participation in the various forms of outdoor recreation to 
a range of socio-economic variables and supply 
characteristics. The basic analyses consist of obtaining 
data on participation of different segments of the 
population in the different forms of outdoor recreation and 
then attempting to explain these variations in participation 
rates in terms of both the different supply characteristics 
facing these different individuals and their differing 
socio-economic attributes. The underlying notion is that 
the rates at which individuals in the population participate 
in various forms of recreation are dependent, first of all, 
upon such things as their income levels, their education and 
so forth, and also upon the availability of opportunities of 
various kinds. For such analysis to be meaningful, 
information is needed on a cross-section of the population 
to identify the characteristics of the individuals and the 
rates at which they participate in different recreation 
activities, as well as something of the different 
opportunities that may confront them. For example, one 
normally would expect to find that the rate at which 
different members of the population participate in skiing is 
somewhat related to their income levels. This is explained 
by the necessity to purchase equipment and other expenses 
attendant on this type of activity. So normally we would 
find that higher income levels are associated with higher 
rates of skiing activity. Similarly, we would also expect to 
find that the visits to (for example) National Parks are 
dependent upon the distance of different individuals from 
such areas. In aggregate demand studies it is the 
characterization of these relationships which is of primary 
interest. 

These studies are useful in examining the population as 
a whole and the variation which this population exhibits 
with respect to its outdoor recreation rates among 
populations living in different parts of the country; in 
rural areas versus large metropolitan areas; and between 
high incomes and low incomes. While socio-economic 
characteristics may be predetermined and fixed for most 
planning questions, supply characteristics are not, nor are 
they policy determinations for providing outdoor recreation 
in the future. Participation information can be expected to 
be useful for deciding a number of broad planning and policy 
questions. For example, if it is found that individuals of 
given locational or socio-economic characteristics are 
participating at far lower levels in certain activities than 
others, there may well be an effort made to make facilities 
more available to them, assuming the lower rate does not 
reflect lower preference for that activity. (On the subject 
of CORD Study supply data, see the CORD Study Data 
Documentation.) 


DEVELOPMENT OF DEMAND ANALYSIS INQUIRIES 


Demand analysis can proceed on two basic fronts. In the 
first, the emphasis is placed on fairly simple and 
straightforward tabulations and cross-tabulations of 
participation in various forms of outdoor recreation and the 
different characteristics of the population. Some of this 
has been done in the case of the original report from 
Canadian Facts. From this series of tabulations a number of 
gross indications of differences in participation rates 
become evident. In many cases the explanations may be 
fairly readily understandable, while in others they will be 
less so. In part they may be attributable to differences in 
characteristics of populations and in part they are due to 
differences in supply availability. Large differences will 
remain as yet fairly indeterminate. Still, it is useful to 
know, for example, the participation rates in large cities 
as compared with participation rates in small cities and in 
rural areas, and also the variation in participation in 
different activities in different provinces. (See the 
discussion of the CORD Study National Surveys in Volume III: 
Data Collection and Documentation.) 

A second type of analysis relates the differences in 
participation to various associated factors in a more 
systematic and analytical manner. The best example of such a 
study as of 1969 was that reported by Cicchetti, Seneca and 
Davidson, hereafter referred to as the Rutgers' Study. For 
elaboration of that work one may see Cicchetti's new book. 
The data of the Rutgers' Study were obtained from a most 
ambitious data collection program. Unfortunately, the 
scarcity of adequate supply data and the meagre ability to 
formulate meaningful supply variables proved a severe 
deficiency. The supply information used was very crude. 
Nonetheless the study did utilize several supply variables 
which provided meaningful answers and, perhaps more 
importantly, provided a demonstration of a methodology that 
could be applied to the CORD Study program. The Rutgers' 
Study set out to explain participation rates using a 
two-step procedure. First, a "conditional probability" for 
participation in a specific activity was developed in 
relation to socio-economic and supply variables. (See TN 12 
and TN 2% for similar CORD Study analysis.) The aim was to 
determine if a respondent was a participant or not; that 
is, whether he had, during the course of the year preceding, 
engaged in the particular outdoor recreation activity for 
which a predictive equation was developed. There was no 
attempt to indicate whether or not a single individual would 
participate in a given recreation activity; instead the 
focus was on the probability of his being a participant 
dependent upon the level of the explanatory variables. 

Data were readily at hand for the socio-economic 
variables. For the supply variables, it was necessary to 
depend on a crude set of variables obtained from available 
'inventory work'. Some general inventory results were the 
supply information for the empirical work. There were some 
data on the number of facilities of various kinds by county 
throughout the U.S. To use these data, respondents were 
identified by county of residence and thus inventory data 
were linked with the county of the respondents. This type of 
supply variable included: distance from large bodies of 
water (by 50 and 100 mile groupings), recreation land 
acreage, wetland acres, water acres, water acres by class, 
and acreage of various Bureau of Outdoor Recreation 
classified lands. 

There were also data on facilities on a state-by-state 
basis. These proved to be less useful than those by county, 
but some of these data were used. They included the number 
of swimming pools, the number of sports establishments, the 
number of golf courses and numbers of other commercial 
establishments. 

In the Rutgers' Study, the probability of a participant 
engaging in swimming was found to be dependent upon the 
following variables in a statistically significant way: 


age of the respondent, race, sex, size of the 
population unit in which the respondent usually 
resides, education level, family income, owner or 
renter of the domicile of usual residence, size of 
the family, number of children from ages 6 to 11, 
region of the country, occupation of respondent, 
class of work of the sample person (self-employed 
or not), occupation status (blue collar, white 
collar), population of the sample unit, population 
of the state, distance from the ocean or Great 
Lakes, receipts of recreation and amusement 
establishments in the area in which the respondent 
lived, number of commercial recreation facilities, 
population density, median income of the state. 


If all members of the population were faced with 
approximately equal supply configurations, supply variables 
would not be expected to be significant. However, this is 
not likely to be the case for most forms of recreation. For 
those in which access to an opportunity is an important 
determinant, and for which the spread of the opportunities 
is uneven, supply variables take on increased importance. 
(See TN 29.) 

Having determined the conditional probability of 
participation, the analysis then proceeded to a second step 
of determining the degree of participation for all of those 
individuals previously identified as being participants in a 
particular activity. That is, given that their activity 
level is greater than zero, all respondents qualifying on 
these grounds were then subjected to a separate analysis in 
which the level of participation was analyzed. A series of 
variables was related to different participation levels and 
an attempt was made to establish significant relationships 
between these variables and variations in participation 
levels. 

This two-step method proved to be a very useful one. It 
overcomes many of the problems that occur when there are a 
large number of non-participants in an activity. 
Furthermore, it allows for different equations to be 
calculated to explain different levels of participation. It 
would be expected, for example, that different variables 
would explain whether a person is a participant or not and 
determine the level at which a participant engaged in an 
activity. Not only would there be different variables that 
would likely be important, but also the relative importance 
of variables that were important in both cases should 
differ. There were various difficulties encountered with 
explanation of the sign of individual variables, whether a 
factor contributed positively or negatively, but on the 
whole the analysis proceeded in a fairly straightforward 
manner. 
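
The two-step logic can be illustrated with a minimal sketch. It is not the Rutgers' Study procedure itself: that study fitted regression equations at each step, whereas the sketch below simply uses group frequencies and group means, and the income groups and activity-day figures are hypothetical. Step one estimates the probability of being a participant; step two estimates the level of activity among participants only; the product gives expected activity-days per person.

```python
from collections import defaultdict

def two_step_estimate(records):
    """Two-step estimate per group from (group, activity_days) records.

    Step 1: share of respondents with activity_days > 0 (participation).
    Step 2: mean activity_days among participants only (level of use).
    Returns {group: (probability, level, expected_days_per_person)}.
    """
    respondents = defaultdict(int)
    participants = defaultdict(int)
    participant_days = defaultdict(float)
    for group, days in records:
        respondents[group] += 1
        if days > 0:
            participants[group] += 1
            participant_days[group] += days
    result = {}
    for group, n in respondents.items():
        probability = participants[group] / n
        # Level of use is undefined without participants; report 0.0 then.
        level = (participant_days[group] / participants[group]
                 if participants[group] else 0.0)
        result[group] = (probability, level, probability * level)
    return result

# Hypothetical survey records: (income group, swimming days last year).
sample = [("low", 0), ("low", 0), ("low", 4), ("low", 0),
          ("high", 6), ("high", 0), ("high", 10), ("high", 2)]
estimates = two_step_estimate(sample)
for group, (p, level, expected) in estimates.items():
    print(group, p, level, expected)
```

Keeping the two steps separate means the many zero-activity respondents affect only the participation estimate and do not pull down the estimated level of use among those who do participate.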

The Rutgers' Study was thus successful in establishing 
a useful methodology. It was also useful in pointing out a 
number of empirical findings that will be of great interest 
to planners dealing with recreation problems, although the 
degree of explanation was in some cases disappointing. 
Future analysis attempts should prove to be more successful. 
For all of the different recreation activities investigated 
such factors as age, income levels, race, sex, education, 
home ownership, family size and composition, region of the 
country, occupational class, population size in the local 
area, and distance from major recreation opportunities were 
found to be the most significant. (For similar results, see 
the Appendix to TN 12, available through the LEISURE STUDIES 
DATA BANK, Waterloo Research Institute.) 


Given some quantification of the relationship between 
participation in the various recreation activities and 
various factors that are found to be associated with them, 
projections of future recreation demands are possible. Such 
estimates can be made from projections of future populations 
and their socio-economic characteristics, given some 
assumption concerning the future supply configuration. The 
most useful of such projections would be those obtained 
under different levels of future supplies of recreation 
facilities. Comparisons of future activity under different 
assumed supply conditions would give some quantitative 
information to policy planners in assessing current 
development and also in deciding among levels and types of 
future expansion. (See TN 2, TN 14, and TN 29 in 
particular.) 


In the Rutgers' Study, two levels of future demands 
were used to illustrate the methods and results. The first 
assumed a future pace of expansion of supply of recreation 
opportunities comparable to current allocation. The second 
assumed a more ambitious rate. For example, the swimming 
equations for the northeast region of the United States 
yielded the following forecast. It was estimated that in 
1965, the year of the survey, approximately 350-million 
activity-days of swimming occurred. Under the first 
assumption of more limited future recreation development, 
the projected activity-days of swimming in the region (given 
the projected population and other characteristics of that 
population) are 450-million days in 1980 and 740-million 
days in 2000. Under the more expansionist assumption for 
future development, the projections are for 630-million days 
in 1980 and 890-million days in 2000. 
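
One way to read these figures is to convert them to implied average annual growth rates and to the difference attributable to the supply assumption. The short sketch below does only that arithmetic on the numbers quoted above; the scenario labels "limited" and "expansive" and the compound-growth framing are conveniences of this illustration, not part of the Rutgers' Study.

```python
def annual_growth_rate(start_value, end_value, years):
    """Compound annual growth rate implied by two values `years` apart."""
    return (end_value / start_value) ** (1.0 / years) - 1.0

BASE_YEAR = 1965
BASE_DAYS = 350.0  # million activity-days of swimming, survey year

# Projected million activity-days under the two supply assumptions.
projections = {
    "limited":   {1980: 450.0, 2000: 740.0},
    "expansive": {1980: 630.0, 2000: 890.0},
}

growth = {
    scenario: {
        year: annual_growth_rate(BASE_DAYS, days, year - BASE_YEAR)
        for year, days in by_year.items()
    }
    for scenario, by_year in projections.items()
}

# Extra activity-days associated with the more ambitious supply assumption.
supply_effect = {
    year: projections["expansive"][year] - projections["limited"][year]
    for year in (1980, 2000)
}

for scenario, rates in growth.items():
    for year, rate in rates.items():
        print(f"{scenario:9s} {BASE_YEAR}-{year}: {rate:.1%} per year")
print("difference between assumptions (million days):", supply_effect)
```

The comparison across supply assumptions, rather than either projection taken alone, is the quantity of most direct interest to the policy planner.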

Needless to say, the quality of these projections is 
dependent upon the quality of the data and on their 
analysis. (See TN 6, TN 20, and TN 35 on accuracy and other 
problems.) In the case of the Rutgers' Study there were well 
recognized limits, particularly with respect to the 
quantification of the supply variables. The deficiency in 
characterization of supply is particularly troublesome when, 
for most activities, the demonstrated importance of the 
strong effect of proximity to facilities is considered. The 
shortcoming is particularly important when attempting to 
discern differences in future demands, as they may vary 
depending upon the levels of provision that are made 
available, in particular regarding supply measurement in 
relation to facility provision and proximity. (See TN 5, TN 
16 and TN 17.) 

There are, of course, certain limits to the usefulness 
of the analysis in a study such as the Rutgers' effort. 
However, their general procedures and analyses are well 
worth imitation. There is much insight to be gained from 
such a study in terms of both an indication of future 
demands and in a fuller appreciation of the various factors 
that influence recreation behaviour among the different 
segments of the population. (For an integrated CORD Study 
perspective, see TN 33.) 


SITE DEMAND STUDIES 


There are a number of policy, planning and, to some 
extent, management questions on which the more aggregate 
demand studies can provide some useful insight. There are 
others, however, on which guidance related to more specific 
sites or to alternative areas is desirable. For example, if 
more efficient and orderly development of future recreation 
areas is to be attained, guidelines will need to be 
developed to predict the likely use of alternative locations 
of facilities and types of development. (See TN 3 on 
alternatives and TN 33 on the integration of a Rutgers'-type 
model with a destination area oriented model.) Estimates of 
the amount of recreation use that can be expected at a 
particular site, or alternative sites (given proposed or 
known characteristics of the users), are necessary. The 
estimated use figures should reflect the effect of planning 
variables such as the location of individual sites, the type 
of development carried out, and the competing opportunities 
available to the relevant populations as well as something 
of the socio-economic characteristics of these populations. 

The estimation procedures used to project recreation 
visits or levels of use of a given recreation area are based 
on quantifying the relationship between observed use of 
recreation sites and the various factors that influence this 
use. The suggested procedure makes use of visitation data 
from a number of sites, for example a series of parks from a 
provincial system. Using fairly standard and straightforward 
multiple regression methods, one may determine which of a 
number of different variables are significant determinants 
of facility use and their contribution or weight in 
determining this use. (For some considerations see TN 19 and 
TN 35 on the need to use other methods than "simple" 
regression.) These factors would normally be expected to 
include such things as the distance of the site from 
population centres of various sizes, possibly a number of 
socio-economic characteristics of these populations, some 
characteristics of the sites, and the alternative recreation 
opportunities that may also be available to the users and 
potential users. 

It would be expected, for example, that all other 
things being equal, a park nearer to a population centre 
would draw more visitors than one which is more distant. In 
such a case it is the negative effect of distance that 
brings about the difference in the attendance between the 
two. With observations on park use from a number of such 
sites, a pattern would normally prevail in which the use 
levels would show a fairly consistent reduction with 
increased distances. If the distance effect is significant, 
as one would normally expect it to be, the degree of 
importance, in terms of how much reduction would be expected 
per mile or per hour, can be estimated. (On the effect of 
distance, see TN 14.) 

If there were only the distance factor to consider in 
making use estimates for a new park, they could be made 
graphically by plotting the observed points, drawing a 
curve, and reading the use of a new park off the resulting 
graph. Alternatively, when more variables are involved, much 
the same thing can be done by statistically estimating the 
coefficients or parameters of an equation that can be 
employed to predict use. 
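
For the single-variable case, the statistical counterpart of "drawing a curve" is an ordinary least-squares line. The sketch below is only illustrative: the distances and visit rates are hypothetical, a straight line is assumed for simplicity (a curve or a transformed variable may fit real data better), and with several explanatory variables one would turn to multiple regression instead.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope

# Hypothetical observations from five existing parks:
# road miles to the population centre, visits per 1000 residents.
distance_miles = [10, 20, 30, 40, 50]
visits_per_1000 = [95.0, 70.0, 55.0, 30.0, 10.0]

intercept, slope = fit_line(distance_miles, visits_per_1000)

# "Reading off the graph" for a proposed park 25 miles away.
predicted_new_park = intercept + slope * 25
print(f"reduction per mile: {-slope:.2f} visits per 1000 residents")
print(f"predicted rate at 25 miles: {predicted_new_park:.1f}")
```

The slope is the estimated reduction in use per additional mile, which is the quantity the text describes as the degree of importance of distance.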

The general procedures can be illustrated by an 
example. The United States Army Corps of Engineers operates 
a series of reservoirs throughout the United States, many of 
which receive extensive recreation use. Consider seven 
reservoirs in the Sacramento district of the Corps for which 
these use data were collected over four years, 1966 through 
1969. During user surveys, interviews were conducted with 
approximately 40,000 visitors on their party's visit to 
these sites. While the data were collected on both day-users 
and overnight visitors, analysis was carried out only for 
day-users, that is, people visiting the site but not 
remaining overnight. 

The origins of the visitors were classified into area 
groupings based primarily on county or parts of county 
units. Incidentally, some earlier analyses have used 
concentric rings around sites as the unit of observation. 
The county units, however, while introducing more variations 
in the data, allow more meaningful locationally specific 
variables to be measured. As distance from the reservoir 
increased, the size of the origin area unit used in the 
analysis was also increased (see TN 1 where this analysis 
procedure was followed very closely). The number of areal 
units that were taken as being the points of origin for each 
of the reservoirs varied from 22 for Isabella reservoir to 
26 for Pine Lake and Success reservoirs. In all, a total of 
168 observations of origin to lake destination flows were 
used in the analysis of the visitor characteristics for the 
seven reservoirs. 

A number of independent variables were used in attempts 
to explain the variation in the observed visit numbers from 
each of the origins to the lakes. One variable included was 
the road miles from the areal unit of origin to a reservoir 
of destination for a visit. The origin was taken to be the 
population centroid of the "general" origin area. It was, of 
course, expected that, as this distance became larger, the 
number of recreators making the visit to the particular site 
would decrease. A second variable was the population of the 
origin units. No data were collected on the incomes of the 
visitors but average per capita incomes were a variable 
defined for each origin area. 

The size of the reservoir measured in acres of the 
recreation pool was also a variable used in the analysis. A 
size indication was deemed necessary to account for the 
increased capacity of the different reservoirs. This was 
also considered a crude measure of the attractiveness of a 
recreation site. (See TN 9 and other work cited there.) A 
number of alternative size or capacity variables were 
included in the analysis but ultimately only gross acreage 
was kept because it proved to be as good as any other 
transformed acreage variable in this case. 


Another variable was included to account for the 
alternative recreation opportunities of a similar nature 
available to residents of the various population origin 
areas. It would be expected that if readily available 
substitute areas were available to a population in a given 
area, fewer visitors would be inclined to make the trip to 
the reservoir in question than if substitutes were not 
available. (See TN 1, TN Go TN 11 and TN 33; also, on 
substitutability, see TN 32 and TN 37.) The variable to 
measure is one that accounts for other lakes and reservoirs, 
with the value being larger as the number of lakes available 
to a population increased and larger with their proximity. 


The equation which was finally derived based on the 
observed visit patterns was as follows: 


V(i,j) = -3303 + (P(i)/D(i,j)) (2005 - (024) (W(j))**1/2) 

         + .0037 W(j) + 28.79/A(i) 


WHERE 
V(i,j) = number of recreationists going from population 
         centre i to lake j, 

P(i)   = the population in the areal unit i, 

D(i,j) = distance between the population centres and the 
         reservoirs, 

W(j)   = size of the recreation pool in the individual 
         reservoirs, and 

A(i)   = measure of alternative lakes and reservoirs 
         available to the different populations. 
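
The form of this equation can be written as a small function. The sketch below reproduces only the structure as it is read here (a constant, a population-over-distance term whose weight declines with the square root of pool size, a pool-size term, and a term in the reciprocal of the alternatives measure). The coefficient names b0 to b4 are introduced for this illustration, and the coefficient values are deliberately left as arguments because the printed values cannot be read reliably from the scanned text; the values used in the example call are placeholders, not the study's estimates.

```python
import math

def predicted_visits(population, distance, pool_acres, alternatives,
                     b0, b1, b2, b3, b4):
    """Visits from origin i to lake j under the functional form

        V = b0 + (P / D) * (b1 - b2 * sqrt(W)) + b3 * W + b4 / A

    population   -- P(i), population of the origin areal unit
    distance     -- D(i,j), road miles from origin to reservoir
    pool_acres   -- W(j), size of the recreation pool in acres
    alternatives -- A(i), measure of alternative lakes available
    b0..b4       -- coefficients, to be supplied by the analyst
    """
    gravity_term = (population / distance) * (b1 - b2 * math.sqrt(pool_acres))
    return b0 + gravity_term + b3 * pool_acres + b4 / alternatives

# Placeholder coefficients chosen only to exercise the function.
example = predicted_visits(population=100.0, distance=10.0,
                           pool_acres=4.0, alternatives=3.0,
                           b0=-5.0, b1=2.0, b2=0.5, b3=0.1, b4=3.0)
print(example)
```

Written this way, the roles of the planning variables are explicit: predicted use falls as distance rises and as the alternatives measure rises, for any positive b1 - b2 * sqrt(W) and positive b4.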


The variation in average income among the area units 
proved insufficient to obtain a reliable estimate of its 
coefficient, so this variable was dropped from the analysis. 
Had income data on the visitors been available it might have 
been possible to relate the number of visitors from each of 
the counties to each of the lakes by income. 

Nevertheless, all the individual coefficients are 
significant at the one percent level. This expression 
accounts for or "explains" approximately 94 percent of the 
variation in visitor totals among the observations. That is, 
of the variations in the individual visit totals among the 
168 observations, all but about six percent is "explained" 
by this equation. (On the meaning of R² and how regressions 
should be carried out, see TN 19 and TN 35.) 
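
The "percent of variation explained" is the coefficient of determination: one minus the ratio of the squared prediction errors to the squared deviations of the observations from their mean. A minimal sketch of the calculation follows; the observed and predicted values are hypothetical and are not taken from the reservoir data.

```python
def r_squared(observed, predicted):
    """Share of the variation in `observed` accounted for by `predicted`."""
    mean_observed = sum(observed) / len(observed)
    total_variation = sum((o - mean_observed) ** 2 for o in observed)
    unexplained = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    return 1.0 - unexplained / total_variation

# Hypothetical visit totals and the values an equation predicts for them.
observed_visits = [10.0, 20.0, 30.0, 40.0]
predicted_visits_example = [12.0, 18.0, 31.0, 39.0]

share_explained = r_squared(observed_visits, predicted_visits_example)
print(f"explained: {share_explained:.0%}, unexplained: {1 - share_explained:.0%}")
```

A value near 0.94 for the reservoir equation would therefore mean that the squared prediction errors amount to about six percent of the total squared variation in the 168 observed visit totals.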

In the case of the reservoir study the emphasis was on 
predicting use for a new reservoir. Because of the nature of 
the existing lakes and the anticipated nature of any new 
reservoirs, there was little need to concentrate undue 
attention on individual lake characteristics. Undoubtedly 
the lakes differ somewhat but, as in the case of most 
recreation areas, to predict their use well, more detail (or 
rather how they differ in terms of specifics) would no doubt 
be required. Ultimately, this must be subjected to empirical 
investigation. (On how attractivity relates to park 
characteristics, see TN 4.) 

While distances were measured in road miles in this 
study, improvements may have been possible with the use of 
travel time. It may also be noted that the reservoir model 
was for one type of recreator, the day visitor. Analysis of 
visit relationships for overnight visitors would need to be 
done in an analogous manner, but separately. It would be 
expected that different variables might well be important 
and that even the coefficients or parameters of the same 
variables would differ: the effects of distance on the 
camper will not be the same as they are on the day-use 
visitor. In the case of the Canadian parks it would seem 
reasonable that a similar separation of visitor types should 
be made for analysis purposes. (See TN 30, also TN 18 on 
enroute visitors.) 

In these cases, there is the further matter of 
considering substitute or alternative areas. In the 
reservoir study, only other reservoirs and lakes were 
considered as substitutes, as indeed they no doubt are. 
However, other areas may also be important in this regard. 
Proximity to the ocean or to the mountains or to Las Vegas 
might have been tested. In regard to provincial parks there 
is usually a range of areas that should be similarly tested. 
(See TN 37.) 




The range of consideration in demand analysis is wide. 
In some useful studies the variables are few and the models 
simple: this was true for the reservoir study but less true 
for the Rutgers' Study. Ultimately, a far greater number of 
concerns will need to be dealt with effectively. Income and 
population changes are certainly important in some studies 
and here more information is desirable on how use varies 
with these factors. (See TN 12 and TN 13.) How attendance 
figures may be expected to change over time may be important. 
(See TN 7.) Simple formulations may be used to answer fairly 
broad questions, but detail is necessary to provide guidance 
for more precise planning. In this regard an analysis is 
largely open-ended. 

It is the last point that best typifies the type of 
analyses being suggested throughout. It is not really that 
it is never ending, but rather that different kinds and 
depths of analyses may be used depending upon the problems 
being confronted. And, of course, initial analyses will 
suggest further questions. While unexplained variance or 
(more euphemistically) 'unknowns' will persist and call for 
judgement based on the experience of planners, helpful 
guidance stemming from such quantitative study is a 
certainty. 


CHAPTER II 


DESTINATION MODELS 


Introduction 


The first analysis effort conducted within the CORD 
Study was the development of destination models. Cheung's 
work on a model of visitor flows for the day-use of parks 
began in 1970. His model is presented in TN 1, A MODEL FOR 
ESTIMATING DAY-USE OF PARKS. TN 1 was important in the 
development of TN 7, AN APPLICATION OF MATHEMATICAL MODELS 
TO COMPARE TWO POTENTIAL PARK SITES. 


Concern with overnight-use of parks arose early in the 
CORD Study. Shortly after Cheung began work on a day-use 
model, it was recognized that if CORD Study analyses were to 
be of broad importance in Canadian outdoor recreation 
planning, it would be necessary to develop models that would 
allow the prediction of the volume of a number of types of 
recreation use of parks, including overnight-use. For this 
reason, a contract was initiated to develop a work plan for 
a mathematical model to predict overnight-use of parks. 
Unfortunately, this work did not proceed beyond the stage of 
model development. Problems with the CORD Study Park User 
Surveys prevented the work plan from being initiated. 
Nevertheless, CORD Study TN 30 presents ideas on 1) the need 
to disaggregate visitors into various classes, and 2) the 
kinds of models that are appropriate for different classes 
of visitors. 

At about the same time that work on TN 30 was
initiated, a proposal was received to develop a model for
estimating park attractiveness and population centre
emissiveness. Cesario proposed the development of a new
modelling concept that appeared to have great (but unproven)
potential in the analysis of certain recreation travel
problems. When Cesario sought support to develop the model,
it was thought that the CORD Study Park User Survey data
would not be satisfactory for building the proposed model.
In fact, because of data problems, the initial proposal was
held in abeyance. But eventually the development of an
attractiveness-emissiveness model was pursued as a CORD
Study project and resulted in TN 4. The Cesario modelling
project made little use of CORD Study data. Instead,
accurate data on camping were obtained from the province of
Ontario.


By its very success, the Cesario modelling effort
raised unexpected questions that became the focus of concern
in other Notes. Specifically, Cesario defines what he calls
the attractiveness of parks and the emissiveness of origins.
These concepts give rise to questions about what
attractiveness and emissiveness really are. Attractiveness
is discussed in Chapter III. One issue regarding Cesario's
emissiveness is whether it is a property of a city, or
whether it is also related to the configuration of supply
that occurs around the city. TN 11, which was one of the
last Notes prepared, presents proof that Cesario's
emissiveness is not a simple concept. Rather, it is
suggested that the emissiveness of a city is generated by
its inherent emissiveness and the effect of alternative
sites around it. The last Note prepared, Technical Note 33
(Chapter IX), also deals with emissiveness and pursues the
meaning of attractiveness in the Cesario model. It also
shows how a generalized Cesario model "integrates" much of
the work presented in this volume.
CORD STUDY TECHNICAL NOTE 1


A MODEL FOR ESTIMATING DAY-USE OF PARKS 


H. K. Cheung


ABSTRACT 


One aspect of rational park planning is developing
site-specific models that give insights into the structure
of relations that determine participation at a given park
which is part of a system. The research reported on was
carried out to obtain a conceptually and structurally more
meaningful site-specific model (often called travel model in
the literature). The model developed is site-specific in
that it deals directly with use of twelve individual sites
in Saskatchewan. Alternative recreation opportunities and
park attractivity are related to quantitatively defined
explanatory variables considered in formulating an
appropriate estimation framework.

The alternative factor used is defined in terms of the
location and the number of alternative sites available to
the visitors to a particular site, while attractivity is
expressed as a function of the physical characteristics of
the facilities and the services offered at the site visited
by a party from which information was obtained.


The data for this study were collected as part of the
CORD Study 1969 Park User Survey. The paper does not
provide details about the survey but documentation on it is
available elsewhere.

Multiple step-wise regression was used to derive a
relationship between total season attendance figures (the
dependent variable) and the explanatory variables:
population, distance, alternative recreation opportunities
and attractivity. The results indicate that a particular
combination of the variables population and distance
explains a large amount of the variance in the Saskatchewan
day-use data for the 12 parks considered.

An application of the model to determine a desirable
location for a new park is given as an illustration of the
usefulness of the model to planners and managers.


INTRODUCTION


During the past decade many investigations have been
conducted into the use of outdoor recreation facilities to
determine a quantitative relationship between park use and
its key determinants. Much work has been done on determining
functions that best express the "inhibiting force" of
distance and travel time. In their reservoir studies, Ullman
and Volk (see Reference 35) found that per capita
participation was related to distance raised to a power
ranging from two to four. Other references to the
investigation of the effect of distance on trip movement can
be found in References 21 and 33-6 

Studies considering the effect of intervening
opportunities (for original formulations of the theory see
References 33 and 34) and site attractiveness are
relatively rare. In the Texas reservoir study a number of
explanatory variables, including a "gravity" variable, were
incorporated into a double logarithmic regression equation
and were found statistically significant in explaining the
number of visitor days from a county of origin to a
reservoir per unit time. The gravity variable, defined as
the common logarithm of reservoir size divided by the
distance between the reservoir and an origin, was designed
to reflect the competitive effect of other reservoirs
located within a 100 mile radius of the population center of
each county. A modification of this approach was adopted for
the model that is presented here.

Clawson and Knetsch (see Reference 11) suggested the
possibility of developing specific, and rather objective,
rating scales to measure the attractiveness of outdoor
recreation areas. Van Doren (see Reference 37) devised a
camping attractiveness index for each of 59 Michigan state
parks. His index was based on 55 variables related to (1)
outdoor recreation activities, (2) natural environmental
resources, and (3) camping facilities and services. Rather
subjectively determined values were assigned to each of the
variables. Factor analysis was employed to derive aggregate
scores for the variables belonging to (2) and (3). The
camping attraction index of a park was a weighted
combination of its scores on the individual variables.
Another formulation of the attractiveness index (see
Reference 37) simply considered the type, quantity and
quality of facilities offered and was defined as a sum of
products. The "utility" of having an activity and the
quality of the activity were multiplied and this product was
added for a set of activities. This form of the
attractiveness measurement does not require data as
elaborate as that required by the Van Doren approach and
thus is convenient in analyses where only limited data are
available, as was the case in this study.

In California, Pankey and Johnston (see Reference 27)
found that prediction precision was gained by estimating
day-use and overnight-use separately and using the sum of
these quantities as the prediction of the total use. The
approach of disaggregating users was followed in this study.
The study, however, deals exclusively with main-destination
day visitors, defined as those who go to a park as their
sole trip purpose and return home on the same day.


DATA COLLECTION 


A visitor survey conducted in a number of Saskatchewan
parks during the summer of 1969 constitutes the data base
for this study. The survey methodology used to obtain the
data was similar to that described by Crapo and Chubb (see
Reference 13). Handback questionnaires in card form were
distributed according to a probability formula to visitors
entering parks through access gates during daylight hours.
These cards were distributed by either a special survey
staff or park attendants. Retrieval of the cards was
accomplished by voluntary deposit in collection boxes placed
near the park exit gates. The overall return rate was about
56 percent.

The information gathered can be divided into three
categories. The first class of data concerns user
characteristics, such as party composition, family income,
occupation, and education. The second deals with facilities
used: examples of these are the picnic ground, the bathing
beach, and the hiking trails. The third pertains to travel
characteristics, chiefly visitor origins, purpose of trip,
and length of stay in the park. Only the last class of
information was utilized in this study.

The formula used to obtain day-use estimates is
described in the version of this article which appeared
in the JOURNAL OF LEISURE RESEARCH (see Appendix 5B).


Specification of the Model 


A generalized outdoor recreation travel flow model was
assumed to have the form:


(1)  V(i,j) = F(C(i), D(i,j), S(j))


WHERE

V(i,j) = number of "visits" from origin i to site j during
         some finite time period,

C(i)   = characteristics of origin i, such as the size and
         the socio-economic characteristics of the
         population,

D(i,j) = a variable to account for the effect of the
         spatial separation of origin i and site j, and

S(j)   = the characteristics of site j, such as the natural
         features and the man-made facilities at the site.


For the sake of convenience and simplicity, linear
regression was selected as the tool of analysis. An additive
model and a multiplicative model fitted in logarithmic form
were considered; the former was chosen even though the
latter may have some desirable properties. The reasons were
twofold. First, the dependent variable, which is to be
defined later, contains zero values: the logarithm of zero
is undefined. This problem with zeros can be overcome by
adding a small constant to every observed value; yet there
is no sound theoretical basis for this technique. Second,
and more important, the magnitude of the error propagated in
the inputs when an additive model is used is generally less
than that when a multiplicative model is used (see Reference
1). Hence, the generalized model is specified in the
additive form:


(2)  V(i,j) = C(0) + C(1)X(1) + C(2)X(2) + ...
              + C(n)X(n) + U(i,j)


WHERE V(i,j) is defined as in Equation 1, X(1), ..., X(n) are
regressors which are combinations of the explanatory
variables, and U(i,j) is an error term assumed to have zero
mean and constant variance. The C(0), ..., C(n) are regression
coefficients to be estimated.


The Variables


The dependent variable for this study was the number of
day-visiting parties traveling to a park during the 1969
season (May to October) from a given area of origin. The
area surrounding each park to be studied was divided into a
number of origin areas which are referred to as observation
units. Each observation unit consists of a cluster of
contiguous census subdivisions. The number of observation
units for a particular park varies from 13 to 24. A total of
231 observation units were defined.

Based on the research findings cited earlier, it was
felt that four basic explanatory variables would be adequate
to explain the variance in the data being used in this
study. It is well known that the number of day-visits made
to a park varies directly as the size of population of an
area of origin. The population for each observation unit i,
P(i), is taken to be the total population residing in all
the census subdivisions comprising the unit.

To examine the effect of accessibility on recreation
opportunities, an impedance factor, denoted by D(i,j), was
defined in terms of actual road miles from the largest
population centre in observation unit i to the entrance of
park j. Population centres that were more than 200 miles
from the park under consideration were excluded, since an
examination of the data revealed that day-visitors whose
origins are more distant are insignificant in number. If
there was more than one route available, the shortest route
to the given park was used.

To reflect the competitive effect of alternate parks
available to people residing in different observation units,
a variable called the "alternative factor" was constructed
for each of the 231 observation units. This factor was used
to test the hypothesis that fewer visitors would be expected
to go to a park at a given location from an origin that has
many proximate alternate parks than from an origin with few
close alternatives. The alternative factor used in this
study reflects the existence of alternate parks within 100
actual road miles of an origin, since most of the main
destination day-visitors recorded in the survey traveled
under 100 miles to a park.

In the construction of an alternative factor three
elements are usually taken into account. These are the
number of, the locations of, and some physical properties of
the alternative sites. An alternative factor which has been
used (see, for example, Reference 19) is of the form:


(3)  A(i) = Σ T(k)**a / D(i,k)**b


WHERE

A(i)   = alternative factor for i,

T(k)   = a variable reflecting some characteristics of the
         alternative site k,

D(i,k) = distance between i and alternative site k, and

a, b   = constants, and the summation is over all
         alternative sites, k.


Since data on the characteristics of most of the
alternative parks, which were largely local and regional
parks, were lacking, T(k) was assumed constant. Different
values of b were considered. In particular, b was tested at
1/2, 1, 3/2 and 2. The value of b finally selected was 1/2,
since Σ 1/D(i,k)**1/2 explained more variance of the
dependent variable than the other three forms (see Table 1).

This resulted in the use of the following form:


(4)  A(i) = Σ 1/D(i,k)**1/2,


WHERE
     i = 1, ..., 231,
     j ≠ k = 1, ..., M where j refers to the park under
     consideration, and the summation is over k
     sites.
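
The alternative factor of Equation 4 can be illustrated with a
short sketch. The function name, the example distances and the
handling of the 100-mile cut-off (treated here as inclusive) are
assumptions made for illustration only; they are not taken from
the study.

```python
def alternative_factor(distances_miles, b=0.5, radius=100.0):
    """Sum 1 / D**b over alternative parks within `radius` road miles.

    `distances_miles` holds the road distances D(i,k) from origin i to
    each alternative park k (the park under study is left out).
    """
    return sum(1.0 / d ** b for d in distances_miles if 0 < d <= radius)


# Hypothetical origin with alternative parks at 25, 64 and 150 road miles.
# The park at 150 miles lies outside the assumed 100-mile radius.
a_i = alternative_factor([25.0, 64.0, 150.0])
print(round(a_i, 3))  # 1/5 + 1/8 = 0.325
```

An origin with no alternative park inside the radius gets a
factor of zero, which corresponds to the "isolated site" case
discussed later in the note.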


To measure site attraction, an "attractiveness factor"
was derived. It is a function of the degree of popularity
of a selected list of day-use activities and the quantity
and quality of their associated recreation facilities. Its
derivation assumes that the capacity of all parks is
essentially infinite. (Park planners in Saskatchewan
remarked that crowding did not appear to influence the
day-users in most of the Saskatchewan parks.) In deriving a
numerical value of attractiveness for each of the twelve
parks under consideration, two sets of CORD Study data were
utilized. The first set of information was a 1969 survey in
which approximately 3,000 individuals throughout Canada were
interviewed. Information gathered included frequency of
participation in 26 outdoor recreation activities, location
of participation and reasons for nonparticipation. The
second set of information was a set of inventory data which
contain site characteristic information such as the length
of a bathing beach, number of showers, number of picnic
tables, etc. (See CORD Study Data Documentation Volume.)


TABLE 1


A COMPARISON OF THE EFFECTS OF THE DIFFERENT FORMS OF THE
ALTERNATIVE FACTOR USED IN A REGRESSION EQUATION


ALTERNATIVE   REGRESSION    STANDARD      INCREASE      F-LEVEL       OVERALL
FACTOR*       COEFFICIENT   ERROR         IN R²         TO ENTER      R²**

Σ 1/D**1/2      -36.60         3.12         0.06         143.79        0.91
Σ 1/D          -102.48      [illegible]     0.04          85.35        0.89
Σ 1/D**3/2   [illegible]    [illegible]     0.02       [illegible]     0.87
Σ 1/D**2       -198.13        47.59      [illegible]      17.56        0.86


*  The alternative factor is multiplied by P(i) as described
   in the text.

** This R² value is the value obtained using Equation (7).


Table 2 presents the activities used in calculating the
attractiveness factor, as well as the operational
definitions of activities and the facilities for a given
activity.


The attractiveness function is defined as:


(5)  T(j) = Σ S(e) Σ R(m)Q(m)

WHERE T(j) = attractiveness of park j,

and the other terms and sums are as defined in the following
paragraphs.


TABLE 2


DEFINITIONS AND ASSOCIATED FACILITIES
FOR DAY-USE ACTIVITIES


ACTIVITY       DEFINITION              FACILITIES

Swimming       Swimming at a           Bathing beach
               bathing beach           Water quality
                                       # of showers

Boating        Recreation use of       # of boat ramps
               boats with or           # of boat piers
               without motors          # of boats for rent

Horse riding   Riding for              # of horses
               recreation              for rent

Hiking         Walking a               # of trails
               natural trail

Picnicking     Preparation and/or      # of picnic tables
               eating a meal           Quality of shade
               outdoors

Golfing        Playing golf            Golf course


As mentioned before, a 1969 household interview survey
provides a set of participation rates for 26 outdoor
recreation activities which may be used to assess
attractiveness. The participation rates relevant to the
day-use activities considered in this study are presented in
Table 3A. The term S(e), which is used to take into account
the fact that not all outdoor recreation activities are
equally popular, is defined as the participation rate X(e),
of activity e, divided by the participation rate of hiking
(the choice of the participation rate of hiking as a
reference point is arbitrary). Thus defined, swimming is
seen to be 1.10 times, while boating is only 0.79 times, as
popular as hiking (see Tables 3A & 3B).


TABLE 3A


POPULARITY RATINGS OF SOME DAY-USE ACTIVITIES


                Saskatchewan        Relative
                Participation       Popularity Rating
Activity        Rates X(e), %       S(e) = X(e)/29*

Swimming             32                 1.10
Boating              23                 0.79
Horse Riding         10                 0.35
Hiking               29                 1.00
Picnicking           69                 2.39
Golfing              17                 0.58


* The choice of 29 as reference for S(e) is arbitrary.
  S(e) = relative popularity rating of activity e, and
  Σ S(e) is summed over e.

Accepting the scale as showing that not all outdoor
recreation activities are equally popular leads to the
consideration that not all outdoor recreation facilities are
equally important in drawing attendance to a park. To
evaluate the relative importance of the facilities, rank
correlation coefficients between total day-use at the twelve
parks considered in this study and each of the facilities
listed in Table 2 were obtained using Spearman's rank
correlation coefficient. The resultant coefficients are
denoted by Y(m) in Table 3B. The relative importance rating
R(m), for facility m, is obtained by dividing Y(m) by the
rank correlation coefficient for a boat ramp. So, a bathing
beach is assumed to be 1.88 times as important as a boat
ramp, etc. (see Tables 3A & 3B).
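
Both ratings are simple ratios to an arbitrarily chosen
reference item, which the following sketch makes explicit. The
helper name is hypothetical, and the input figures are the
values as read from Tables 3A and 3B; computed ratios may differ
from the printed ratings in the second decimal place.

```python
def relative_ratings(raw, reference_key):
    """Divide every raw value by the value of the chosen reference item."""
    ref = raw[reference_key]
    return {name: value / ref for name, value in raw.items()}


# Participation rates X(e), in percent, as read from Table 3A.
participation = {"Swimming": 32, "Boating": 23, "Horse Riding": 10,
                 "Hiking": 29, "Picnicking": 69, "Golfing": 17}
popularity = relative_ratings(participation, "Hiking")        # S(e)

# A few of the rank correlations Y(m), as read from Table 3B.
correlation = {"Bathing beach": 0.58, "Boat ramp": 0.31, "Showers": 0.45}
importance = relative_ratings(correlation, "Boat ramp")       # R(m)

for name, value in popularity.items():
    print(f"S({name}) = {value:.2f}")
for name, value in importance.items():
    print(f"R({name}) = {value:.2f}")
```

Because each rating is a ratio, choosing a different reference
item rescales every rating by the same constant and leaves
their relative order unchanged.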

TABLE 3B


IMPORTANCE RATINGS OF SOME DAY-USE FACILITIES


                                   Importance    Relative Importance
                Associated         Rating of     Rating of Facility
Activity        Facility           Facility      R(m) = Y(m)/0.31*
                                   Y(m)

Swimming        Bathing beach        0.58            1.88
                Water quality        0.16            0.52
                Showers              0.45            1.45
Boating         Boat ramp            0.31            1.00
                Boat pier            0.33            0.94
                Boat rental          0.19            0.61
Horse Riding    Horse rental         0.20            0.65
Hiking          Trails               0.11            0.36
Picnicking      Picnic tables        0.63            2.01
                Shade                0.18            0.58
Golfing         Golf course          0.41            1.32

NOTES
* The choice of 0.31 as reference point for R(m) is
  arbitrary.
  R(m) = relative importance rating of facility m,
  Q(m) = score of facility m, according to its
         quantity or quality,
  and Σ R(m)Q(m) is summed over m.


To account for the quantity or quality of the
facilities, rank numerical values were assigned. The ranks,
denoted by Q(m), range from 1 to 12. For instance, when the
number of picnic tables in each of the twelve parks
considered in this study are ranked, and the quality of
shade at the picnic grounds in these same parks are ranked,
one obtains:


# picnic      rank
tables        value

   38           1
   58           2
   81           3
  100           4.5
  100           4.5
  105           6
  107           7
  140           8
  147           9
  148          10
  163          11
  270          12

quality       rank
of shade      value

   P            1
   F            3.5
   G            9


WHERE: P denotes poor; F, fair; and G, good.


Hence, it may be seen that "100 picnic tables" received
4.5 points while "10 horses for rent" received 6 points and
so on (see Table 4).
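
The rank values can be reproduced with a small sketch. It
assumes that tied quantities share the mean of the ranks they
occupy, which is consistent with the 4.5 points given to the two
parks with 100 picnic tables; the function name is hypothetical.

```python
def average_ranks(values):
    """Rank values ascending from 1, giving tied values their mean rank."""
    ordered = sorted(values)
    rank_of = {}
    for v in set(ordered):
        first = ordered.index(v) + 1           # lowest rank held by v
        last = first + ordered.count(v) - 1    # highest rank held by v
        rank_of[v] = (first + last) / 2
    return [rank_of[v] for v in values]


# Picnic table counts for the twelve parks (see Table 4).
picnic_tables = [38, 58, 81, 100, 100, 105, 107, 140, 147, 148, 163, 270]
ranks = average_ranks(picnic_tables)
for count, rank in zip(picnic_tables, ranks):
    print(count, rank)
```

The two parks with 100 tables occupy ranks 4 and 5 and therefore
each receive (4 + 5)/2 = 4.5.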


With the S(e), R(m), and Q(m) values calculated in the
way discussed, it is possible to use Equation 5 to calculate
various attractiveness values. The attractiveness values,
T(j), of the twelve parks are listed in Table 5.
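
As a sketch of how Equation 5 combines the three ratings, the
following computes T(j) for a made-up park that offers only
swimming and picnicking. The park, the function name and the
grouping of facilities by activity are illustrative assumptions;
the S(e), R(m) and Q(m) figures are readings of Tables 3A, 3B
and 4.

```python
def attractiveness(park_facilities, popularity, importance):
    """T(j): sum over activities e of S(e) times the sum over that
    activity's facilities m of R(m) * Q(m)."""
    total = 0.0
    for activity, facilities in park_facilities.items():
        inner = sum(importance[m] * q for m, q in facilities.items())
        total += popularity[activity] * inner
    return total


# Ratings as read from Tables 3A and 3B (assumed readings).
S = {"Swimming": 1.10, "Picnicking": 2.39}
R = {"Bathing beach": 1.88, "Water quality": 0.52, "Showers": 1.45,
     "Picnic tables": 2.01, "Shade": 0.58}

# Hypothetical park: rank scores Q(m), grouped by activity.
park = {
    "Swimming": {"Bathing beach": 9, "Water quality": 9, "Showers": 10.5},
    "Picnicking": {"Picnic tables": 8, "Shade": 9},
}
t_j = attractiveness(park, S, R)
print(f"T(j) = {t_j:.2f}")
```

Here swimming contributes 1.10 x 36.825 and picnicking
2.39 x 21.30, so picnicking dominates the total even though it
has fewer facilities, because of its high popularity rating.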


The Hypothesis To Be Tested 


It was expected that population size, accessibility,
alternative recreation opportunities, and attractiveness
would interact to produce the observed differences in park
use. To capture the interaction effects among the
explanatory variables, four regressors were hypothesized to
be necessary in an additive model. These were
P(i)/g(D(i,j)), P(i)A(i)/g(D(i,j)), T(j)/g(D(i,j)) and
1/g(D(i,j)) where g(D(i,j)), which varies with D(i,j), is
defined as follows:


(6)  g(D(i,j)) = D(i,j)/2        when   0 < D(i,j) ≤ 20
     g(D(i,j)) = D(i,j)                20 < D(i,j) ≤ 55
     g(D(i,j)) = D(i,j)**3/2           55 < D(i,j)
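
A minimal sketch of the distance function and the four
regressors follows. It assumes that the three branches of
Equation 6 read D/2, D and D**3/2 with break points at 20 and 55
miles; the print of the first branch is difficult to read, so
that branch in particular should be treated as an assumption.
The function names are hypothetical.

```python
def g(distance):
    """Piecewise distance function, assuming branches D/2, D and D**(3/2)."""
    if distance <= 0:
        raise ValueError("distance must be positive")
    if distance <= 20:
        return distance / 2
    if distance <= 55:
        return distance
    return distance ** 1.5


def regressors(population, alt_factor, attractiveness, distance):
    """Return the four regressors P/g, P*A/g, T/g and 1/g."""
    gd = g(distance)
    return (population / gd,
            population * alt_factor / gd,
            attractiveness / gd,
            1.0 / gd)


# Hypothetical observation: P = 10 (thousands), A = 1.0, T = 100, D = 40 miles.
print(regressors(10.0, 1.0, 100.0, 40.0))
```

Since every regressor is divided by g(D(i,j)), the contribution
of an origin falls off linearly with distance in the middle
range and considerably faster beyond 55 miles.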
TABLE 4


RANK SCORES* OF FACILITIES BASED ON GIVEN QUALITY OR QUANTITY


FACILITY            RANK VALUE, Q(m)

Bathing Beach       Fair - 4, Good - 9

Water Quality       Poor - 3.5, Fair - 5, Good - 9

# Showers           6 - 8, 8 - 10.5

# Boat Ramps        1 - 5, 2 - 8.5, 3 - 10,
                    5 - 11, 8 - 12

# Boat Piers        1 - 7.5

# Rental Boats      8 - 7, 10 - 8, 14 - 9,
                    16 - 10, 30 - 11, 140 - 12

# Rental Horses     10 - 6, 12 - 7.5, 17 - 9,
                    22 - 10.5, 30 - 12

# Trails            1 - 6, 2 - [illegible], 4 - 9,
                    [remaining entries illegible]

# Picnic Tables     38 - 1, 58 - 2, 81 - 3,
                    100 - 4.5, 105 - 6, 107 - 7,
                    140 - 8, 147 - 9, 148 - 10,
                    163 - 11, 270 - 12

Quality of Shade    Poor - 1, Fair - 3.5, Good - 9

Golf Course         9 Hole - 8.5, 18 Hole - 11


* The word before a dash refers to the quality of a
facility, while the number before a dash refers to the
quantity of the facility. The number after the dash is the
score of the facility.


TABLE 5


ATTRACTIVENESS RATINGS


Park                  Attractiveness T(j)

Buffalo Pound               96.12
Cypress Hill                45.26
Duck Mountain              126.40
Echo Valley                112.05
Good Spirit                 16.56
Green Water                 61.46
Prince Albert               88.75
Moose Mountain           [illegible]
Pike Lake                   96.10
Rowan's Ravine              59.01
Battleford's               104.28
Besant                      26.60


With the four regressors included, Equation 2 takes the
following form:


(7)  V(i,j) = C0 + (C1 P(i) + C2 P(i)A(i) + C3 T(j) + C4)/g(D(i,j))

WHERE

V(i,j)   = the dependent variable, which is the
           number of vehicles in hundreds estimated to
           be travelling to park j from observation unit
           i per season,

P(i)     = population, in thousands, of observation
           unit i,

D(i,j)   = road distance in miles, from the largest
           population center in observation unit i to
           park j,

A(i)     = alternative factor for observation unit i,

T(j)     = attractiveness of park j,

C0, ..., C4 = parameters to be estimated, and

g(D(i,j)) is defined by Equation 6.


The distance function g(D(i,j)) has also been replaced
by continuous functions of the form: g(D(i,j)) = D(i,j)**c,
0 < D(i,j), where c = 1/2, 1, 3/2, 2, 5/2 and 3. Results
showed that g(D(i,j)) gives a higher R² value than the other
forms used (see Table 6).


TABLE 6


A COMPARISON OF THE EFFECTS OF THE DIFFERENT FORMS OF THE
DISTANCE FUNCTION USED IN A REGRESSION EQUATION


Distance Function      Overall R²*

D(i,j)**1/2               0.64
D(i,j)                    0.84
D(i,j)**3/2               0.90
D(i,j)**2                 0.90
D(i,j)**5/2               0.89
D(i,j)**3                 0.87
g(D(i,j))                 0.91


* This R² value is the value obtained using Equation 7.


REGRESSION RESULTS 


To estimate the parameters of the model described in
Equation 7, stepwise multiple regression techniques were
applied to data which consisted of the 231 observations (one
for each observation unit). The equation obtained was as
follows:


(8)  V(i,j) = 1.33 + (120.31 P(i) - 36.60 P(i)A(i) +
              1.25 T(j) - 104.56)/g(D(i,j)).
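
The fitted equation is easy to evaluate directly. In the sketch
below the function name and the example inputs are hypothetical;
the value of g(D(i,j)) is passed in as a number, so no particular
form of the distance function is assumed.

```python
def predicted_vehicles(population, alt_factor, attractiveness, g_of_distance):
    """Evaluate the fitted additive model (Equation 8).

    population is in thousands; the result is in hundreds of vehicles
    per season from one observation unit.
    """
    numerator = (120.31 * population
                 - 36.60 * population * alt_factor
                 + 1.25 * attractiveness
                 - 104.56)
    return 1.33 + numerator / g_of_distance


# Hypothetical origin: 10 thousand people, A(i) = 1.0, T(j) = 100, g(D) = 40.
v = predicted_vehicles(10.0, 1.0, 100.0, 40.0)
print(f"{v:.2f} hundred vehicles")
```

For these inputs the numerator is 1203.10 - 366.00 + 125.00 -
104.56 = 857.54, so the estimate is 1.33 + 857.54/40, or about
22.8 hundred vehicles.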


The analysis of variance table for the regression is
presented in Table 7. The coefficients, their standard
errors, F-values, and the R² value at each step of the
regression are presented in Table 8.

The equation determined is a reasonably good predictor
since the overall F = 562.93 for regression is much more
than 4 times the tabulated F(4, 226, 0.999) = 4.62 at the
0.1 percent level, as it is suggested it should be by Draper
and Smith (see Reference 16). The term P(i)/g(D(i,j)) was the
first to go into the equation and it explained a substantial
amount (84%) of the variation of the dependent variable.
Therefore, the equation should be very sensitive to a
population or distance change. The term 1/g(D(i,j)) serves
as a correction term of P(i)/g(D(i,j)) since some of the
largest residuals were found for observations which were at
short distances from a park. The term P(i)A(i)/g(D(i,j)) was
the second to go into the equation and it explained six
percent of the total variation of the dependent variable.
Of all regressors, T(j)/g(D(i,j)) contributed least in
increasing the R² value. Nevertheless, the precision of the
estimate of the dependent variable was increased, even
considering the loss of one degree of freedom. (It may be
noted that the addition of a new regressor will generally
increase the R² value (and never lower it) but it will not
necessarily increase the precision of the estimate of the
dependent variable, see Reference 16.)


TABLE 7

THE ANALYSIS OF VARIANCE FOR THE STEPS
IN FITTING THE VISITATION EQUATION (EQUATION 8)

      Degree of Freedom       Sum of Squares           Mean Square
                                                                            Overall
No.   Regression  Residual    Regression   Residual    Regression  Residual    F*

1         1         229       120074.81    22594.67    120074.81    98.67    1216.97
2         2         228       128813.25    13856.22     64406.63    60.77    1059.79
3         3         227       129088.81    13580.66     43029.60    59.83     719.24
4         4         226       129656.13    13013.34     32414.03    57.58     562.93

* These values are all significant at the 0.1 per cent
  probability level.
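
The mean squares, the overall F and R² at any step follow
directly from the sums of squares and degrees of freedom. The
sketch below recomputes them for the final step, using the
figures as read from Table 7; the function name is hypothetical.

```python
def anova_step(ss_regression, ss_residual, df_regression, df_residual):
    """Return (mean square regression, mean square residual, F, R-squared)."""
    ms_reg = ss_regression / df_regression
    ms_res = ss_residual / df_residual
    f_value = ms_reg / ms_res
    r_squared = ss_regression / (ss_regression + ss_residual)
    return ms_reg, ms_res, f_value, r_squared


# Final (fourth) step, sums of squares as read from Table 7.
ms_reg, ms_res, f_value, r2 = anova_step(129656.13, 13013.34, 4, 226)
print(f"MS regression = {ms_reg:.2f}")
print(f"MS residual   = {ms_res:.2f}")
print(f"overall F     = {f_value:.2f}")
print(f"R-squared     = {r2:.4f}")
```

The residual mean square is the quantity that governs the
precision of the estimate: it falls from 98.67 to 57.58 over the
four steps, even though R² rises by less than 0.07.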


In more practical terms it is important to reflect on
the applied significance of the regression coefficients. It
is interesting to note that the P(i)/g(D(i,j)) term, which
first entered the equation and explained 84 percent of the
variance, suggests an equation of the following form (when
the constant term 1.33 is ignored):


     V(i,j) = K P(i)/g(D(i,j)).

Given that this gravity model has explained so much of the
variance, it is clear that it is an excellent first
approximation to the way people behave. This means that at
best the pattern of alternatives and different site
attractiveness can only explain some of the remaining 16
percent of the variance.


TABLE 8


STATISTICS ON THE REGRESSION COEFFICIENTS OF THE
VISITATION EQUATION*


Regression                  Standard
Coefficient     Value       Error       F-Value      R²

C0                1.33
C1              120.31       5.80       429.80      0.8416
C2              -36.60       3.12       137.81      0.9029
C3             -104.56      27.30        14.63      0.9048
C4                1.25       0.40         9.85      0.9088


* All regression coefficients are significant at the one per
cent probability level and all have the expected signs.

Both attractiveness and the alternative factor are
introduced in a rather ad hoc way in this analysis. The
kinds of arguments used suggest a change in participation
would result from changes in an index reflecting a number of
alternatives or in another index reflecting the quantity and
quality of facilities. The fact, however, that the given
indexes should be related to usage by Equation 8 rather than
by some similar form has no sound theoretical basis. Yet,
one can at least hope that a model which explains part of
the 16 percent of the remaining variance after a "simple"
gravity model is used will be a guide to a more
"structurally" sound model leading to improvement in one's
understanding of outdoor recreation phenomena and thus to a
better theory.

In particular, the fact that the regressor involving
attractiveness goes into the equation with a low explanatory
power leads one to suspect that the functional form of
Equation 8 is incorrect in some respect or that the
attractiveness measure itself is unsound. (See TN 9 and TN
28.) It is suspected that the problem has more to do with
the error in the functional form than with the unsoundness
of the attractiveness measure adopted. Still, a computer run
using V(i,j)/P(i) as the dependent variable and 1/g(D(i,j)),
A(i)/g(D(i,j)), T(j)/g(D(i,j)) as regressors resulted in
T(j)/g(D(i,j)) entering first into the equation and
explaining 69 percent of the variance in the first step of
the step-wise regression.

The coefficient of the alternative factor term is of
more immediate practical interest than that of the
attractiveness term. The value of A(i) for the various
observation units in Saskatchewan ranges from 0 to 2.59 with
an average of 1.48. One may regard the site for which there
is no alternative as an "isolated site" and call the
attendance figure of this site its "isolated site
potential." Compared with isolated site potential, the
average and maximum reduction in attendance due to
alternatives, when other terms in Equation 8 are held
constant, are thus (1.48 x 36.60/120.31) x 100% = 45% and
(2.59 x 36.60/120.31) x 100% = 79%, respectively. Of course,
the model is meaningless beyond certain limits. If
attractiveness is ignored, a negative number of visitors
would result when A(i) is greater than 3.28. This limitation
on the model strongly suggests that it will predict poorly
for an origin having dense clusters of alternatives at short
distances from the origin. Other than this, no undue
difficulties will likely arise as a result of the
alternative factor formulation when the model is applied.
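
The reduction figures and the limit on A(i) come from comparing
the alternative factor term with the population term of
Equation 8. The sketch below reproduces that arithmetic; the
names are hypothetical, and the attractiveness and constant
terms are ignored, as in the text.

```python
C1 = 120.31   # coefficient of P(i)
C2 = 36.60    # magnitude of the coefficient of P(i)A(i)


def reduction_percent(alt_factor):
    """Share of the population term removed by the alternative factor term."""
    return alt_factor * C2 / C1 * 100.0


average = reduction_percent(1.48)   # average A(i) in Saskatchewan
maximum = reduction_percent(2.59)   # largest A(i) in Saskatchewan
break_even = C1 / C2                # A(i) at which 120.31 - 36.60 A(i) = 0

print(f"average reduction: {average:.0f}%")
print(f"maximum reduction: {maximum:.0f}%")
print(f"population term turns negative above A(i) = {break_even:.3f}")
```

The ratios work out to roughly 45 and 79 percent, and the
population term changes sign at an A(i) of about 3.287, just
beyond the 3.28 quoted in the text.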


APPLICATION OF THE MODEL 


One of the main objectives of developing the day-use
model was to predict the number of day-use visits to a park
with a certain level of development. Because of the
consideration of population size, accessibility, alternative
recreation opportunities, and attractiveness it is
reasonable to suggest that the model enables the planner to
choose among alternative sites in deciding where a proposed
park should be built (for an application, see TN 7).
Intuitively, given certain assumptions regarding reaching a
new usage equilibrium, the introduction of a new park will
draw visitors away from existing parks and possibly induce
new usage. On the other hand, the removal of an existing
park from the system can be studied in a similar way. It is
the pattern of change in use which results with a change in
a system of parks that is of concern here.

To illustrate the planning use of the model, use
estimates have been derived for a hypothetical park about
thirty miles south of Rosetown in Saskatchewan. The location
of the park, the observation units, and the corresponding
population centres are shown in Figure 1. It is assumed
that the facilities which the new park will contain are: a
good beach with good water quality, eight showers, five boat
ramps, one boat pier, 30 boats, 17 horses for rent, six
trails, and 140 picnic tables with good shade. The
characteristics of the observation units used for the
calculations used to determine total use figures for the new
site are listed in Table 9.

On the basis of the assumed characteristics and
available census data, a total of 9,372 vehicles from the 15
observation units could be expected to come to the new park
for a day-use visit during a given season. Attendance
figures from the various origins may be used for traffic
considerations. Total use may of course be related to the
demand on projected facilities, the need for staff, and the
required budget for operation.


Figure 1
Observation Areas of a Hypothetical Park


TABLE 9


PREDICTED MAIN-DESTINATION DAY-USE OF A HYPOTHETICAL PARK
FROM OBSERVATION UNITS HAVING GIVEN CHARACTERISTICS*


Obs.                   Population      Alternative   Est. Veh.
Unit  Population       in Thousands,   Factor        (100's)
      Center           1966
  1   Scott                3.96           0.73          1.61
  2   Biggar               5.53           0.44          2.52
  3   Delisle              5.68           0.48          2.30
  4   Saskatoon          117.41           0.63         12.54
  5   Kindersley           7.84           1.31          2.10
  6   Dodsland             1.36           0.55          1.56
  7   Herschel             1.74           0.29          5.71
  8   Harris               1.63           0.40          5.20
  9   Rosetown             4.18           0.55         16.41
 10   Dinsmore             2.85           0.76         11.65
 11   Lucky Lake           3.81           1.09          1.92
 12   Elrose               1.78           0.60         10.30
 13   Outlook              7.48           0.68         16.10
 14   Eston                5.18           1.2?          2.20
 15   Cabri                4.98           1.50          1.60

TOTAL                    175.41                        93.72


* The attractiveness rating, by Equation 5, for the
  hypothetical park is 119.33.

(The column "Distance, in Miles, From Population Center To
the Park" is not legible in this copy; the only surviving
entry is 124. A "?" marks a digit that is not legible.)

TN 1 CHAPTER II
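
The total of 9,372 vehicles quoted in the text can be checked against the per-unit estimates in Table 9. The sketch below is only a consistency check: the lists hold the table entries as read from a damaged copy, so every individual reading (and the variable names) should be treated as an assumption.

```python
# Estimated vehicles (in hundreds) for observation units 1 to 15,
# as read from Table 9. Damaged digits were resolved by assumption.
est_vehicles_hundreds = [
    1.61, 2.52, 2.30, 12.54, 2.10, 1.56, 5.71, 5.20,
    16.41, 11.65, 1.92, 10.30, 16.10, 2.20, 1.60,
]

# 1966 population (in thousands) of the corresponding population centres.
population_thousands = [
    3.96, 5.53, 5.68, 117.41, 7.84, 1.36, 1.74, 1.63,
    4.18, 2.85, 3.81, 1.78, 7.48, 5.18, 4.98,
]

total_vehicles = round(sum(est_vehicles_hundreds) * 100)
total_population = round(sum(population_thousands), 2)

print(total_vehicles)    # 9372, the figure quoted in the text
print(total_population)  # 175.41, the TOTAL row of the table
```

Both column sums reproduce the totals printed in the table, which supports the readings used here.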


CONCLUSION 


The model (Equation 8) developed in this paper explains
a reasonably large percentage (91%) of the variation in the
dependent variable, and the regression coefficients are all
significant at the one percent probability level. The
example of the application of the model to planning
therefore seems appropriate.

Improvements in the precision of the equation can
perhaps be expected by using travelling time or a
combination of travelling time and distance to formulate a
new accessibility variable. Further research on
attractiveness and the alternative factor to refine present
methodologies and develop new methodologies for the
measurement of these factors also seems appropriate (see TN
2, TN 5, TN 4, TN 9 and TN 28).
CORD STUDY TECHNICAL NOTE 7


APPLICATION OF A MATHEMATICAL MODEL TO COMPARE THE USE 
OF TWO POTENTIAL PARK SITES 


J. Beaman, H. K. Cheung, and S. Smith


ABSTRACT 


Calculation of predicted visitor flows to a park
depends (1) on being able to evaluate the probable attendance
at the new park at a given time and (2) on being able to
estimate the trends in attendance that are important in
explaining changes in the predicted attendance and to use
these to estimate the attendance at the park at given future
times. This paper reports that past research resulted in the
development of a mathematical model that was used to predict
use of the two proposed parks being compared. Available
trend data were then used to forecast future use from the
given season for which use had been estimated.

A special class of day users, main destination day
users, was designated. The other class of user considered
was campers who stay at least one night at a park. The
importance of recognizing these classes of users in making
estimates of the attendance at the two parks under the
development options is stressed.

Also, policy considerations relevant to how the results
of analysis are presented are introduced. It is pointed out
that use figures predicted for the two parks are broken down
by origin and type of user (day-use and camper) for a
reason: managers responsible for selecting an option will
then know more than total use. The manager will know "who"
is being served and how.


PURPOSE 


The purpose of this Technical Note is to present the
rationale for, and some of the details of, the methodology
used to predict attendance at two potential park sites in
Ontario, the Georgian Bay site and the French River Mouth
site.


INTRODUCTION 


That objective is not the original one. The note was
originally prepared by Cheung and other CORD Study
researchers to communicate a method of making estimates that
was the best method these researchers could arrive at for
making estimates in 1972, given their problem and the time
constraints under which they worked. There was a legitimate
desire to make the work known to others so that, should they
find themselves in similar circumstances, they could utilize
those of the ideas presented that they found relevant.
However, since that original work was carried out, a number
of research projects have resulted in good park use data
(which are readily available) in a number of provinces and in
the development of overnight use models that remove the need
for making approximations of overnight use based on the
amount of day use. As well, there have been other
developments. Still, it is considered worthwhile to give
some details about what was done and why, and to show what
was not done because it was not possible with the data or
the methodologies then available.

It is useful to recognize that an original objective of
the project of calculating visitor flows to the two sites
considered here was to evaluate their relative use as
National Park sites. In general terms the points to be
clarified by the modelling effort were (1) who could be
expected to be served and (2) how. Given the data available
when the study had to be done, the only variables which
could be considered that would aid policy makers were types
of users and origin of users. However, when the researchers
started to seriously consider the information on types and
origins of users, they found that a great deal of
information could and should be developed. For example, the
question of who is being served can be partially answered by
giving a distribution of the origins of the future visitors
to a site that will be developed. However, if users from one
origin come primarily during daylight hours during the week
and others from more distant origins come exclusively on the
weekend for camping, this information should be given to
planners and managers who are considering the site or
comparing it to the merits of different sites. It would be
inefficient to provide for weekday camping when the greatest
demand occurs on weekends. On the other hand, weekday
day-use facilities and programs might be provided at a very
low cost (relative to camping facilities) but, because of a
proposed park's location, these facilities might serve only
one local community. Locating the park at another site
might, with the same facilities, serve more communities.

In a similar vein, when one is not concerned with the
origin of users, the presentation of one or two use figures
that tell the whole use story to managers is an
impossibility. In this paper, figures are given on the
number of total visitors as well as on the number of total
visitor-days. The "number of visitors" to a park is
obviously not a precise measure, the only measure or even
the best measure of the use of a park because, for example,
visitors will stay for different lengths of time in a park
and many visitors will visit a park several times on one
trip while others visit it many times in one season. A park
visit can last anywhere from a couple of hours to a week or
more. Some measure of day-use and overnight-use
and of average length of stay is necessary for the planning of
services, programs, personnel needs and maintenance and for
evaluating the potential regional economic impact of a park
development.

Though results are not presented here, the distribution
of use between weekdays and weekends is important. The
possibility of developing predictive weekend and weekday-use
models is discussed in TN 8, where an explicit methodology
for defining loading curves for different types of park
users by origin is specified. More theoretical statements of
the need for a variety of different park use models by user
types are presented in Technical Notes 30 and 40. In this
present note the need to recognize the breakdown between
weekend and weekday use was clear; however, at the time of
writing, carrying out the actual calculations was not
possible.


THE MODEL


The basic model used in calculating the use figures for
the two parks considered here was:


(1) Y(i,j) = u + ((C(0)T(j) - C(1)A(i,j)) P(i))/g(D(i,j))

WHERE

Y(i,j)     = the dependent variable, the estimated number
             of main destination day-use visitors, in
             hundreds, travelling to park j from origin i
             per season;

u          = a mean calculated in a regression;

T(j)       = attractiveness of park j;

C(0), C(1) = scaling constants as determined by regression
             methods;

P(i)       = population, in thousands, of origin i;

D(i,j)     = road distance, in miles, from origin i to park
             j;

A(i,j)     = alternative factor of origin i, with respect
             to park j, as defined in Cheung (1970); and
where g(D(i,j)) has two alternative formulations
A and B:

    D(i,j)           0 <= D(i,j) <= 55
    D(i,j)**3/2      55 < D(i,j)


OR


The A and B formulations account for the different results
presented later.
Equation 1 is a modified form of the main destination
day-use model developed by Cheung using CORD Study Park User
Survey data for Saskatchewan (for details on the data see TN
1 and the CORD Study Data Collection and Documentation
Volume).

The reasons that the Cheung model was not used per se
in making predictions for the two Ontario sites were that
(1) relevant park characteristics data such as those given in
Table 1 for the construction of the attractiveness variable
as defined by Cheung (TN 1) were not available, and (2) it
was believed that attractiveness would have a multiplicative
effect rather than an additive one on the volume of use.
Cheung's original model, as is seen in Equation 1, suggested
a basically additive effect. This suggested that a modified
"Cheung model" of the following form be used:


(2) Y(i,j) = (C(0)T(j)(1 - C(1)A(i,j))P(i))/g(D(i,j))


Assuming that Cheung's constant term u has an effect of
less than 1%, as it did in predicting use of parks in
Saskatchewan, it can be ignored. Omitting the
attractiveness term, because it is considered that it should
be a multiplicative factor as indicated in Equation 2 (T(j)
is attractiveness), one can proceed to develop a new model.
Assuming, further, that there are (on the average) four
people in a vehicle that enters a park, one obtains, by some
algebraic manipulations of Cheung's equation from TN 1:


(3) Y(i,j) = 481.24 (1 - 0.3 A(i,j)) P(i)/D(i,j)
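
Equation 3 is simple enough to evaluate directly. The sketch below is an illustration only: it assumes the constants read as 481.24 and 0.3 (which agree with four persons per vehicle applied to the TN 1 coefficients, 4 x 120.31 = 481.24 and 36.60/120.31 = 0.30), and the function name and the sample inputs are hypothetical.

```python
def main_destination_day_visitors(population_thousands, distance_miles,
                                  alt_factor):
    """Equation 3: estimated main destination day-use visitors, in
    hundreds, travelling from one origin to one park per season."""
    if distance_miles <= 0:
        raise ValueError("road distance must be positive")
    return (481.24 * (1.0 - 0.3 * alt_factor)
            * population_thousands / distance_miles)


# Hypothetical origin: 10 thousand people, 50 road miles from the park,
# alternative factor 0.5.
visitors_hundreds = main_destination_day_visitors(10.0, 50.0, 0.5)
print(round(visitors_hundreds, 2))  # 81.81, i.e. about 8,181 visitors
```

Note that the estimate turns negative once the alternative factor exceeds 1/0.3, so the function should only be applied within that range.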


Certain data are required before Equation 3 can be used
to calculate visitor flows. Origin-destination figures were
obtained from the Ontario Department of Lands and Forests.
Since 1971 census figures were not available at the time
that this research was carried out, 1971 population figures
were estimated from census data by using the following
equation:


(4) P(1971) = (P(1966))**2/P(1961)


WHERE P(y) = population in year y


This is obviously based on the assumption that the
proportional change in population between 1966 and 1971
would be the same as that between 1961 and 1966. It was
recognized that this approximation was not very good but,
given other much more major inaccuracies, even using 1966
census figures would have been a minor source of error.
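
Equation 4 amounts to carrying the 1961 to 1966 growth ratio forward one more census interval. A minimal sketch follows; the function name and the sample populations are hypothetical.

```python
def project_1971_population(p1961, p1966):
    """Equation 4: P(1971) = P(1966)**2 / P(1961).

    Equivalent to P(1966) * (P(1966) / P(1961)), i.e. the same
    proportional change is assumed for 1966-1971 as for 1961-1966.
    """
    if p1961 <= 0:
        raise ValueError("1961 population must be positive")
    return p1966 ** 2 / p1961


# Hypothetical origin that grew from 100 to 110 thousand (10%).
print(project_1971_population(100.0, 110.0))  # 121.0
```

A community that grew by 10% between the two censuses is thus projected to grow by a further 10%.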

The preceding discussion has not been about a model
that allows a park's total attendance to be estimated. It
may already be clear to the reader that the model just
introduced is one that is used only to calculate main
destination day-use. The total estimated use at a proposed
site consists of day-use (which is sub-divided into main
destination day-use and non-main destination day-use) and
overnight-use. For the reasons indicated in the Introduction
it is important to have estimates for these other classes of
use. The problem facing the researcher or planner is: if one
has only a day-use model and no time to develop other
regression models, how does one obtain "scientific"
estimates of the other types of use?

To this end it was necessary to use information on the
ratio of main destination day-use to other uses. Camper
registration figures for main destination Ontario provincial
parks for the period 1968 - 1970 indicate that the ratio of
day-use visits to overnight-use visits was approximately six
to one (see Table 1). The number of overnight users,
therefore, could be estimated once the number of day-users
was known or estimated. The equations for the calculation of
the total number of visitors and the total number of visitor
days used were as follows:


(5) Total visitors = day visitors plus overnight visitors,


OR


(6) Total visitors = day visitors plus 1/6 (day visitors),


AND


(7) Total visitor days = (day visitors * 1) + (average
    length of overnight stay * estimated number of
    overnight visitors).
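
Equations 5 to 7 can be written out as a short calculation. The sketch below assumes the six-to-one ratio of day visitors to overnight visitors described in the text; the function names and the sample figures are hypothetical.

```python
DAY_TO_OVERNIGHT_RATIO = 6.0  # day-use visits per overnight-use visit


def overnight_visitors(day_visitors):
    """Overnight visitors estimated from day visitors (six to one)."""
    return day_visitors / DAY_TO_OVERNIGHT_RATIO


def total_visitors(day_visitors):
    """Equation 6: day visitors plus 1/6 of day visitors."""
    return day_visitors + overnight_visitors(day_visitors)


def total_visitor_days(day_visitors, avg_overnight_stay_days):
    """Equation 7: one day per day visitor plus the average length
    of stay for each estimated overnight visitor."""
    return (day_visitors * 1
            + avg_overnight_stay_days * overnight_visitors(day_visitors))


# Hypothetical park with 600 day visitors and a 2.5-day camper stay.
print(total_visitors(600))           # 700.0
print(total_visitor_days(600, 2.5))  # 850.0
```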


It was also recognized that there was a need to
account for the load that visitors place on a park, given
the clearly noticeable differentials in average length of
stay for campers, which increases from about 2.5
to 4 days from south to north. From Table 2 it may be noted
that an average increase of 20 percent in camping stay
occurred for Northern Ontario parks from 1968 - 1969. In the
area being considered, it seemed reasonable to accept that a
10 percent per year increase in average length of stay for
campers is to be expected as an average for all sites (see
Table 2), at least for the next few years.


Now Equations 6 and 7 can be used for projection into
the future if three rates are specified:


1. Per annum rate of increase of day-use = RIDV

2. Per annum rate of increase of overnight-use = RICV

3. Per annum rate of increase in average length of camper
   stay = RIALCS


The equations based on a continued constant percent
growth for 5 to 10 years are:


(8) Total visitors (1971 + t) = (day-use) * (RIDV + 1)**t +
    (overnight-use) * (RICV + 1)**t
TABLE 1


STATISTICS ON SOME ONTARIO "MAIN-DESTINATION" PARKS*


                              Day-use    Length of   Day-use
                              Picnic     Day-use     Parking
Name of Park       Acreage    Tables     Beach       Acreage
Oastler Lake          718       165       300 ft.      2.0
Arrowhead            1652        98       900          2.5
Sturgeon Bay           35        **        **          **
Killarney           84350        50       600          **
Grundy Lake          6313       552       800          8.0
Restoule             1635        **        **          **
Mikisew               420        90      1500          2.5
Killbear Point       4340       489       800          1.5

                  Number of
                  Day-Use      Number of
                  Parking      Developed Campsites    Day-use/Campers
Name of Park      Spaces       1968   1969   1970     1968   1969   1970
Oastler Lake        250         170    170    183      8.3    7.6    5.7
Arrowhead           210         102    253    253      3.8    4.6    4.7
Sturgeon Bay         **          87     89     87      5.6    3.6    8.7
Killarney           100         102    125    160     13.5    9.9    8.3
Grundy Lake         500         537    537    540      2.9    4.5    4.2
Restoule             **         229    229    225      2.7    3.6    4.7
Mikisew             226         256    256    256      2.2    5.4    5.3
Killbear Point      100         878    939   1018      4.6    6.6    5.1

Average                                                5.5    5.7    5.8


* Being a main destination park refers to having over 50%
  main-destination visitors as reported by the province of
  Ontario.

** Unknown.


(9) Total visitor days (1971 + t) = (day-use) * (RIDV + 1)**t
    + (overnight-use) * (RICV + 1)**t * (RIALCS + 1)**t
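
The projection in Equations 8 and 9 is compound growth applied separately to each class of use. The sketch below is an illustration under two assumptions: the two classes of use are added together (consistent with Equations 5 to 7), and the overnight term in Equation 9 is the 1971 figure in camper days. The rates are those reported in the text for Ontario; the function names and base-year figures are hypothetical.

```python
RIDV = 0.069    # per annum rate of increase of day-use
RICV = 0.113    # per annum rate of increase of overnight-use
RIALCS = 0.11   # per annum rate of increase in average camper stay


def projected_total_visitors(day_use, overnight_use, t):
    """Equation 8: total visitors t years after 1971."""
    return day_use * (1 + RIDV) ** t + overnight_use * (1 + RICV) ** t


def projected_total_visitor_days(day_use, overnight_days, t):
    """Equation 9: total visitor days t years after 1971.

    overnight_days is the 1971 overnight use in visitor days; it grows
    with both the number of campers and their average length of stay.
    """
    return (day_use * (1 + RIDV) ** t
            + overnight_days * (1 + RICV) ** t * (1 + RIALCS) ** t)


# Hypothetical 1971 base: 600 day users, 100 campers, 250 camper days.
for t in (0, 1, 5, 10):
    print(1971 + t,
          round(projected_total_visitors(600, 100, t)),
          round(projected_total_visitor_days(600, 250, t)))
```

Because the overnight term compounds two rates, visitor days grow considerably faster than visitor counts over a ten-year horizon.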

Data provided by the province of Ontario contained the
necessary figures to calculate the needed rates. Data in
Tables 2 and 3 show why RIDV = .069, RICV = .113 and
RIALCS = .11 were used.


TABLE 2


AVERAGE LENGTH OF STAY - SUMMARY STATISTICS


Some Southern Ontario        Average Length of Stay
Provincial Parks             in Number of Days
                             1968    1969    1970
Holiday Beach                 2.3     2.2     2.2
Ipperwash                     4.4     4.9     5.0
Long Point                    2.8     2.1     2.?
Pinery                        ?.1     2.3     2.2
Rock Point                    1.6     1.4     1.6
Rondeau                       2.4     3.0     2.2
Selkirk                       2.0     1.7     2.0
Turkey Point                  1.8     1.7     1.8
Wheatley                      1.8     2.2     2.5
Craigleith                    1.9     1.5     2.0
Inverhuron                    2.4     2.8     3.7
Point Farms                   2.1     2.4     2.4
Sauble Falls                  2.3     1.7     2.3
Bass Lake                     1.9     2.2     1.9
Devil's Glen                  1.5      ?      1.3
Earl Rowe                     2.1     2.1     2.1
Mara                          1.6     1.9     2.0
Sibbald Point                 2.8     2.?     2.?
Six Mile Lake                 1.9     1.7     2.0
Average                       2.4     2.3     2.4
% Average Annual Change for South = 0.0%

Some Northern Ontario        Average Length of Stay
Provincial Parks             in Number of Days
                             1968    1969    1970
Arrowhead                     1.7     2.2     2.5
Grundy Lake                   2.0     3.0     3.0
Killbear Point                3.2     4.7     5.2
Mikisew                       2.0      ?      3.8
Oastler Lake                  2.1     2.6     2.7
Restoule                      2.5     4.2     4.5
Sturgeon Bay                  2.0     3.2     3.3
Missinaibi Lake                ?      3.7     3.8
Wakami Lake                    -      5.1     5.3
Killarney                     4.0     2.3     3.9
Windy Lake                    3.1     1.8     3.0
Average                       2.5     3.3     3.7
% Average Change (1968 - 1969)  32%
% Average Change (1969 - 1970)  12%
% Average Annual Change for North = 22%

"Over-all Annual Average" = RIALCS = (0.0 + .22)/2 = .11 (11%)

(A "?" marks a digit that is not legible in this copy.)


However, given the equations that have been derived, a
variety of figures may be calculated. Table 4 (A & B)
presents the predicted 1971 total visitors and total visitor
days for Georgian Bay and French River Mouth. Formula A
refers to use predictions obtained from the use of g(A)D(i,j)
as defined for Equation 1. Formula B similarly refers to
results obtained from g(B)D(i,j). The total use figures
arrived at by summing up various columns in Table 4 (A & B)
provided a starting point for extrapolation to future use.

A constraint on the procedure described here should be
kept in mind. The use of the rates cited to this point
assumes that projections do not exceed the capacity of the
site being considered. If capacity is not a factor, the
application of Equations 8 and 9 results in the curves shown
in Figure 1. Originally there were four parts to Figure 1.
This was because it was deemed necessary to present results
based on the use of both Formula A and Formula B (the
different distance functions introduced earlier). In each
figure, a manager or planner was able to see the load that
will occur at the gate of a Park on the basis of number of
visitors and also the load that will be on the Park in terms
of day-use and number of camper days. The four curves were
used to communicate different information that (in total)
gives the planner and/or manager a better basis on which to
formulate his views on how the Park should be designed and
how it should operate. If only one of these curves were
presented to managers, as indicated earlier, it would be
impossible to recognize how a balance should be struck
between developing day-use areas and developing campgrounds.
FIGURE 1


GEORGIAN BAY (FORMULA "A")
TOTAL SEASON USE

[Figure: number of visitors or visitor days (00,000), as
indicated on curves, plotted by year from 1971 to 1981.
Legible curve labels: no. of visitor days (at average stay
of 2.5 days); no. of visitor days (at average stay of 3
days); no. of camper days at average stay of 2.5 days per
party; no. of visitors; no. of campers; no. of day users.]

TABLE 3


CHANGE IN DAY-USE
AND CHANGE IN CAMPER USE:
SUMMARY STATISTICS


        Total Season   % Change     Total Season   % Change
Year    Day-Use        in Day-use   Camper Use     in Camp Use
1960     5,692,578                      592,103
1961     5,352,811       -6.0           862,559       45.?
1962     6,757,867       26.2         1,063,127       23.?
1963     7,685,952        ?             840,491      -20.?
1964     8,230,937        6.6           916,281        9.?
1965     7,973,196       -3.1           902,472       -1.?
1966     8,796,884       10.3           994,787       10.?
1967     9,037,442        2.7         1,155,091       16.?
196?     9,889,297        9.4         1,300,639        ?
1970    10,640,726        ?           1,531,528       12.?

              Average     6.9             Average     11.3

RIDV = .069                           RICV = .113

(A "?" marks a digit that is not legible in this copy.)

DISCUSSION


Understanding what the modified Cheung model implies
about attractiveness is important in evaluating whether the
model proposed should or should not be used. To test whether
the model did appear to be appropriate to the Ontario
situation, use figures were generated for several Ontario
Parks and these were compared with the use figures actually
observed at these Parks. This gave a picture of whether the
model is appropriate in terms of showing that the Parks in
Ontario appear to follow roughly the model established based
on Saskatchewan data.

One should note that the Ontario Parks that were
considered and compared with the proposed Parks were not
operating at capacity. When one examines Figure 1, one sees
a drastic increase in the use of the proposed sites being
predicted. Now, obviously the parks should not be opened
with their 1981 capacity if they are to be opened in 1971 or
1972. Rather, an approach should be taken that opens a
certain amount of capacity to see if the use projections
appear to be following the appropriate trajectory.

One point that cannot be stressed enough is that all of
the estimates made in this article are subject to a great
deal of error. Park planners and designers should not take
the figures as gospel truth but take them as (at best) a
plus or minus 50% estimate. Given this fact, an approach to
planning called Discreet Incrementalism appears to be
appropriate. Under this procedure a development plan is
established but, at the same time, an evaluation program is
set up in parallel so that the development plan can be
modified if use estimates are not following the plan
trajectory or if, in some other way, the park is not
operating as it was planned that it would operate.

Certainly, when planning for a park is done based on
the kind of computations illustrated in this note, it is
critical to set up a use monitoring plan so that, based on
revised use estimates when the park actually opens, one sets
up a scheme for managers and planners that will allow them
to know if the park is having the use levels expected. If
there were other objectives for the park (such as regional
development etc.), information about the opening of
enterprises, data on total sales etc. should be accumulated
so that the goals for the park do not become lost in a
procedure that sees an original intent handed over for
implementation to people who may not know what that intent
was and therefore may not be in a position to understand
what is not happening that should be happening. This
particular theme is taken up in more detail in CORD Study TN
40.

The capacity of sites presents special problems to
modelling efforts. Examination of Ontario's data for
Killbear and other sites (Table 6) suggests the guidelines
given in Table 7. The figures in Table 7 assume a ratio of 6
to 1 day-use visitors to campers.

Table 7 provides guidelines that should be considered
reasonable capacity relations for use that tends to be at
least 10% below design capacity. If a park runs at capacity,
traffic considerations, crowding and other factors may affect
use. It must be emphasized that the number of sites given in
Table 7 should not be regarded as a design guideline. Care
must be taken to ensure that "natural" and other biological
balances are not upset by the creation of park facilities.


TABLE 6


SOME FIGURES RELEVANT TO CAPACITY
(¹Per Season or Season Average)


July - August 1970

                  % Occupancy *      Number of
Name of Park      No. Campsites¹     Campers¹       Ratio(1)¹
Oastler                181            19,424          107.3
Arrowhead              124            14,279          115.2
Sturgeon Bay            84             7,682           91.5
Killarney              102             7,559           74.1
Grundy Lake            373            35,656           95.6
Restoule               108             6,620           61.3
Mikisew                136            10,166           74.8
Killbear Point         753            42,700           56.7

                  No. of Day-Use     Number of
Name of Park      Picnic Tables      Day-Users¹     Ratio(2)¹
Oastler                165           110,698          670.9
Arrowhead               98            67,644          690.2
Sturgeon Bay            **            67,109
Killarney               50            62,762        1,255.2
Grundy Lake            552           150,564          272.8
Restoule                **            31,063
Mikisew                 90            54,371          604.1


(1) (Number of Campers) / (% (Occupancy) (Number Campsites))

(2) (Number of Day-Users) / (Number of Day-Use Picnic Tables)

** Unknown.

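
Ratio (2) of Table 6 is the number of day-users divided by the number of day-use picnic tables, and it can be recomputed from the table. The sketch below uses the table entries as read from a damaged copy, so the individual figures (and the variable names) should be treated as assumptions; parks whose number of picnic tables is unknown are omitted.

```python
# (day-use picnic tables, number of day-users) per park, July - August
# 1970, as read from Table 6. Damaged digits were resolved by assumption.
day_use_figures = {
    "Oastler": (165, 110698),
    "Arrowhead": (98, 67644),
    "Killarney": (50, 62762),
    "Grundy Lake": (552, 150564),
    "Mikisew": (90, 54371),
}

# Ratio (2): day-users per day-use picnic table, to one decimal place.
day_users_per_table = {
    park: round(users / tables, 1)
    for park, (tables, users) in day_use_figures.items()
}

for park, ratio in day_users_per_table.items():
    print(f"{park:12s} {ratio:8.1f}")
```

The recomputed values (670.9, 690.2, 1,255.2, 272.8 and 604.1) match the Ratio(2) column, which illustrates how widely use per table varies between parks.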

TABLE 7


CAPACITY CONSIDERATIONS


Total          #Campsites       #Campsites
Season         Assuming         Assuming
Visitation     4.5 Days         3 Days           #Day-Use
               Average Stay     Average Stay     "Sites"
500,000          1,800            1,200            1,250
400,000          1,500            1,000            1,000
300,000          1,250              830              750
250,000          1,000              667              625
200,000            750              500              500
100,000              ?              225              250


* The above assumes a ratio of 6/1 of day-use visitors to
  campers. The figures for day-use sites are not easily
  interpreted, but reflect "turn-over" and the fact that
  some day-users do not use "developed" sites or "double
  up". (A "?" marks an entry that is not legible in this
  copy.)


Given Killbear and other parks as providing back-up
capacity for the proposed sites, one should expect to reach
a balance between expected use and the capacity of the site
by accepting the fact that biological and social
considerations should define a capacity which should not be
exceeded. If use predictions exceed the design capacity, and
if the park is to be a stable ecological entity, then use
must be redistributed to conform to capacity.


CONCLUSION 


This paper has traced the steps that have resulted in
the estimation of various park use figures and suggestions
that planners and policy makers have found useful. Data were
supplied to planners to help ensure that the "best" answer
to a planning problem that could be derived at the time was
arrived at with full knowledge of the problems facing the
researchers as they attempted to apply the model discussed
in this paper.

It is primarily in the spirit of seeking reaction to
the research philosophy presented that this paper was
written. The greatest value that may come from it is
critical and constructive reaction that responds
to what figures policy makers really need. It is certainly
easy to attack the paper on the basis that (1) a
Saskatchewan day-use model was applied to Ontario, (2)
adequate consideration was not given to the breakdown
between weekend and weekday use, and (3) proper
consideration was not given to the effect of the "baby boom"
and other demographic forces on the growth of park use in
Ontario. To focus on these problems, however, is to miss the
more important points and perspectives which are contained
in this study. Planners, managers and policy makers often
phrase questions regarding development options in casual,
imprecise forms that vary with each proposed development.
More reliable, valid and useful data could be obtained if
the data collection and analysis of various use figures were
standardized with real needs in mind. The authors believe
that it is the responsibility of the researcher to comment
on the practical significance of the numbers he arrives at
and to guide policy makers and planners to ask for the right
information: the researcher should not just react directly
to requests but should also understand policy and planning
well enough to guide planners and managers away from
information they should not use, as well as guide them
towards information that they should use (for reasons that
follow logically from objectives).


TN 7 CHAPTER II page 75


CORD STUDY TECHNICAL NOTE 18 
A METHOD FOR PREDICTING ENROUTE OVERNIGHT PARK USE 


H. K. Cheung, S. Smith, J. Beaman


ABSTRACT 


In this paper a regression model is presented for
predicting overnight use at a park where campers are of the
enroute type. These are campers who are enroute to
somewhere other than where they stop.

In the multiplicative model developed, major arterial
highways that are close to parks being studied were
considered to be "sources of visitors". The level of traffic
flow was used to define the "origin population". Two other
explanatory variables were employed in the model: the
number of campsites and the road distance between highway
and park. In planning for a new park or changing an existing
one, these latter variables are suggested to be ones which,
in selecting and developing a park, can be manipulated by a
park planner to influence the amount of use a park will
receive.

When the model parameters were estimated using linear
regression (after a logarithmic transformation of the
multiplicative model) all regression coefficients were found
to be significant at the five percent probability level
according to the F-test and all had the expected signs. The
root mean square percentage error in predictions for
individual sites obtained was 22.59 percent, indicating
that, at least according to this measure, the fit was good.
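
The two ideas in this abstract, a multiplicative model made linear by taking logarithms and a root mean square percentage error, can be illustrated briefly. The sketch below is hypothetical throughout: the power-law form, the parameter names (k, a, b, c), the sample values and the exact definition of the error measure are assumptions, since the abstract states only that the model is multiplicative in traffic flow, number of campsites and road distance.

```python
import math


def enroute_use(k, traffic, campsites, distance, a, b, c):
    """Assumed multiplicative form:
    use = k * traffic**a * campsites**b * distance**c."""
    return k * traffic ** a * campsites ** b * distance ** c


def log_enroute_use(k, traffic, campsites, distance, a, b, c):
    """The same model after a logarithmic transformation; it is
    linear in a, b and c, so linear regression can estimate them."""
    return (math.log(k) + a * math.log(traffic)
            + b * math.log(campsites) + c * math.log(distance))


def rms_percentage_error(observed, predicted):
    """Root mean square of the percentage errors (one common
    definition; the note's own definition is not given here)."""
    pct = [100.0 * (p - o) / o for o, p in zip(observed, predicted)]
    return math.sqrt(sum(e * e for e in pct) / len(pct))


# Hypothetical parameters and site characteristics.
args = (2.0, 1000.0, 50.0, 10.0, 0.8, 0.5, -0.3)
print(math.isclose(math.log(enroute_use(*args)), log_enroute_use(*args)))
print(rms_percentage_error([100.0, 200.0], [110.0, 180.0]))
```

A negative exponent on distance, as in the hypothetical values above, would correspond to use falling as the park lies farther from the highway.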


PURPOSE 


The purpose of this note is to report on research that
has resulted from attempts to derive a predictive equation
for enroute camping use in Northern Ontario. The equation
derived is especially relevant to park planning because some
of the explanatory variables used in its derivation are
planning variables in the sense that a park planner may be
able to manipulate them to influence the volume of use that
a park receives.


INTRODUCTION 


Generally speaking, park users can be classified into
four basic types: (1) main destination day-users, (2)
enroute day-users, (3) main destination overnight users and
(4) enroute overnight users (for further elaboration of
these classes see TN 8 and 30). An enroute overnight user,
which is the one of concern here, is a camper who stays at a
park for one night and then moves on.

The usefulness of disaggregating park users has been
pointed out by Pankey and Johnston (see Reference 27). For
example, in a reservoir study they found that the root mean
square (RMS) error of a linear equation for predicting total
use (day and overnight) was 114. This was reduced when total
use was broken up into day-use (RMS = 73) and overnight use
(RMS = 80). They concluded that greater precision could be
obtained when a researcher predicted day and overnight use
separately and then summed them to obtain total use.

The point of pursuing the visitor classification issues
raised above is that the model described in this paper (and
applied herein to twenty provincial parks) is the outgrowth
of one developed to forecast total use of Pukaskwa National
Park. Pukaskwa is situated in a relatively isolated region
in Northern Ontario (see Figure 1), about 192 road miles
from Sault Ste. Marie and 220 miles from Thunder Bay. Each
of these two distances includes a 12-mile link of paved road
connecting the park to the Trans-Canada Highway. The park's
location suggests that it would not generally be used as a
main destination as is the case with many other Canadian
parks. Indeed, an examination of the Ontario Camper
Statistics (1971 Ontario Provincial Parks Statistical
Report) revealed that most of the provincial parks in
Northern Ontario had more enroute campers than main
destination campers. Clearly, Pukaskwa's visitors would
include many of those who travel the Trans-Canada Highway
and who, while on their trip, would deviate, say, 10 to 30
miles to the park as a convenient place to stop over.

By way of further introduction, the reader may wish to
note briefly how the model presented here evolved. The
original request by planners for assistance in developing a
plan resulted in an early attempt to define a model.
Planners were asked about their development ideas so that it
would be possible to decide what types of users the
researchers should consider. When it was recognized that the
park would be established with the main stopping area
approximately 25 miles from a major road, it became clear
that it would be necessary to concentrate on enroute camping
use estimates. It was not considered likely that people
would merely drive into the main stopping area of the park
to say they had seen the park when it would take them 50
miles out of their way. The researchers and planners also
noted that there were no large population centres within 200
miles of the park. The primary source of visitors would be
the traffic on the Trans-Canada Highway.

An important concern of the planners and park managers 
was the percentage of visitors who would be looking just for 
overnight accommodations as opposed to those who would be 
looking for a wilderness experience, which was the main 
purpose for establishing the park.

These users who (in terms of TN 23) made only partly
desirable use of the park were important for two reasons:
(1) the park has an important but secondary objective of
regional development, and (2) it is these people for whom
there was a need to plan because, in line with not
preserving the park for a few hardy wilderness users, it
becomes necessary to provide the space and facilities that
will encourage some enroute visitors to further investigate
how they can make use of National Parks (the education and
interpretation objective). There are some data available
from other parks which are in similar situations, so it
would have been possible to elaborate the model. However,
the researchers felt that it was adequate to pursue this
issue of the number of interior wilderness use sites
intuitively. The most important concern was to make sure
that there would be sufficient capacity in the park so that
the number of sites for the long-term visitors in the
interior of the park could be expanded to meet demand as the
use of the park increased. An initial capacity that seemed
reasonably adequate to accommodate this type of user when
the park was first open was already proposed in the park's
draft master plan. So there seemed little merit in spending
additional time and money to further define the need for
wilderness sites.


THEORETICAL CONSIDERATIONS 


Given a specific interest in developing an
enroute camper model, a conceptual problem of model
formulation is one of deciding the key variables affecting
people's choice to stop over at a given park. A variable
that is obviously important is the distance which people
have to travel from their main route to reach the stopover
park. How many people will deviate more than 15 miles from
their main route if they do not intend to stop for any
length of time at a given park but are, in fact, heading for
some distant main destination? How many people will go even
five miles out of their way on a gravel or poorly
conditioned road? Clearly, both the distance and the
condition of road should influence greatly one's decision to
make an enroute visit to a park.

So it is reasonable to suggest that a major highway at
a certain distance from a park serves as a generator of
visitors in much the same way that a city at a given
distance from a park serves as a generator of visitors (see
TN SCO and Reference 28). The higher the weekday, weekend
(etc., depending on circumstances) annual average summer
traffic volume, the more visitors one may expect to leave
the highway and make an enroute visit to a given park. But
there will be a certain volume of traffic that is commercial
or business travel that will not stop at a park. Here it is
assumed that the portion of potential park visitors in that
flow of traffic is available from traffic partition counts.
To avoid unnecessarily awkward descriptions of variables,
one may refer to the traffic on a highway and mean it to be
an estimate of the volume of the total traffic that is
potential visitors.

Another variable considered to be important in
influencing the decision to make an enroute visit to a park
is the number of campsites the park has. Almost certainly,
the probability of people deviating from the main route of
their trip will depend on the probability of finding
accommodation at the stopover park. It is reasonable to
suggest that the chance that a site is available is higher
at a park with a large number of campsites than at a park
with a small number of campsites.

Cheung (see TN 1) has suggested that the facilities at
a park influence its attractiveness, and that an
attractiveness index based on the quantity and quality of
the facilities could be used as a variable to predict park
use. Both he and Beaman (TN 9) have suggested that the
existence of other parks around a given park affects use
either by modifying attractiveness or in some other way (see
TN 11 and 33). One may consider the features at a park and
alternative recreation opportunities as important variables
affecting park attendance.

Given all of the preceding, enroute camping use of a
park is hypothesized to be a function of highway traffic
counts, the number of campsites of a park under study, and
the road distance between this park and one or more
highways. Variables such as road condition, park
attractiveness and alternative parks were excluded in this
analysis because of a lack of data. However, these variables
are not expected to affect the R2 values substantially
because (1) most of the parks studied have gravel roads
connecting them to a major highway; (2) the specific concern
in developing an enroute camper model was with an isolated
site so alternative parks have negligible influence on park
use; and (3) enroute campers perhaps have little concern for
anything beyond having an acceptable campsite relatively
easily accessible, with certain amenities such as drinking
water and flush toilets. Regarding (3), the same basic
amenities are provided in all the sites considered because
they conform to Ontario Provincial Park Standards.


DATA 


Data used in defining the dependent variable, the
number of enroute campers visiting a park during a season,
were drawn from the 1971 Ontario Camper Statistics (OCS)
compiled by the Ontario Ministry of Natural Resources (see
Reference 26). The OCS included information on type of
campers, average party size, and length of stay. Since most
of the parks selected for this study had more enroute use
than main destination use, these numbers of campers were
regarded, for practical purposes, as if they were all
enroute campers.

Traffic count data used as the homolog of origin
population were the annual average daily traffic (see
Reference 24). Information on the number of campsites in a
park was obtained from the 1971 Ontario Camper Survey. Road
distances between major highways and parks were obtained
from an Ontario highway map.


METHOD


Twenty provincial parks in northern Ontario were used
in the analysis. They are located near major arterial
highways and provide similar facilities such as drinking
water, flush toilets and fireplaces. Fifteen sites were
actually used in developing the model; the remaining five
were used for model validation.


The functional form of the model chosen for analysis of
the use of the fifteen parks was:


(1)  V(j) = b(1) (T(j)**b(2)) (S(j)**b(3)) exp(b(4)D(j)) u(j)


WHERE

V(j) = the dependent variable, which is the observed number
       of "enroute" campers visiting park j during a
       season,

T(j) = annual average daily traffic count on a section of
       the arterial highway between two intersections
       leading to park j,

S(j) = the number of developed campsites in park j,

D(j) = the shortest road distance from the arterial highway
       to the entrance of park j,

u(j) = an error term, and

b(i) = parameters, i = 1, ..., 4.


The multiplicative model defined by Equation 1 was
changed into a linear one by using a logarithmic
transformation that allowed the use of linear regression
analysis to estimate the parameters of the equation. The
estimated equation was:


(2)  ln(V(j)) = 3.22 + 0.37 lnT(j) + 0.80 lnS(j) - 0.039D(j)
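

The log-linear estimation step can be sketched as below. This
is a minimal illustration, not the original computation: the
function names are hypothetical, the park data are synthetic
(the source does not list T, S and D for each park), and the
error term u(j) is set to 1 so that the fit simply recovers
the coefficients of Equation 2 from data generated with them.

```python
import numpy as np

def predict_enroute_campers(T, S, D, b):
    """Equation 1 without the error term: b1 * T**b2 * S**b3 * exp(b4*D)."""
    b1, b2, b3, b4 = b
    T, S, D = (np.asarray(a, dtype=float) for a in (T, S, D))
    return b1 * T**b2 * S**b3 * np.exp(b4 * D)

def fit_enroute_model(V, T, S, D):
    """Ordinary least squares on ln V = ln b1 + b2 ln T + b3 ln S + b4 D."""
    V, T, S, D = (np.asarray(a, dtype=float) for a in (V, T, S, D))
    X = np.column_stack([np.ones_like(T), np.log(T), np.log(S), D])
    coef, *_ = np.linalg.lstsq(X, np.log(V), rcond=None)
    # Return b1 on the original (untransformed) scale.
    return np.exp(coef[0]), coef[1], coef[2], coef[3]

# Synthetic inputs for 15 hypothetical parks (illustration only).
rng = np.random.default_rng(0)
T = rng.uniform(500, 5000, size=15)   # annual average daily traffic
S = rng.uniform(20, 300, size=15)     # developed campsites
D = rng.uniform(0, 12, size=15)       # miles from arterial highway

# Coefficients as printed in Equation 2 (intercept is ln b1 = 3.22).
true_b = (np.exp(3.22), 0.37, 0.80, -0.039)
V = predict_enroute_campers(T, S, D, true_b)

fitted_b = fit_enroute_model(V, T, S, D)
print([round(float(x), 3) for x in fitted_b])
```

With real data the error term would not vanish, and the fitted
values would carry the standard errors reported in the results
that follow.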


RESULTS 


The R2, standard error of estimate and the F-value for
Equation 2 are 0.89, 0.27, and 31.17 respectively. The
equation derived is significant at the .001 level. The
regression coefficients, their standard errors, and the
increase in R2 value at each step of the stepwise regression
analysis are presented in Table 1. The coefficients of the
model are significant at the five percent probability level
according to the F-test, and all have the expected signs.

TABLE 1


SUMMARY OF THE VARIABLES ENTERED INTO THE REGRESSION
EQUATION BY STEP


                            Regr.    Standard          Increase
Step  Variable              Coeff.   Error      R2     in R2

 1    Number of developed    0.80     0.14      0.70    0.70
      campsites
 2    Traffic count on       0.37     0.14      0.83    0.13
      arterial highway
 3    Distance from         -0.03     0.01      0.89    0.06
      arterial highway
      to park


It is sometimes the case that the relative importance
of the explanatory variables is suggested by the order in
which they enter in a stepwise regression. Also there is
some merit in discussing the increase in the amount of
variance in lnV(j) explained when each new variable is
added. From the conventional perspective taken in
interpreting stepwise regression results, the number of
developed campsites, a variable reflecting the capacity of a
park, is the most influential variable of the three
explanatory variables. The coefficient of the variable is
0.8, implying that a 10 percent increase in the number of
campsites of a given park will result in an eight percent
increase in use, ceteris paribus. Essentially, what the
coefficient means is that attendance increases as the number
of campsites increases, but at a decreasing rate (the
problem with this interpretation is commented on
subsequently).

The fact that the coefficient of the variable "traffic
counts" (0.37) is much less than unity may be taken to imply
that use does not increase indefinitely as traffic volume
increases.
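
The readings of the coefficients given above can be checked
numerically. The sketch below uses the exponents printed in
Equation 2 and a hypothetical helper, use_ratio, that returns
the factor by which predicted use changes when one input
changes and the others are held fixed.

```python
import math

# Exponents as printed in Equation 2.
B_TRAFFIC = 0.37
B_CAMPSITES = 0.80
B_DISTANCE = -0.039   # per mile, enters through exp()

def use_ratio(traffic_factor=1.0, campsite_factor=1.0, extra_miles=0.0):
    """Ratio of new to old predicted use, other inputs held constant."""
    return (traffic_factor ** B_TRAFFIC
            * campsite_factor ** B_CAMPSITES
            * math.exp(B_DISTANCE * extra_miles))

# 10 percent more campsites: roughly the eight percent quoted above.
pct_campsites = (use_ratio(campsite_factor=1.10) - 1.0) * 100.0

# Doubling traffic: use rises by far less than 100 percent.
pct_traffic = (use_ratio(traffic_factor=2.0) - 1.0) * 100.0

# One additional mile between the highway and the park entrance.
pct_mile = (use_ratio(extra_miles=1.0) - 1.0) * 100.0

print(f"{pct_campsites:.2f}  {pct_traffic:.2f}  {pct_mile:.2f}")
```

Because both exponents are below one, predicted use grows less
than proportionally with campsites and with traffic.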

The distance variable entered the equation last and
explained six percent of the variation in the transformed
dependent variable. The relatively small explanatory power
of distance as a predictor variable in this model was
expected because the average distance from an arterial
highway to the entrance of one of the 15 parks used to
develop the equation is only about four miles.


DISCUSSION


Measuring how well a regression explains data involves
a number of complex issues (see TN 19 and 35) but certainly
a measure such as % RMSE that gives an idea of the average
percent error in estimates is a useful guide. Here the
percent root mean square error (% RMSE) is defined as:


% RMSE = [(1/n) Σ (% error in prediction at park j)**2]**1/2


WHERE

                (Predicted Number of     (Observed Number of
% error in       Campers at Park j)   -   Campers at Park j)
prediction   =  --------------------------------------------- * 100
at park j                  (Observed Number of
                            Campers at Park j)


One may note that the percent root mean square error is
also called the standard deviation of the percent error of
prediction (see Reference 17).
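
The definition can be exercised on the percent errors listed in
Tables 2 and 3 of this note. The sketch below is illustrative:
the function names are hypothetical, and two Table 3 entries
that are illegible in the printed copy are taken as -2.15 and
7.32, the values implied by the observed and predicted counts
in the same rows.

```python
import math

def percent_error(predicted, observed):
    """Percent error in prediction at one park."""
    return (predicted - observed) / observed * 100.0

def percent_rmse(percent_errors):
    """Square root of the mean of the squared percent errors."""
    n = len(percent_errors)
    return math.sqrt(sum(e * e for e in percent_errors) / n)

# Error column of Table 2 (the 15 parks used to derive the model).
table2_errors = [0.08, 0.72, -7.88, 26.82, 10.02, 31.08, 36.63, -7.63,
                 36.06, -8.71, -28.68, -6.99, -26.96, -5.54, -37.42]

# Error column of Table 3 (the 5 validation parks); see the note
# above about the two entries read as -2.15 and 7.32.
table3_errors = [11.32, -7.43, -2.15, 25.61, 7.32]

# Single-park check using the Aaron row of Table 2.
aaron_error = percent_error(predicted=14205, observed=10837)

print(f"Aaron: {aaron_error:.2f}%")
print(f"Table 2 % RMSE: {percent_rmse(table2_errors):.2f}")
print(f"Table 3 % RMSE: {percent_rmse(table3_errors):.2f}")
```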

The 22.59 percent root mean square error (% RMSE)
obtained indicates that the fit was relatively good. (On
expected error and structural adequacy of models see TN 19
and 35.) The components of % RMSE, the percent error of
prediction for the individual parks, varied from -5.54 to
36.63 (see Table 2). Some of these large errors may be
caused by the absence of variables accounting for the
effects of alternative parks and park attractiveness. In
particular, the over-prediction of total season camping at
Aaron may have stemmed from the failure to weigh the
importance of the three competing parks lying close to it,
namely, Sandbar Lake to the east, Blue Lake to the west, and
Ojibway to the north. On the other hand, the underprediction
for Neys and Sandbar may have arisen from ignoring the fact
that the former is the only park among those studied that
has boats for hire, and that the latter contains an
excellent fishing area. The situation in the latter case is
further complicated by the failure to recognize that not all
enroute campers are the same in their preferences for a
campground. Some may be strongly oriented towards some
activity and thus perceive a different attractiveness for a
park than those who are just looking for a place to sleep. A
site with something "special" to offer can be expected to
attract either some significant use as a main destination or
use from activity-oriented enroute campers as well as use
from the more ordinary enroute camper.

Thus far the model has been evaluated in terms of the
plausibility and significance of the regression coefficients
and the accuracy with which it can be used to make
predictions for fifteen provincial parks. To further assess
the predictive utility of the model, it was applied to the
five parks that had originally been set aside for
validation.


TABLE 2


OBSERVED AND PREDICTED VALUES AND PERCENT ERRORS
OF PREDICTION FOR THE PARKS USED
IN DERIVING THE VISITATION MODEL


                    Observed 1971   Predicted 1971   Error
Park                Total Campers   Total Campers    (%)

Lake of the Woods       5,695           5,700         0.08
Ivanhoe Lake           12,688          12,780         0.72
Pancake Bay            38,621          35,579        -7.88
Obatanga               13,468          17,080        26.82
White Lake             25,316          27,853        10.02
Aaron                  10,837          14,205        31.08
Greenwater              4,949           6,762        36.63
Klotz Lake              3,688           3,406        -7.63
Blacksand               5,104           6,945        36.06
MacLeod                 7,931           7,240        -8.71
Neys                   21,917          15,630       -28.68
Rainbow Falls          30,441          28,314        -6.99
Sandbar                 9,093           6,641       -26.96
Sioux Narrows           5,821           5,499        -5.54
Inwood                 18,072          11,310       -37.42

Root mean square error = 22.59%

The results presented in Table 3 are further evidence that 
the model can be used for predicting levels of enroute use 
at predicted park sites.


CONCLUSIONS 


The conceptual framework proposed


TABLE 3


OBSERVED AND PREDICTED VALUES AND PERCENT ERRORS OF
PREDICTION FOR THE PARKS USED IN TESTING THE
PREDICTIVE ABILITY OF THE VISITATION MODEL


                    Observed 1971   Predicted 1971   Error
Park                Total Campers   Total Campers    (%)

Lake Superior          42,020          46,777        11.32
Wakami                  4,183           3,872        -7.43
Quetico                15,364          15,034        -2.15
Kettle Lake            13,994          17,578        25.61
Nagagamisis             4,529           4,860         7.32

Root mean square error = 13.40%

of a park, in terms of camping use, is known. If one assumes
that the "physical capacity" of a site is measured in terms
of campsites that can be developed (for further discussion
of carrying capacity in this context see TN 16 and Reference
22), one can use Equation 1 to determine the user-days of
use to be expected to see if they can be developed without
exceeding the social carrying or biological capacity of the
site based on the level of management expenditure to be
devoted to maintaining the site in a "sustained yield"
condition (see TN 16).

Nevertheless, the use of the number of campsites as an
explanatory variable in a regression equation causes concern
for some outdoor recreation researchers. Their legitimate
concern is related to what is called a "circularity
problem". In essence, they argue that more campsites cause
more use, which results in more campsites being developed
(see Figure 6 in TN 40). It is impossible, they feel, to
resolve this "chicken-egg" cycle in a way that justifies
using the equation in the way described above. The author
argues that the correlations which, when studied using
regression analysis, give the equation determined should not
be read to imply a cause and effect relationship. It may be
found that for established campgrounds there is an
equilibrium condition reached which puts an end to
"circularity". Here, because of the need to reach some
conclusion, the equation is treated as defining some kind of
an equilibrium condition that "would eventually be reached",
not as indicating anything about the dynamics of how the
equilibrium would be reached.

Other independent studies have also shown the
importance of the number of campsites on the attractiveness
of parks. Cesario (TN 4) found that the number of campsites
in a park was the major component of attractiveness. Should
a researcher be concerned about using Cesario attractivities
because using them in some way implies that the number of
campsites is used as an independent variable explaining camp
use? Perhaps, but if the Cesario attractivities do not seem
to be useful for park use predictions, what will be? The
point is that there may be some problems with the model
developed but, on the whole, there are many positive
features. It is simple and requires relatively simple data
to yield quite accurate estimates for other sites not used
during the model's development. With appropriate projections
of the highway traffic variable, and given accurate values
of the other two variables, predictions can be made and
there is good reason to think that they will be reasonably
accurate. Clearly, the use of the model for prediction
assumes that the parameters are accurately estimated, that
they will not change appreciably during the prediction
period, and that the effects of alternative parks and park
attractiveness have been correctly considered (see TN 237).


CORD STUDY TECHNICAL NOTE 30


A WORK PLAN FOR 
THE DEVELOPMENT OF A MATHEMATICAL MODEL TO PREDICT 
AND EXPLAIN OVERNIGHT USE OF PARKS 


J. Beaman, S. Smith, and H.K. Cheung


(based upon work by
H.A. Macdonald, A.H. Netherton, and F.J. Cesario)


ABSTRACT 


This paper is about developing an overnight use model
by using data collected in park visitor surveys. The
mathematical models presented are ultimately meant to assist
park planners to predict the number of visitors who will
stay one night or more at new or modified parks.

In the report, which is based on a proposal for a
study, such items in the original report as a list of tasks
and a work plan flow chart are omitted. Only modelling
considerations for actually developing overnight camping
models are kept. As well, the material from the report "A
Work Plan for the Development of a Mathematical Model to
Predict and Explain Overnight Use of Parks", by MacDonald,
Netherton and Cesario has been augmented by Smith and Beaman
to focus attention on matters which they considered need
more discussion, since the purpose of the new report was not
the same as that of the original report.

Incidentally, examination of the Saskatchewan CORD
Study data and other potential data showed that these data
were not good enough to merit the use of the kinds of models
in the "Work Plan". This is the reason the "Work Plan" was
never followed.


PURPOSE 


The purpose of this paper is to present considerations
for the development of an "overnight use" model to estimate
park visitor flows. Data requirements and operational forms
of the model will be discussed. The model will be developed
in the specific context of camping.


INTRODUCTION 

The first model developed by the Canadian Outdoor
Recreation Demand Study was Cheung's main-destination
day-use model (see TN 1). Cheung recognized that if a single
equation was to be used to predict behaviour, that behaviour
had to be fairly homogeneous (see Reference 27). The model
also had to apply to specific facility types offering the
same recreation service. Initially a simple day-use, single
destination model seemed to offer the greatest chance of
modelling success. This was the form of Cheung's final
model.

Much outdoor recreation behaviour, however, involves
overnight stays; these activities (as opposed to day-use)
tend to involve more travel and have greater environmental
impact and economic ramifications. Thus it was desirable to
researchers, planners and administrators that an
overnight-use model be developed.


THEORETICAL CONSIDERATIONS 


Overnight users of parks may be divided into several
categories. Two of the more common classification schemes
include: (1) type of facility used (tent, trailer, pick-up
camper, motor home, etc.) and (2) travel characteristics
(e.g. weekend). This latter classification scheme - travel
characteristics - is probably the more useful for park
planning purposes. Frequently a park can handle a mix of
camper vehicles and shelters without any significant
maintenance adjustments. For planning purposes, the
similarities between various campers grouped according to
equipment styles are much greater than their differences.
Important planning differences, however, do arise when one
compares a transient camper, for example, with one staying
at a site for a week.

This article suggests that, in general, campers can be 
divided into four rather diffuse groups: 


(1) transient or enroute
(2) commuter
(3) destination
(4) quasi-second home


(For further comments on camper classification see
References 4, 14, 21.)


Transient or Enroute Campers


This type of camper is on a relatively long trip and
uses a campground on a single-night basis. His travel
pattern is one of "hopping" from one campground to another.
However, two different types of enroute campers can be
identified. One considers enroute campsites as rest stops on
a trip between his origin and some destination point or
region. For example, a camper from Ontario heading for
Banff National Park might consider stops on the Prairies
primarily as rest stops. In this situation the Prairies
would be viewed as an obstacle to be overcome as quickly as
possible. Enroute stops would be chosen that were as close
as possible to the fastest route from his origin to his
destination.

Alternatively, an enroute camper might consider certain
areas in the Prairies as attractive sub-destinations and
would be willing to take a more indirect route in order to
pass through or stop at these areas. Different models would
be required to predict the behaviour of these two types of
enroute overnight campers. In the discussion that follows,
only the former will receive attention, since he is probably
the more common type of enroute camper and is certainly the
easier to model. This type of enroute camper can be expected
to look for a campground that is readily accessible to major
through-routes and will accept minimal facility development
(hook-ups for his equipment, a small store offering items
such as flashlight batteries, postcards, aspirins, and a
pool or small pond). Since enroute campers in a campground
may be from any section of Canada or the United States, the
"source of visitors" for modelling purposes is the adjacent
highway(s), and not one particular centre of population. The
relevant costs of enroute camping are the distance between
the adjacent highway and the park and the charge for the
overnight use of the campground. There are, of course,
roadside campgrounds which cater exclusively to the
transient or enroute camper. But many other parks,
especially many provincial and state parks, also serve this
function.

A model incorporating primarily traffic volume on major
highways adjacent to these parks and the number of
alternative parks should be adequate to estimate the number
of visitors to a roadside park (see TN No. 18). In the more
complex case (e.g. provincial parks), total overnight use
estimates require inputs from two or three use-equations for
the other camper types. (See TN 40, Figures 4, 5, and 6.)


Commuter Campers 


The commuter camper utilizes a park near his home and
place of work. Typically, the family will stay at a park for
several days while the wage-earner commutes daily to his
place of employment. The average length of stay for this
type of park use is two to four nights. The commuter camper
looks for a facility that, in addition to being close to his
workplace, offers an attractive environment with some
development such as swimming and fishing facilities. A
"general store" is not always necessary, since the family
can continue to follow its normal shopping patterns and
replenish its supplies from home.


Destination Campers


Destination campers are probably the most diverse of
the groups considered here. One individual may spend several
days of his vacation as an enroute camper and then, upon
arrival in a major tourist area, become a "destination
camper" for several days. When he returns home, he may
become an enroute camper again. If the destination
campground offers attractions such as a body of water,
trails or beautiful scenery and a developed camp
infrastructure (laundry, grocery store, other camper
services) the camper might possibly stay a week or more.

Another type of destination camper may have no single
destination point in mind. Instead he considers a region
such as Nova Scotia and New Brunswick to be his destination,
within which he plans a tour. In this case his overnight
use of any one campground will be a function of not only
attractiveness of that particular site but probably more
importantly of the surrounding area and indeed of the whole
region being toured. (See the comments on the meanings of
attractiveness measures in TN 9 and on alternative-site
functions in TN 3.) In other words, use of a particular park
might depend more on regional characteristics than on the
characteristics of the park. Thus a planning model for this
kind of camper would have to predict the tour he chooses.
And, as if this isn't difficult enough, in many cases the
utility of a tour is not simply equal to the sum of the
utilities of its parts. The latter situation might arise
when a camper may wish a varied diet of activities and
places to see. The result is that one activity (such as a
day at the beach) may be more desirable (have greater
utility) if it follows a day's river fishing than it would
if the camper did nothing else but camped and lay on the
beach. Because of these complications, the proposed model of
overnight park use will be limited to destination campers
with a single main destination, rather than a set of
destinations with a variety of activities.

A second major distinction among destination
campers is the duration of their stays away from home or
work. The main effect of trip duration is likely to be that
distance between home and destination plays a more important
role as trip duration decreases. In other words, the length
of the trip becomes increasingly critical as a camper has
less time for the trip. No structural differences, however,
are assumed necessary to adapt the models to different trip
durations. But, without information on the break-down of
total trip durations for each origin and each destination,
it is difficult to predict the different trip durations.
Calibrating a single model for all destination campers
irrespective of trip duration is likely to produce
predictions biased toward the mean or median trip length. In
this regard Dice (see Reference 15) estimated that in
Michigan 75 percent of campers camp on 25 percent of the
days, primarily on weekends and public holidays. Such a high
proportion of 2-day or 3-day trip durations suggests that it
might not be too inappropriate, lacking more detailed data
on trip duration, to perform only one calibration of an
overnight destination model for Michigan using all overnight
destination data.

Destination campers are confronted with travel costs
proportional to distance, road tolls, extra meals on the
road, and park fees. Since distances are larger and play a
more important role here than for enroute campers (who may
have travelled just as far, but whose origin - for planning
purposes - is the highway), income and other socio-economic
data should be included in the destination camper travel
model. Other factors to include are, of course, the quantity
and quality of facilities at and around a destination, and a
measure of the effect of competing alternatives.


Quasi-Second Home Campers


This last group of campers is a growing one, and one
that is a fairly new phenomenon. The group is characterized
by the storage of their camping equipment at a site, usually
within a day's drive of home, for a season or for the year.
They travel to the site for weekends and extended vacations
throughout the year. Their strategy frees them (1) of having
to load up their equipment every time they wish to make a
trip, and (2) of the problem of where to store a motor home
or camper. Since this group is still quite small, many parks
do not permit year-round or seasonal storage, and their
criteria for choosing a particular site may differ
considerably from the other three groups. This group of
campers is not considered in this study.

The above discussion has given profiles for different
types of campers; the question can be raised as to whether
there are different types of campgrounds. To an unknown
extent, yes. Even someone not connected with the design,
development or operation of a campground would suspect that
the location and level of development of some facilities
reflects the character of the user. For example, a
campground located near a limited access highway interchange
with few trees, a small pond, a small store and no other
development (except for individual campsites) is designed to
serve the enroute traveller. A campground in a remote park
with excellent opportunity for backpacking, riding, fishing,
hunting, swimming, and nature study plus a complete camper
store can be expected to draw mostly destination campers.
Many parks can thus be classified according to the same
paradigm used for campers.

However, just as campers can exhibit a variety of
behaviours on one trip or throughout a camping season, so,
too, can campgrounds exhibit different characteristics.
Large, well-developed parks located near highways or urban
areas may attract a mix of destination, enroute and commuter
campers. The immediate implication is that, for some parks,
one type of travel model would not be adequate for
projection purposes. It is necessary, of course, that the
planner be able to accurately determine the character of the
park - from an on-site visit, visitor records, design
specifications and relative location. If this cannot be
done, or if (1) a group of parks with different traveller
characteristics or (2) one park with a variety of uses, is
being studied, then a more complex model with a series of
equations relating to the different camper types is
necessary. (See TN 40, Figures 4, 5 and 6.)

In the discussion that follows, destination and
commuter campers are treated as one group and enroute
campers as another. As will become clear below, it is felt
that the criteria used by the two groups for choosing a
campground are sufficiently different to necessitate two
structurally different models to predict their choice
behaviour. The same model can be used for destination and
commuter campers because both are choosing a destination by
similar criteria. However, different equations are developed
for commuter and destination campers because it is assumed
that the commuter camper puts more weight on distance than
either the weekend or longer-duration destination camper. In
other words, the difference in the importance of distance
requires separate calibration of the same model for each
group.


MODEL 


Two sets of equations, one for enroute campers and one
for main-destination campers, will be derived. The problem of
predicting visitor flow volumes is twofold. First, given the
number of potential camper parties in any population centre,
i, it is necessary to predict the actual number of camping
parties, N(i). Next, it is necessary to predict what
proportion of parties from origin i will make an overnight
stop at park j. This proportion is P(i,j)


WHERE Σ P(i,j) = 1 and the summation is over all parks j.


Given values for N(i) and P(i,j), V(i,j), the number of camper
parties from origin i visiting park j in a given time
period, is defined quite simply as:


(1) V(i,j) = N(i) P(i,j)


It is now necessary to obtain some methods for
calculating N(i) and P(i,j). Since the enroute camper model
involves only a small simplification of the main-destination
model, the following discussion will concentrate on the
main-destination model. Once the main-destination model has
been derived, the equivalent enroute model will be
formulated.


Participation Component 


In the case of main-destination campers, participation
is assumed to be a function of (1) the population and
socio-economic characteristics of a population centre and (2) the
accessibility and attractiveness of camping facilities.
Following the development in Cesario et al. (see Reference
16), a participation function for such park use is:
(2) N(i) = k S(i)**α P(i)**β [Σ A(j)/C(i,j)**γ]**δ


WHERE


N(i) = the number of camper parties per time unit who go
camping from centre i;

P(i) = the population of centre i;

S(i) = the average family income in centre i, used as a
surrogate for the socio-economic characteristics of
centre i;

A(j) = the "attractiveness" of park j;

C(i,j) = the cost of travel from origin i to park j, defined
in terms of time or money;

m = the number of parks; and

k, α, β, γ, δ = parameters to be estimated empirically;


and the summation is over all parks.
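
As an illustration only, Equation 2 can be written as a short Python function. This is a sketch under stated assumptions: the function name, argument names and all numerical inputs are hypothetical, and no calibrated parameter values are implied.

```python
def participation_option1(S_i, P_i, A, C_i, k, alpha, beta, gamma, delta):
    """Equation 2: N(i) = k S(i)**alpha P(i)**beta [sum_j A(j)/C(i,j)**gamma]**delta.

    A   : attractiveness A(j) of each of the m parks
    C_i : travel cost C(i,j) from centre i to each park (same order as A)
    """
    # Sum of the attractiveness/accessibility ratios over all parks.
    access = sum(a / c ** gamma for a, c in zip(A, C_i))
    return k * S_i ** alpha * P_i ** beta * access ** delta


# Hypothetical inputs, for illustration only.
N_i = participation_option1(
    S_i=9000.0, P_i=50000.0,
    A=[1.0, 2.5], C_i=[40.0, 120.0],
    k=1e-4, alpha=0.5, beta=1.0, gamma=2.0, delta=0.3,
)
print(N_i)
```

With S(i) and P(i) held fixed, varying the park list changes N(i) only through the summed ratio, which is the non-linear dependence the text describes.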


Equation (2) is option 1 of a participation component
in the main-destination model. It states that if S(i) and
P(i) are held constant, N(i) is a non-linear function of the
sum of all parks' attractiveness/accessibility ratios. (See
TN 3 and 11 and Chapter I Review of this Volume for a
further discussion of the role of such factors in
destination-use models.) An alternative formulation of the
participation component, option 2, is developed by
considering the option as containing two elements:
(3) N(i) = k S(i)**α P(i)**β F(A(j), C(i,j))


WHERE

F(A(j), C(i,j)) = the probability of a potential participant
from i visiting any of the m parks at all
in a given time period, where that
probability is a function of the
attractiveness (A(j)) and accessibility
(C(i,j)) of all m parks; and the other terms
are defined above.


In the first element, the number of potential
participants is some function of S(i) and P(i). The second
element is the probability of a potential camper visiting
any park per time unit. This is a function of the
attractiveness and accessibility of all parks. Thus, unlike
option 1, the probability of participation is explicitly
considered and is defined in terms of the attractiveness and
accessibility of all parks.

The probability term, F(A(j), C(i,j)), in Equation 3 is
determined as follows. First, compute the probability of a
camper deciding to visit no park at all. Represent this
probability as G(A(j), C(i,j)). If we can assume that
F(A(j), C(i,j)) for each park is independent of the
probability function of other parks, the rule of
multiplication from conditional probability theory enables
G(A(j), C(i,j)) to be defined as:


(4) G(A(j), C(i,j)) = PROD [1 - F(A(j), C(i,j))]


WHERE

1 - F(A(j), C(i,j)) = the probability of a camper from centre
i deciding not to visit park j; and

m = number of parks;

and the product is calculated over all m parks.


Given that the probability of visiting any park at all
is one minus the probability of not visiting any park at
all:


(5) F(A(j), C(i,j)) = 1 - G(A(j), C(i,j))


It follows from Equations 4 and 5 that Equation 3 can
be rewritten as:


N(i) = k S(i)**α P(i)**β [1 - PROD (1 - F(A(j), C(i,j)))]

As developed in Cesario et al., let:

(6) F(A(j), C(i,j)) = EL A(j)/C(i,j)**γ


WHERE EL = the "elasticity" of the probability of a visit
with respect to accessibility.


Therefore, Equation 3 can be rewritten as:

(7) N(i) = k S(i)**α P(i)**β [1 - PROD (1 - EL A(j)/C(i,j)**γ)]

WHERE the product is calculated over m parks.


This is Option 2 for a participation component of the
main-destination model.
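
The chain from Equation 4 to Equation 7 can be sketched in Python as follows. This is a hedged illustration, not the Study's program: the function names and inputs are hypothetical, and the check that each per-park probability lies between 0 and 1 is an added assumption that the text does not state.

```python
import math


def visit_probability(A, C_i, EL, gamma):
    """F = 1 - PROD_j [1 - EL*A(j)/C(i,j)**gamma]  (Equations 4, 5 and 6)."""
    per_park = [EL * a / c ** gamma for a, c in zip(A, C_i)]
    # Assumption: each per-park term must be a valid probability.
    if any(p < 0.0 or p > 1.0 for p in per_park):
        raise ValueError("per-park probabilities must lie in [0, 1]")
    # G: probability of visiting no park at all (independence assumed).
    no_visit = math.prod(1.0 - p for p in per_park)
    return 1.0 - no_visit


def participation_option2(S_i, P_i, A, C_i, k, alpha, beta, EL, gamma):
    """Equation 7: N(i) = k S(i)**alpha P(i)**beta * F."""
    return k * S_i ** alpha * P_i ** beta * visit_probability(A, C_i, EL, gamma)


# Two hypothetical parks with per-park probabilities 0.5 and 0.25.
print(visit_probability([1.0, 1.0], [2.0, 4.0], EL=1.0, gamma=1.0))
```

Because the per-park term in Equation 6 is not bounded by construction, a calibration routine would probably need some constraint of this kind on EL and γ.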


Distribution Component 


When estimates of participation in camping are known
for all the origins in the system, the trips are distributed
to all the parks in the system. The attractiveness of, and
the "cost" of reaching, the parks in the system from the
origins will determine the camping use that a particular
park receives. Following the development in Cesario et al.
(1969), the probabilistic distribution function for
main-destination campers, and enroute campers, is:

(8) V(i,j) = N(i) {A(j)/C(i,j)**γ} / {Σ A(k)/C(i,k)**γ}


WHERE

V(i,j) = the number of overnight camper parties that park j
receives from origin i during a period of time; the
other variables are as previously defined; and
the summation is over all parks k.


(The function is probabilistic in that, for any one park j,
the expression

{A(j)/C(i,j)**γ} / {Σ A(k)/C(i,k)**γ}

defines the probability of a camper from centre i visiting
park j.)
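
A minimal sketch of the distribution step in Equation 8 is given below, assuming hypothetical function names and inputs. It shows that the shares for one origin sum to one, so the predicted visits add up to N(i).

```python
def park_shares(A, C_i, gamma):
    """P(i,j) = [A(j)/C(i,j)**gamma] / sum_k [A(k)/C(i,k)**gamma]."""
    weights = [a / c ** gamma for a, c in zip(A, C_i)]
    total = sum(weights)
    return [w / total for w in weights]


def distribute(N_i, A, C_i, gamma):
    """Equation 8: V(i,j) = N(i) * P(i,j) for every park j."""
    return [N_i * share for share in park_shares(A, C_i, gamma)]


# Two equally attractive hypothetical parks, the second twice as costly to reach.
shares = park_shares([1.0, 1.0], [1.0, 2.0], gamma=1.0)
visits = distribute(300.0, [1.0, 1.0], [1.0, 2.0], gamma=1.0)
print(shares, visits)
```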


Now incorporating the two options for the participation
component, Equations 2 and 7, into the distribution
function, Equation 8, gives the two alternative complete use
models for main-destination campers:


(9) V(i,j) = k S(i)**α P(i)**β [Σ A(k)/C(i,k)**γ]**δ *
{A(j)/C(i,j)**γ} / {Σ A(k)/C(i,k)**γ}

= k S(i)**α P(i)**β [Σ A(k)/C(i,k)**γ]**(δ-1) *
{A(j)/C(i,j)**γ}


WHERE the summations are over all parks. MODEL 1


(10) V(i,j) = k S(i)**α P(i)**β *
[1 - PROD (1 - EL A(k)/C(i,k)**γ)] *
{A(j)/C(i,j)**γ} / {Σ A(k)/C(i,k)**γ}


WHERE the product and the summation are calculated over m parks. MODEL 2
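
The simplification in Model 1 (cancelling one power of the summed ratio) can be checked numerically. The sketch below is illustrative only: all names and inputs are hypothetical, and Model 2 is written on the assumption that it is the Equation 7 participation term multiplied by the Equation 8 share.

```python
import math


def weights(A, C_i, gamma):
    """Attractiveness/accessibility ratio A(k)/C(i,k)**gamma for every park."""
    return [a / c ** gamma for a, c in zip(A, C_i)]


def model1_long(j, S_i, P_i, A, C_i, k, alpha, beta, gamma, delta):
    """First form of Model 1: participation (Eq. 2) times share (Eq. 8)."""
    w = weights(A, C_i, gamma)
    total = sum(w)
    return k * S_i ** alpha * P_i ** beta * total ** delta * (w[j] / total)


def model1_short(j, S_i, P_i, A, C_i, k, alpha, beta, gamma, delta):
    """Second form of Model 1, with exponent (delta - 1) on the summed ratio."""
    w = weights(A, C_i, gamma)
    return k * S_i ** alpha * P_i ** beta * sum(w) ** (delta - 1.0) * w[j]


def model2(j, S_i, P_i, A, C_i, k, alpha, beta, EL, gamma):
    """Model 2: participation (Eq. 7) times share (Eq. 8)."""
    w = weights(A, C_i, gamma)
    participation = 1.0 - math.prod(1.0 - EL * x for x in w)
    return k * S_i ** alpha * P_i ** beta * participation * (w[j] / sum(w))


# Hypothetical inputs: the two forms of Model 1 should agree for every park.
args = (8000.0, 20000.0, [1.0, 2.0, 0.5], [30.0, 90.0, 15.0], 1e-3, 0.4, 0.9, 1.8, 0.6)
for j in range(3):
    print(j, model1_long(j, *args), model1_short(j, *args))
```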


For enroute campers, only one modification is made in
Equations 9 and 10. Participation in this case is assumed to
be a function of both the volume of camper vacation traffic
and the overall attractiveness and accessibility of parks to
the routes followed by that traffic. Whereas the
main-destination model uses origin socio-economic data to
estimate its probable camper population, the enroute model
defines the camper population as the volume of
camper traffic on a particular highway. Therefore, the
socio-economic variables used in Equations 9 and 10 are
irrelevant in estimating the participation component of the
enroute camper model. However, the term defining the overall
attractiveness and accessibility of parks to routes used by
camper vacation traffic is still relevant. Accordingly, the
enroute model omits S(i) in Equations 9 and 10 and defines
P(i) in these equations as the volume of camper vacation
traffic on route segment i.

If it is assumed that all camper vacation traffic will
make an overnight stop at one of the parks under study, then
a simplified form of the enroute camper model can be used;
specifically:

(11) V(i,j) = k P(i) {A(j)/C(i,j)**γ} / {Σ A(k)/C(i,k)**γ}


As the total number of camper party visits has been
estimated by the model as V(i,j), and the number of nights per
party is obtainable from on-site records or surveys, it is
reasonable to consider these as two different phenomena.
Camper party-nights is a figure representing the number of
camper parties multiplied by the average number of nights
per visit:


Total party nights = V(i,j) x average nights/party.


This simple calculation is meaningful for destination
campgrounds and enroute campgrounds, although the average
length of stay at an enroute campground will be close to 1.0.
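
The simplified enroute model and the party-night calculation might be sketched as below. The names, the traffic volume and the length-of-stay figure are hypothetical, and the ratio A(j)/C(i,j)**γ is assumed to take the same form as in the main-destination equations.

```python
def enroute_visits(traffic_i, A, C_i, gamma, k=1.0):
    """Equation 11: V(i,j) = k P(i) [A(j)/C(i,j)**gamma] / sum_k [A(k)/C(i,k)**gamma].

    traffic_i : volume of camper vacation traffic on route segment i, i.e. P(i)
    C_i       : cost of reaching each park from the route segment
    """
    weights = [a / c ** gamma for a, c in zip(A, C_i)]
    total = sum(weights)
    return [k * traffic_i * w / total for w in weights]


def party_nights(visits, average_nights_per_party):
    """Total party nights = V(i,j) x average nights per party."""
    return visits * average_nights_per_party


# Hypothetical segment with 1000 camper parties and two parks equally near the route.
visits = enroute_visits(1000.0, A=[1.0, 3.0], C_i=[1.0, 1.0], gamma=2.0)
print([party_nights(v, 1.0) for v in visits])
```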


DATA NEEDED 
The following data are needed to operationalize
Equations 9 and 10 for both main-destination and enroute
campers:


1. The total number of parties from origin i visiting park
j in a given time period.


2. The quantity (and qualities, if possible) of the
facilities of park j.


3. The quantity (and qualities, if possible) of camping
facilities of park j.


4a. The distance between origin i and park j. And/or


4b. The time required to travel the distance from origin i
to park j. And/or


4c. The money cost of travelling the distance between
origin i and park j, and the camping permit fee.


5a. The distance between park j and the nearest point on an
arterial or major highway carrying vacation traffic.
And/or


5b. The time required to travel the distance from the
nearest point on an arterial highway to park j. And/or


5c. The money cost of travelling the distance from the
nearest point on an arterial highway to park j, and the
camping permit fee.


6. The average family income of origin i.


7. The population size of origin i.


8. The volume of camper vacation traffic on arterial
highway segment i.


DISCUSSION AND CONCLUSION 


As already mentioned in the section, Theoretical
Considerations, the models defined in Equations 9 and 10
refer only to enroute campers who are 'making a bee-line'
for a particular destination, and not to those who are
willing to make a detour to one or more places of interest
off their main route. Similarly, with main-destination
campers, the models refer only to those campers for whom the
attributes of a particular park (including its distance) are
the sole criterion for choosing a particular camping
destination. The latter restriction could be partially
relaxed if attributes of the region in the immediate
vicinity of the park were measured and used to define A(j).
However, the models would still not cover those campers
whose destination is a wider region within which several
distinct destinations exist, since in that case the choice
of one destination is not independent of the others chosen.
To model such touring campers would require a structurally
more complex model, as well as more detailed itinerary
information.

As mentioned, it is clear that trip duration affects
the choice of destination. Specifically, the more time a
camper has, the greater the range of alternative
destinations available to him. In terms of the models
developed, the exponent on C(i,j) is expected to decrease as
trip duration increases. However, it is also conceivable
that the importance attached to different features of a park
which influence its attractiveness will vary with trip
duration. Thus, while back-packing facilities may be an
important feature in a park for those contemplating a stay
of three or more days, these facilities may assume much less
importance than would, say, a campsite with a lake frontage
for a weekend camper. While these variations do not
necessitate any structural change in Equations 9 and 10, it
is suggested that calibrating such equations using all
camper visits irrespective of total trip duration might
result in poorer predictive accuracy than if separate
calibrations were performed. The latter, of course, would
necessitate data disaggregated by trip duration.

Although the enroute model assumes that data on the
volume of vacation traffic on highways adjacent to a
particular park defines a user population and that the
number of overnight stops at a park is a direct function of
that volume, there is one instance in which this assumption
may produce sizeable errors.

Let the word "origin" refer, for the moment,
not necessarily just to home or work location but to the
starting point for any day's journey. For example, for an
enroute camper going from Ontario to Alberta, his origin on
a given day may be the park he stopped at the previous
night. If, for the volume of vacation traffic measured on
any highway segment i, the distribution of distances to
origins is the same as for any other highway segment, then
there is no problem in directly relating V(i,j) to that
volume of vacation traffic. However, there are a few obvious
and important highway segments that will have distributions
of distances from origins that are decidedly different from
those of other highway segments. An example should help
clarify this. On the highway segment immediately north of
Toronto leading towards Sudbury and the west, a large
percentage of camper vacationers heading towards Sudbury
will have originated in Toronto. Thus, although the volume
of vacation traffic may be very high on that route, it is
unlikely that many will make their first overnight stop
within, say, 50 to 100 miles of Toronto, even if they left
Toronto only late that afternoon. If on any highway segment
a relatively large percentage of vacation travellers comes
from a nearby origin, the proportion making an overnight
enroute stop at a park in the vicinity of that highway will
be less than would normally be predicted from that volume of
traffic. Clearly, this problem will be most acute in the
vicinity of metropolitan areas. The point is that the simple
volume count of vacation traffic on a highway segment is
likely to be a much poorer predictor of the number of
enroute stops at an adjacent park than the use of a frequency
distribution of distances to those travellers' origins for
the day.

A final problem that can be noted is that both
Equations 9 and 10 are non-linear functions with,
respectively, four and three parameters to be solved for, if
the values of park attractiveness are assumed to be defined
exogenously. If A(j) values are to be estimated within the
model then Equations 9 and 10 have m + 4 and m + 3 unknown
parameters. It is not certain that any reliable and
efficient solution technique exists. Particularly with m + 4
and m + 3 unknowns, a heuristic non-linear regression
algorithm is unlikely to converge on a solution in any
reasonable time. However, it is more likely that with three
or four unknowns, a non-linear regression procedure could be
used effectively, although even in this case optimality is
not assured and required computer time may be large.
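
To give a feel for the calibration problem, the sketch below fits a single parameter, γ, by brute-force grid search on the distribution shares alone. It is a deliberately reduced illustration and not the non-linear regression procedure discussed above: attractiveness values are assumed known, N(i) is taken from observed row totals, and all names, data and the search range are hypothetical.

```python
def predicted(N, A, C, gamma):
    """Predicted V(i,j) from Equation 8 for every origin i and park j."""
    rows = []
    for n_i, c_i in zip(N, C):
        w = [a / c ** gamma for a, c in zip(A, c_i)]
        total = sum(w)
        rows.append([n_i * x / total for x in w])
    return rows


def sse(obs, pred):
    """Sum of squared differences between observed and predicted flows."""
    return sum((o - p) ** 2 for ro, rp in zip(obs, pred) for o, p in zip(ro, rp))


def calibrate_gamma(obs, A, C, lo=0.0, hi=4.0, steps=400):
    """Return the grid value of gamma with the smallest squared error."""
    N = [sum(row) for row in obs]  # participation taken from observed totals
    grid = [lo + (hi - lo) * s / steps for s in range(steps + 1)]
    return min(grid, key=lambda g: sse(obs, predicted(N, A, C, g)))


# Synthetic flows generated with gamma = 1.5, then recovered by the search.
A = [1.0, 2.0, 3.0]
C = [[10.0, 20.0, 40.0], [30.0, 15.0, 60.0]]
obs = predicted([100.0, 200.0], A, C, 1.5)
print(calibrate_gamma(obs, A, C))
```

A grid search scales poorly as parameters are added, which is consistent with the concern above about m + 3 or m + 4 unknowns.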

In conclusion, the proposed models of enroute and
main-destination overnight park use have been argued to be
applicable to a certain well-defined type of
main-destination park user and a certain type of enroute park
user. Insofar as there are many minor variations in camper
types, it may be reasonable to consider developing a more
general model which could subsume the types considered in
this paper as special cases. In the process of deriving such
a model, care should be taken to see that it is
mathematically soluble.


TN 30 CHAPTER II page 98 


CORD STUDY TECHNICAL NOTE 4 


ESTIMATING PARK ATTRACTIVENESS, 
POPULATION CENTER EMISSIVENESS, 
AND THE EFFECT OF DISTANCE (LOCATION)
IN OUTDOOR RECREATION TRAVEL


F. J. Cesario


ABSTRACT 


This report contains a discussion of the results of the
application of a two-stage methodology for estimating
certain parameters of outdoor recreation travel. Stage I
analysis results in extracting systematic travel (distance),
population center (emissiveness) and park effects
(attractiveness) from outdoor recreation trip-making data
by the use of an analysis of covariance technique. A
"reaction to travel parameter" is also determined. Stage II
analysis is where the researcher attempts to account for
differences in estimated population center (emissiveness)
and park (attractiveness) effects by means of multivariate
analyses. The goal at this stage is to identify those
population center characteristics (population magnitudes,
income, age, etc.) that appear to best account for observed
variations in aggregate outdoor-recreation trip-making
behaviour and to identify park characteristics, etc. that
account for attractivity.

Stage I analysis of Ontario data obtained from camping
permits showed that statistically significant population
center, park and location effects on camper trip-making
could be extracted from the 1968 Ontario data on all use of
Ontario campgrounds.

Stage II analysis had some expected consequences and
some unexpected consequences. Emissiveness (city effects)
was found to be explained by city size, with income not
being important. It is well known that the volume of
visitor flow from an origin to a destination is proportional
to the population of the origin, but income is usually also
considered to influence the volume of flows. For
attractiveness, it was found that such expected factors as
size of park and numbers of campsites had a positive
influence on park attractiveness. But the complexity of
the relation discovered has not been recognized in the past.

The ways in which the results can be put to practical
use by planners and researchers are emphasized.


INTRODUCTION 


In the planning and design of park systems, outdoor
recreation agencies are continually faced with a series of
complex questions. How many parks are needed? Where should
new parks be located? What existing parks should be expanded
and in what ways? What mix of what activities should be
provided on what sites? How many camping spaces should be
provided? The list of such planning questions is virtually
endless. Since recreation budgets are not unlimited, hard
choices must be made in deciding exactly what levels and
mixes of recreation opportunities to provide and where to
provide them.

It can be realistically argued that rational planning
would involve a systematic exploration of alternative plans
and policies in a formal or systematic way: that is, an
"informed" choice would take into account all (or as many as
possible) of the consequences of alternative solutions to a
particular planning problem before deciding upon a specific
course of action. In many cases, the park use implications
of alternative plans and policies are of paramount
importance. For example, a recreation agency might wish to
choose that alternative which promised to result in the
greatest increase in regional recreation visits (properly
distributed). But then it is imperative that the planning
agencies have on hand considerable relevant information with
which to assess the visitor implications of each financially
feasible plan of recreational development that may be
considered when resources are to be allocated.

In an attempt to develop this relevant planning
information, the last ten years or so have seen many outdoor
recreation researchers working on the construction of
"models" which purport to "explain" or "predict" various
aspects of outdoor recreation trip-making behavior (see
References 6, 17, 19, 27, 35). These models attempt to
relate various measures of park use to the factors that give
rise to them. They have, in fact, provided many insights as
to what factors are and are not important.

Yet, these modeling efforts have been plagued by some
common difficulties. The modeler is inevitably faced with
the complex problems of, first, selecting the appropriate
variables to include in his model and, second, postulating
how these variables might combine to give rise to a
particular use pattern. In short, there are major unresolved
problems concerning both the substance and the form of the
relationships. In many cases the variables are arbitrarily
chosen, perhaps on the basis of data availability, and the
model is constructed within a linear framework because that
class of model is easily soluble. But it is a rather
restrictive approach. It may be possible to do better. Such
improvement is a goal of this study.

This report contains the results of a pilot study
carried out in Ontario, using a new and untested methodology
developed by the author, to identify those factors which by
themselves and in combination appear to be the most
important influences shaping park use levels and related
patterns in outdoor recreation systems. As will be seen,
this new methodology represents a substantial departure from
previous methodologies and, with appropriate qualifications,
offers considerable promise of (1) allowing better
prediction of visitor flows than has been possible, and (2)
affording insights into the particular factors that are
important in determining these flows. Utilizing findings
obtained in this study and possible future applications of
the research methodology, outdoor recreation planners and
researchers will hopefully be able to make improved
decisions in their respective areas of responsibility.


FRAMEWORK FOR ANALYSIS 


The purpose of this investigation is to test, on a
pilot study basis, a research methodology to identify
factors that appear to be important determinants of park use
and to uncover the particular (aggregate) characteristics of
specific parks and of specific population centers which give
rise to observed variations in regional outdoor recreation
trip-making behavior. In addition to revealing these
population center and park effects, the effect of distance
(i.e., location) on participation is isolated.

For analysis purposes, assume that the region of
interest (e.g., part of a province, an entire province,
several provinces) is subdivided into N mutually exclusive
population centers (e.g., counties, townships, census
tracts). Assume that there are M outdoor recreation parks
located in, and possibly out of, the region. The distance
between population center i and park j, as measured (say) by
the number of road miles over the route of minimum distance,
is denoted by d(i,j). Let (d(i,j)) be an (NxM) matrix of
trip distances. Let v(i,j) represent the number of visits
of a particular type observed to emanate from population
center i and terminate at park j during some time period of
finite duration. Then (v(i,j)) is an (NxM) matrix of trip
frequencies.

Let the total number of recreation trips emanating from
population center i be denoted by O(i); let the total number
of trips terminating at park j be denoted by V(j); and let
the total number of trips taken in the region be denoted by
W. Then:


(1) O(i) = Σ v(i,j); i = 1, 2, ..., N

WHERE the summation is over M parks.

(2) V(j) = Σ v(i,j); j = 1, 2, ..., M

WHERE the summation is over N population centers.

(3) W = Σ O(i) = Σ V(j) = Σ Σ v(i,j)


WHERE the i's are summed over N population centers
and the j's are summed over M parks.
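
The three totals are simply the row sums, column sums and grand sum of the trip matrix. A small sketch with a hypothetical 2 x 3 matrix of visits:

```python
def trip_totals(v):
    """Return (O, V, W) for an N x M matrix of trips v[i][j].

    O[i] : trips emanating from population center i (row sum, Equation 1)
    V[j] : trips terminating at park j (column sum, Equation 2)
    W    : all trips in the region (grand total, Equation 3)
    """
    O = [sum(row) for row in v]
    V = [sum(col) for col in zip(*v)]
    W = sum(O)
    return O, V, W


# Hypothetical counts for two population centers and three parks.
O, V, W = trip_totals([[1, 2, 3], [4, 5, 6]])
print(O, V, W)
```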


From the above, it is seen that V(j) is merely the total
visitor flow from the region observed at park j during the
relevant time period. The above quantities of interest are
displayed in the tableau of Table 1. This analysis assumes
that trip-making data of the above type are available for at
least one time period (e.g., a summer season). Further, it
is assumed that it is possible to measure certain overall
socio-economic characteristics of the population centers as
well as specific site characteristics of the parks under
consideration. In this context, we then ask what it is
possible to learn from these data.


TABLE 1


TRIP DISTRIBUTION TABLEAU


                         Parks (j)
  i             1      2     ...     M      Σ v(i,j)

  1            v11    v12    ...    v1M       O1

  2            v21    v22    ...    v2M       O2

  ...

  N            vN1    vN2    ...    vNM       ON

  Σ v(i,j)     V1     V2     ...    VM      W = Σ O(i)
                                              = Σ V(j)


METHODOLOGY 


The methodology employed in this study is a two-stage
estimation procedure. The first stage of the analysis
extracts systematic population center and park effects from
the data structured as in Table 1. The second stage attempts
to identify the population center and park characteristics
that appear to account for differences in these effects.
Before discussing each stage of the analysis in detail, it
is useful to present a brief overview of the underlying
modeling postulates.


Theoretical Basis 


Basically, it is postulated that the number of
recreation trips made from population center i to park j is
a function of (1) certain population center characteristics
such as population size, income distribution and measures
of other socio-economic factors, (2) certain park
characteristics such as acreage, miles of shoreline and
other natural or man-made features, and (3) spatial
separation. Further, it is assumed that population center
characteristics relate to each other in a certain way.
Similar assumptions can be made with respect to park
characteristics and spatial separation. Thus the number of
visits from population center i to park j may be expressed
as:


(4) v(i,j) = f( g1(population center characteristics),
g2(park characteristics), h(d(i,j)) ).


It is assumed that g1 and g2 are independent of h(d(i,j))
and that their particular functional forms are unknown. The
functional form of h(d(i,j)) is presumed to be specified
except for values of its unknown parameters. Viewed in this
way the estimation problem is one of finding appropriate
functions g1 and g2 as well as any unknown parameters of the
particular distance function h(d(i,j)) employed.

In a lengthy development, Cesario (see Reference 8)
made the above concepts operational by formulating the
following model structure:


(5) v(i,j) = K E(i) A(j) h(d(i,j)) exp(ε(i,j))


to be representative of the data, where K is a constant of
proportionality; E(i) is an unobservable population center
factor called "emissiveness"; A(j) is an unobservable park
factor called "attractiveness"; h(d(i,j)) is a particular
distance function; and ε(i,j) is an error term. The
quantities v(i,j) and d(i,j) are observable whereas E(i),
A(j) and parameters of h(d(i,j)) are unobservable and must
be estimated. Cesario (see References 7, 8) indicates that
vectors (E(i)) and (A(j)) are each specified up to
multiplicative constants (i.e., only ratios E(i)/E(k), k ≠ i
and A(j)/A(l), l ≠ j have meaning).


Thus, emissiveness of population center i, E(i),
reflects its relative propensity (to other population
centers) to emit trips under identical circumstances (that
is, as if all centers were confronted by the same
availability or "supply" of recreation opportunities).
Emissiveness thus serves as a relative measure of population
center "participation" that is unencumbered by the existence
of a differential supply factor. It is seen to be a metric
that represents the combined effects of a multitude of
population center characteristics on recreation trip-making.
It is possible, of course, to think of many reasons why one
origin might be more emissive than another. One obvious
reason is due to the possible differences in population
sizes among population centers. Another reason might be due
to differences in socio-economic composition of the
population. Each of these factors needs to be investigated
to discover the role each plays in giving rise to
differential E(i) values (to estimate g1).

In contradistinction to emissiveness, attractiveness of
park j, A(j), reflects its relative ability (to other parks)
to attract trips under identical circumstances (for example,
as if all parks were equally accessible from each population
center). Attractiveness thus serves as a measure of park
"popularity". It is a metric that reflects the combined
effects of a multitude of park characteristics on recreation
trip-making. There are many potential reasons why one park
might be more appealing than another: one might have
excellent swimming facilities whereas another does not;
another might have an excellent network of hiking trails,
whereas another has none. Again there is a need to account
for differential A(j) values in terms of these factors (to
estimate g2).

Finally, the distance function h(d(i,j)) is a
"correction" term to account for violations of the above
ceteris paribus conditions. Various functional forms can be
used. Common examples are h(d(i,j)) = d(i,j)**β, h(d(i,j))
= exp(βd(i,j)), and h(d(i,j)) = d(i,j)**β exp(αd(i,j)),
where α and β are parameters to be estimated. Each of these
functions has different behavioral implications (see
Reference 14).
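
The three functional forms can be compared directly. The sketch below assumes hypothetical function names and parameter values; it only illustrates that the combined form reduces to the power form when α = 0.

```python
import math


def h_power(d, beta):
    """Power form: h(d) = d**beta."""
    return d ** beta


def h_exponential(d, beta):
    """Exponential form: h(d) = exp(beta * d)."""
    return math.exp(beta * d)


def h_combined(d, alpha, beta):
    """Combined form: h(d) = d**beta * exp(alpha * d)."""
    return d ** beta * math.exp(alpha * d)


# Hypothetical parameter values; negative values make h fall with distance.
for d in (10.0, 50.0, 100.0):
    print(d, h_power(d, -1.5), h_exponential(d, -0.02), h_combined(d, -0.01, -1.0))
```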

From the above, we see that the process of extracting
relevant factors affecting park use may be thought of in two
stages. Stage I would involve the estimation of vectors
(E(i)) and (A(j)) of Equation 5. Stage II would determine
the combinations of population center characteristics and
park characteristics that appear to account for whatever
differences were found to exist in the E(i)'s and A(j)'s.
This is the essence of the methodology under test.


Stage I: Estimation of Vectors (E(i)) and (A(j))

A logarithmic transformation of Equation 5 for a
particular h results in what is essentially an analysis of
covariance model from which estimates of (E(i)) and (A(j)) can
be ultimately obtained. Letting h(d(i,j)) = d(i,j)**β for
convenience, we form:

(6) v(i,j) = K E(i) A(j) d(i,j)**β exp(ε(i,j)).

Taking logarithms of both sides of Equation 6 yields:

(7) ln v(i,j) = ln K + ln E(i) + ln A(j) + β ln d(i,j) + ε(i,j).

Letting

y(i,j) = ln v(i,j)
m = ln K
e(i) = ln E(i)
a(j) = ln A(j)
x(i,j) = ln d(i,j)

produces the linear model

(8) y(i,j) = m + e(i) + a(j) + β x(i,j) + ε(i,j).


In Equation 8, y(i,j) and x(i,j) are given and e(i), a(j), β
are unknown. Estimation by the usual least-squares method
yields vectors (e(i)) and (a(j)) which, when transformed by
taking antilogarithms, yield (biased) estimates of (E(i))
and (A(j)). That is, exp(e(i)) is approximately equal to
E(i) and exp(a(j)) is approximately equal to
A(j). These estimated parameters will have the same
interpretation as provided in the previous section.
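
For a complete table with no empty cells, the least-squares fit of Equation 8 can be sketched without any matrix library. The following is an illustration under explicit assumptions, not the program used in the Study: every v(i,j) and d(i,j) must be positive (the logarithm is undefined otherwise), the effects are normalised so that the e(i) and the a(j) each sum to zero, and the function and variable names are hypothetical. With a complete table, β follows from "double-centering" x and y (removing row and column means), a standard result for two-way layouts with one covariate.

```python
import math


def stage1_estimate(v, d):
    """Least-squares fit of y = m + e(i) + a(j) + beta*x on a complete N x M table.

    v[i][j] : observed visits from center i to park j (all > 0)
    d[i][j] : distance from center i to park j (all > 0)
    Returns (m, e, a, beta) with sum(e) = 0 and sum(a) = 0.
    """
    n, n_parks = len(v), len(v[0])
    y = [[math.log(val) for val in row] for row in v]
    x = [[math.log(val) for val in row] for row in d]

    def means(t):
        row = [sum(r) / n_parks for r in t]
        col = [sum(c) / n for c in zip(*t)]
        return row, col, sum(row) / n

    yr, yc, yg = means(y)
    xr, xc, xg = means(x)

    # Slope from the double-centered values of x and y.
    num = den = 0.0
    for i in range(n):
        for j in range(n_parks):
            xt = x[i][j] - xr[i] - xc[j] + xg
            yt = y[i][j] - yr[i] - yc[j] + yg
            num += xt * yt
            den += xt * xt
    beta = num / den

    m = yg - beta * xg
    e = [(yr[i] - yg) - beta * (xr[i] - xg) for i in range(n)]
    a = [(yc[j] - yg) - beta * (xc[j] - xg) for j in range(n_parks)]
    return m, e, a, beta


# Noise-free synthetic flows built from hypothetical E, A, K and beta.
E, A, K, true_beta = [1.0, 2.0, 4.0], [1.0, 3.0], 5.0, -1.5
d = [[10.0, 20.0], [35.0, 15.0], [50.0, 80.0]]
v = [[K * E[i] * A[j] * d[i][j] ** true_beta for j in range(2)] for i in range(3)]
m, e, a, beta = stage1_estimate(v, d)
print(beta, math.exp(e[1] - e[0]), math.exp(a[1] - a[0]))
```

Only differences such as e(2) - e(1) are interpretable, which matches the earlier remark that emissiveness and attractiveness are specified up to multiplicative constants. Real permit data usually contain zero flows, which this sketch does not handle.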
If one is willing to make some rather strong
assumptions on the distribution of ε(i,j), a test for the
statistical significance of β can be made, and hypotheses
about E(i) and A(j) can be tested by the usual (modified)
analysis of variance methods. (See References 10, 29.) In
addition, various contrasts of population center and park
parameters can be tested to sort out homogeneous groupings.
See TN 19 for an expanded discussion of these points.

As implied by previous discussion, the Stage I analysis
provides information on the relative "importance" of
population centers, parks, and location in accounting for
the observed total variation in population center to park
trips (i.e., visitor flows). For instance, if all Ê(i)'s are
equal, differences between population centers do not
contribute to this variation. Likewise, if all Â(j)'s are
equal, differences between parks do not contribute to the
total variation in use, at least in a simple way (it is
possible that the effects of some factors may cancel out).
If, on the other hand, some Ê(i)'s and/or Â(j)'s are unlike,
then some variation in v(i,j) is due to these factors and
further investigation into the causes of these differences
would be in order. Finally, the magnitude of the distance
parameter β provides location inferences. For example, if β
is negative and large, it indicates that the elasticity of
recreation trips with distance is large or, more simply,
that visitor flows fall off rapidly with small increments in
distance. On the other hand, if β is identically 0, then
(everything else being equal) the same visitor flows would
be achieved for a park no matter where it was located. If β
is positive, then the plausibility of the results or the
data would be in question.


Stage II: Accounting for Differences in
(E(i)) and (A(j))


Assume that estimated parameter sets (Ê(i)) and (Â(j))
are available from Stage I, where Ê(i) and Â(j) represent
estimates of E(i) and A(j) respectively. Also assume that
not all Ê(i)'s are the same and not all Â(j)'s are the same.
The Stage II problem is to account for differences among the
Ê(i)'s and among the Â(j)'s. The quantities (Ê(i)) and (Â(j))
are taken to be the dependent variables in multivariate analyses
which have as their purpose the selection of a set of
independent variables that appear to account for variation
in these dependent variables. In the E(i) analysis,
independent variables consist of various measurable
characteristics of population centers which might make one
center more emissive than another. In the A(j) analysis,
independent variables consist of various measurable
characteristics of parks which might make one more
attractive than another. For applications where few
independent variables are involved, standard linear or
nonlinear regression approaches might be used; in more
complicated problems where many variables are expected to
interact in complex ways, the Automatic Interaction Detector
(AID) technique is a useful approach.

While the use of standard regression analysis is
commonplace, a few words of explanation about the potential
usefulness of the AID technique might be in order. The
technique developed by Sonquist and Morgan (see Reference
31) is a multivariate method of analysis which has as its
purpose the "classification" of data into homogeneous
groups. This empirical method is free of many of the
traditional strong, usually false, assumptions implicitly
employed in the analysis of data by techniques such as
regression analysis. Given a set of data (measured on
nominal, ordinal or interval scales) on some "independent"
variables, the AID technique determines what variables
combine to produce the greatest discrimination in group
means of the dependent variable. The total population of
observations is divided through a sequence of binary splits
into mutually exclusive "terminal" subgroups. At each stage
of the sequence, the dichotomization occurs so as to provide
the largest reduction in the unexplained sum of squares of
the dependent variable. Those group means are chosen that
account for more of the total remaining sum of squares of
the dependent variable than the means of any other possible
combination of independent variables. In this way, the mean
values of the dependent variable will be as different as
possible between groups, but as equal as possible within
groups. The output of the analysis is a so-called "tree
diagram" which shows the subgroups formed at each iteration,
along with associated statistics. (A very readable example
of the use of AID in a marketing context is given by Assael,
see Reference 2. Cesario, see Reference 9, discusses a
hypothetical use of AID in the present context.)
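
The core step of the AID technique, choosing the binary split that gives the largest reduction in the unexplained sum of squares, can be illustrated as follows. This is a simplified sketch and not the Sonquist and Morgan program: the function name best_binary_split is hypothetical, only a single ordered predictor is handled, and no stopping rule is applied. A fuller version would repeat the search over every candidate variable and then recurse on the resulting subgroups.

```python
import numpy as np


def best_binary_split(x, y):
    """Find the split 'x <= threshold' versus 'x > threshold' that most
    reduces the within-group sum of squares of y.

    Returns (threshold, reduction); threshold is None if no split helps.
    """
    x = np.asarray(x)
    y = np.asarray(y, dtype=float)
    total_ss = np.sum((y - y.mean()) ** 2)

    best_threshold, best_reduction = None, 0.0
    # Every distinct value except the largest defines a candidate split.
    for threshold in np.unique(x)[:-1]:
        left = y[x <= threshold]
        right = y[x > threshold]
        within_ss = (np.sum((left - left.mean()) ** 2)
                     + np.sum((right - right.mean()) ** 2))
        reduction = total_ss - within_ss
        if reduction > best_reduction:
            best_threshold, best_reduction = threshold, reduction
    return best_threshold, best_reduction
```

For a coded park characteristic taking the values 1, 2 and 3, the function compares the split {1} against {2, 3} with the split {1, 2} against {3} and keeps whichever leaves the group means furthest apart.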

The relevance of this AID technique to the Stage II
problem is obvious. Using emissiveness and attractiveness
factors estimated in Stage I for large numbers of population
centers and parks, respectively, together with data on
particular population center and park characteristics
hypothesized to give rise to emissiveness and attractiveness
differentials, AID is used to "classify" population centers
and parks into homogeneous groupings. Beyond the informative
aspects of these AID results - finding out which variables
are important and which are not - it becomes possible for
planning purposes to make informed estimates of the
attractiveness of any new or altered parks for which
appropriate measurements are available, and to assess the
associated park use implications. This systematic approach
may be contrasted with the procedure of defining
attractiveness in an ad hoc way. (See TN 1 and others cited
there.) It follows that results of AID analysis will provide
researchers with a more objective basis for the construction
of improved visitation models.


DATA BASE 


To implement the proposed two-stage methodology
described in the previous section, it was necessary for the
Stage I analysis to construct (v(i,j)) and (d(i,j)) matrices
for one or more visitor types in a particular Canadian
region, and for the Stage II analysis to collect information
on the characteristics of population centers and parks in
the region.


The Study Region 


For statistical reasons it was desirable to select a
study region for which a reasonably large data tableau of
the type shown in Table 1 could be constructed. To minimize
bias it was also desirable to work with a matrix containing
as many positive (hopefully large) entries as possible.

Recall that the parameters Ê(i) and Â(j), which are
antilogarithms of e(i) and a(j) respectively, are biased
estimates of E(i) and A(j). This bias is attributable to the
fact that in the "log-linear" model given by Equation 8
undue weight is given to small visitor flows.


Existing data sources were examined with bias and other
considerations in mind. After examining the potential for
each region in Canada to provide an appropriate data base
for estimation purposes, it was determined that the region
of Southern Ontario bordered roughly on the North by the
cities of North Bay and Sudbury, on the East by Ottawa, on
the South by Lakes Ontario and Erie, and on the West by Lake
Huron and Georgian Bay, would be suitable. This region is
shown in Figure 1. The region of Southern Ontario contains
recreation parks operating under several different
jurisdictions, e.g., Parks Canada, the Ontario Provincial
Government and various Conservation Authorities.
Unfortunately, due to paucity of data it was not possible to
include all of the region's parks in the study.
Concentration was focused on the analysis of data for as
many provincial parks as possible.


Visitor Types


Consistent with the Canadian Outdoor Recreation Demand
Study classification, visitors to recreation areas can be
broken down into the following types:

A. Day-use, main destination
B. Day-use, stopover
C. Overnight-use, main destination
D. Overnight-use, stopover

Visitor type A (day-use, main destination) is represented by
that party which originates a recreation trip at home,
visits only one park and returns home on the same day.
Visitor type B (day-use, stopover) leaves home and visits
two or more sites on the same day before returning home.
Visitor type C (overnight-use, main destination) embarks on
a camping trip from home and travels to one and only one
camping site before returning home after completing the
stay. Visitor type D (overnight-use, stopover) is typified
by vacation campers. This party leaves home and, whether
or not an ultimate single destination is the goal, stops for
short periods of time at many different campgrounds on a
single trip before returning home. Each of the above visitor
types is motivated by a different set of factors (each has
different amounts of leisure time available for the trip)
and conceivably can be considered separately for modeling
purposes. It is important to note that the methodology being
applied in this study is most relevant for the analysis of
the trip-making patterns of visitor types A and C.
Attention was focused, then, on securing data with which to
construct (v(i,j)) matrices for one or both of these visitor
types.

In connection with Visitor Type A, data for day-use
visitor types at thirty-five parks were available from CORD
Study surveys conducted in 1969; however, there were
problems in defining relevant sampling rates. In addition,
1971 data (10 percent sampling rate) were available from the
Ontario Department of Lands and Forests (ODLF). Due to very
serious problems in establishing the populations to which
the observations in both surveys belonged (i.e., total
visitor flow by type at a park on any given sampling day),
these data could not be used with any confidence as
indicators of actual day-use trip patterns and levels.

With reference to Visitor Type C, camper data for main
destination and stopover types were also available from CORD
Study tabulations. Further, the ODLF provided detailed
tabulations based on a 100 percent canvass of camping
permits issued during 1968. Unfortunately, ODLF data were
not broken down into main destination and stopover
categories. The sample did, however, provide an accurate
estimate of the total population of campers (the CORD Study
data on campers and day-use visitors suffer from problems in
defining sampling rates).

In collaboration with the Outdoor Recreation Research
Section, Parks Canada, it was decided that Visitor Type C
(main destination campers) would be the most appropriate
group to consider in the pilot study. A mixture of CORD
Study and ODLF data was then used to construct a relevant
matrix, (v(i,j)). The ODLF data were used to define an
initial (v(i,j)). But since the ODLF camper data do not
distinguish between main destination and stopover use, a
correction procedure, based on an analysis of CORD Study
main destination/stopover data, was developed to separate
the ODLF camper quantities into estimates of the number of
campers in main destination and stopover categories. This
procedure is described in the Appendix.


Compilation of the Data Set


The population centers chosen for use in this study
were 46 cities in the region shown in Figure 1 having a 1970
population greater than 9,000, according to Statistics
Canada estimates (i.e., N = 46). In addition to population
sizes, average income data were the only socio-economic
information available for use in the Stage II analysis.
(Actually, data on average income were only available for 30
out of the total of 46 population centers.) Table 2 presents
a list of the 46 Southern Ontario population centers used in
the study with the population data and average income
estimates that were available.

For the 46 population centers identified in Table 2, it
was possible to compile main-destination camper estimates
for 50 provincial parks in the Southern Ontario region
(i.e., M = 50). In anticipation of the Stage II analysis,
data on park characteristics were collected from various
published reports of ODLF and others. Information on 15
characteristics was available for each of the 50 parks
included in the study. Table 3 presents the park
characteristics information collected and gives the coding
system used in the AID analysis. Table 4 lists the 50 parks
included in this study along with (coded) associated
characteristics.

The (46 x 50) initial camper matrix (v(i,j)), as
compiled from ODLF records, is available upon request. These
data include both main destination and stopover users. For
input to Stage I of the analysis, an estimate of the main
destination component of each element of this aggregate
(v(i,j)) matrix was extracted by use of the procedure
described in the Appendix.

Road mileage over the route of minimum distance was
used as the relevant measure of the spatial separation
between population center i and park j, d(i,j). These data
were compiled by the Outdoor Recreation Research Section
using DHC highway link data supplied by ODLF. The (46 x 50)
matrix (d(i,j)) used in the analysis can be obtained from
the Socio-Economic Research Division, Parks Canada.


TABLE 2


POPULATION CENTER DATA


Code   Population               Population    Income
No.    Center (City)            1969(a)       1969(b)
 1     Brantford                   62,594
 2     Ottawa                     297,701
 3     St. Thomas                  24,520
 4     Leamington                  10,082
 5     Windsor                    198,997
 6     Kingston                    59,029
 7     Owen Sound                  18,189
 8     Burlington                  78,590
 9     Georgetown                  14,964
10     Oakville                    58,007
11     Belleville                  34,190
12     Trenton                     14,251
13     Chatham                     34,158
14     Wallaceburg                 10,637
15     Sarnia                      56,407
16     Smith's Falls                9,701
17     Brockville                  19,565
18     London                     211,699
19     Simcoe                      10,447
20     Cobourg-Port Hope           20,163
21     Ajax                        11,273
22     Oshawa                      86,185
23     Whitby                     223,103
24     Woodstock                   25,314
25     Brampton                    39,232
26     Peterborough                57,337
27     Hawkesbury                   9,240
28     Pembroke                    16,431
29     Barrie                      26,212
30     Midland                     10,646
31     Orillia                     20,542
32     Cornwall                    46,576
33     Lindsay                     12,483
34     Galt                        36,734
35     Kitchener-Waterloo         138,345
36     Preston                     15,385
37     Niagara Falls               63,054
38     Welland                     42,622
39     Guelph                      56,603
40     Hamilton-Dundas            321,277
41     Aurora                      12,338
42     Toronto(c)               1,253,000     6,741
43     Newmarket                   15,267     -
44     Richmond Hill               27,339     -
45     North Bay                   38,966     5,990
46     Sudbury-Copper Cliff        92,131     6,371


(a) Source: Estimated from 1966 and 1971 data provided in
    Ontario Population Statistics, 1971, prepared by
    Planning Research Section, Municipal Planning and
    Development Branch, Urban and Regional Development
    Division.


(b) Source: Inside - Taxation, 1972, a publication of the
    Department of National Revenue and Taxation.

(c) Includes Agincourt, Etobicoke, Don Mills, New Toronto,
    Scarborough, West Hill, Willowdale, and Toronto
    Proper.


TABLE 3


PARK CHARACTERISTICS CODES


Variable   Description                        Codes
X1         Number of acres                    0-183 acres; 184-849 acres;
                                              850 acres or more
X2         Number of camping units            0-100 units; 101-275 units;
                                              276 units and over
X3         Length of swimming beach           0-800 feet; 801-1900 feet;
                                              1901 feet or more
X4         Park class                         Recreational; Natural Environment;
                                              Primitive
X5         Acres of picnic area               0-2.5 acres; 2.6-35 acres;
                                              36 acres or more
X6         Miles of hiking trails             0 miles; 1 mile or more
X7         Availability of modern             Yes; No
           comfort stations
X8         Availability of museums/           Yes; No
           exhibition centers
X9         Availability of boat-              Yes; No
           launch ramps
X10        Availability of boats
           for hire
X11        Availability of trailer
           sanitary stations
X12        Availability of showers


TABLE 4


CODED CHARACTERISTICS OF 50
ONTARIO PROVINCIAL PARKS


Coded Park Characteristic


Park X1 X2 X3 X4 X5 X6 X7 X8 X9 X10 X11 X12
Rock Point 2. 1 2 1 2 1 2 3 1 2 1 2 
Mara 1 1 1 1 2 1 1 3 v7: v) 1 2 
Sibbald Point 2 3 3 1 3 2 1 1 1 2 1 2 
Earl Rowe 3 3 Be 1 3 1 1 3 1 2 1 2 
Craigleith 1 2 3 1 2 1 1 2 2 g: 2 2 
Inverhuron 2 3 3 2 2 1 2 3 j eos 2 2 
Point Farms By 2 2 1 2 1 1 3 1 2 2 2 
Sauble Falls 1 eae tt 1 2 1 1 3 2 2 1 2 
Bass Lake 1 2 ok 1 2 1 1 3 1 2 1 1 
Selkirk 1 2 2 if 2 1 2 3 1 2 yt 2 
Devil's Glen 1 1 az 1 2 st 2 3 2 2 2 2 
Turkey Point 2 3 ie 2 2 1 1 3 1 2 1 2 
Wheatley 2 ‘2 3 1 2 1 1 3 1 2 1 2 
Rondeau 3 3 3 2 3 2 2 1 1 2 1 2 
Holiday Beach 2 1 2 1 3 1 1 x 1 2 1 2 
Ipperwash 1 2 2 1 2 1 1 3 1 2 1 2 
Long Point iz 3 2 1 Z 1 1 3 1 2 1 2 
Pinery 3 3 3 2 Z 2 2 3 2 1 2 2 
Antoine 1 1 1 1 P) 1 2 3 1 2 1 2 
Samuel 


de Champlain 
Marten River 
Mississagi 
Chutes 
Killarney 
Windy Lake 
Fitzroy 
Rideau River 
Silver Lake 
South Nation 
Six Mile Lake 
Balsam Lake 
Darlington 
Emily 
Presqu'ile 
Serpent Mound 
Arrowhead 
Grundy Lake 
Killbear Pt
Mikisew 
Restoule 
Sturgeon Bay 
Algonquin 
Carson Lake 
Bonnecherre 


WDE OFP WORF WOWORF OF DOF RP RPP bw wD WH & © 
RBPePOmMNUNWOHONNONWHONRHHNH HH KE EHH 
NF OP WNHKHWOKHRKFWONNNHN FRE KDR ON HDHD HD 
= mm em hm bm HD PD = fe ee ope oe oe ee oe bo be bd 
Bem rFeEDONH FPN WNWNNNKENDY WO PHD Wb 
Oe Oe Oe Oe Oe oe ee 
RMBPONNHNHHKH BEBE HB BD eb bd hb bd & bo 
OWOWWHWHWHWNRP HOR WWHHWWHWWOWOWWN 
a al ot ed Se Oe 
WNHOKHHONHNHNNKHNHNNHNNHNNNNNKHNKHKHNH KP EY DY 
et ce A oo co oe oA oT on oe oT oe oe oo Oo eo oe on ol 
WHOKHHNHNKHNHNNNNHNNHNKHNNKHNHNNKHHKHNHNKH NHK NHN 
Black Lake 1 2 a 2 2 1 1 3 1 2 1 2

Bon Echo | 3 3 1 ~ 2 2 3 1 2 1 2 

Lake Ste Pete i 1 2 2 2 2 1 3 1 a 1 2 

Outlet Beach 2 3 3 1 3 1 1 3 1 2 1 2 

Oastler Lake i 2 1 1 1 1 2 3 1 2 7 2 

Driftwood 2 1 | 1 1 1 2 3 1 2 i 2 


EMPIRICAL RESULTS 


The results of applying the previously described two-
stage methodology in the analysis of the Southern Ontario
data are described in this section. Results for each stage
of the analysis are given separately.


Stage I Results


Parameters of Equation 8, the transformed version of
Equation 6, were estimated both for "unadjusted" and
"adjusted" v(i,j) data. The distance function
h(d(i,j)) = d(i,j)**β was used in both cases. The estimates of
K and β are given in Table 5. Recall that unadjusted data
include both main destination and stopover campers, while
adjusted data are estimates of main destination campers
only.


TABLE 5


STAGE I ESTIMATES OF K AND β


                        Parameters
                     K              β
Unadjusted data      exp(9.878)     -1.291
Adjusted data        exp(10.770)    -1.579


As suspected, there is considerable correspondence
between the two sets of results. The magnitudes of the
estimated values of K are of little importance since K acts
merely as a scaling parameter in the model.

As mentioned before, the parameter β connotes certain
behavioral implications; hence, the sign and magnitude of
its estimated value are of considerable importance. In this
analysis, the estimates of β using unadjusted and adjusted
data were both significantly different from 0 (beyond the
0.0001 level). The negative β in both cases is in
conformance with the usual hypothesis that visitation is a
decreasing function of distance; that is, all other things
being equal, the farther a park is located away from a
population center the fewer will be its visitors. The
numerical estimates of β obtained provide a clue as to the
particular rate at which visitor flow declines with distance
in the system under study.
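
To make this rate of decline concrete, the short calculation below applies the distance function d(i,j)**β with the two estimates of β reported in Table 5. The helper name flow_ratio is introduced here for illustration only. It gives the factor by which the expected visitor flow changes when distance is multiplied by a given amount, everything else being equal.

```python
def flow_ratio(distance_multiplier, beta):
    """Factor applied to v(i,j) when d(i,j) is multiplied by
    distance_multiplier, under the distance function d(i,j)**beta."""
    return distance_multiplier ** beta


# Estimates of beta taken from Table 5.
for label, beta in [("unadjusted", -1.291), ("adjusted", -1.579)]:
    print(label, round(flow_ratio(2.0, beta), 3))
```

With these estimates, doubling the distance to a park reduces the expected flow to roughly two-fifths of its former level for the unadjusted data and to roughly one-third for the adjusted data.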

The estimated emissiveness, E(i), and attractiveness,
A(j), parameters are given in Tables 6 and 7 respectively.
Again, results are given for analyses which used both
unadjusted and adjusted v(i,j) observations. Since the
estimates obtained by using the analysis of covariance
procedure to estimate Equation 8 are in natural logarithms,
transformed results are also provided. Correspondence
between the adjusted and unadjusted results is again
apparent. This is not surprising in view of the findings
reported in the Appendix. Nevertheless, due to the crudeness
of the main destination/stopover adjustment process there
are bound to be some irregularities in these tables,
especially Table 7. For instance, Chutes gets a relatively
high attractiveness value for main destination use even
though from an independent analysis it appears that this
park, being adjacent to the Trans-Canada highway, serves
primarily stopover users. The number of these possible
irregularities is hopefully quite small and one would
suspect that the AID results would not be very much
distorted.

Using the appropriate distributional assumptions, it
was determined that the proportions of the total variance in
the data accounted for by population centers in the
aggregate and by parks in the aggregate were significantly
different from 0. Tests of specific contrasts of
emissiveness and attractiveness parameters were not
conducted.


The results are interpreted as follows. First, distance
(location) is an important determinant of park use at a
particular site and should be explicitly considered in
decisions concerning the location of parks. It is, of
course, no great surprise to learn that park use decreases
as distance from population centers increases; any other
result would be looked upon with a considerable amount of
skepticism. The importance of this result in this study is
that by use of the function h(d(i,j)) with an estimated β,
the v(i,j) data are essentially "corrected" to eliminate the
distance effect. To put it another way, the location effect
due to distance (as indicated by the estimated value of β)
is essentially removed from the v(i,j) data so that the
estimated E(i)'s and A(j)'s represent the quantities that
would be found to prevail if the distance between every
population center and every park were equal. This
interpretation of E(i) and A(j) is completely consistent with
the interpretation provided in the operational definitions of
these concepts provided earlier.


TABLE 6


TWO SETS OF CAMPING EMISSIVENESS INDICES
FOR 46 ONTARIO POPULATION CENTERS(a)


       Study                            Adjusted Data       Unadjusted Data
       Code    Population
Rank   #       Center                   ln(E)      E        ln(E)      E
 1     42      Toronto                  3.697    40.326     3.962    52.562
 2     40      Hamilton-Dundas          1.943     6.980     2.027     7.591
 3     18      London                   1.876     6.527     2.097     8.142
 4      2      Ottawa                   1.796     6.025     2.142     8.516
 5      5      Windsor                  1.738     5.686     2.126     8.381
 6     35      Kitchener-Waterloo       1.534     4.637     1.659     5.254
 7      8      Burlington               1.215     3.370     1.309     3.702
 8      1      Brantford                1.057     2.878     1.266     3.547
 9     22      Oshawa                   0.811     2.250     0.864     2.373
10     15      Sarnia                   0.778     2.177     1.019     2.770
11     10      Oakville                 0.684     1.982     0.722     2.059
12     39      Guelph                   0.662     1.939     0.717     2.048
13     37      Niagara Falls            0.528     1.696     0.678     1.970
14     24      Woodstock                0.499     1.647     0.588     1.800
15     25      Brampton                 0.448     1.565     0.450     1.568
16      6      Kingston                 0.438     1.550     0.545     1.725
17     34      Galt                     0.367     1.443     0.396     1.486
18     46      Sudbury-Copper Cliff     0.319     1.376     0.435     1.545
19     26      Peterborough             0.312     1.366     0.344     1.411
20     38      Welland                  0.209     1.232     0.294     1.342
21     13      Chatham                  0.205     1.228     0.322     1.380
22      3      St. Thomas              -0.003     0.997     0.058     1.060
23     45      North Bay               -0.069     0.933    -0.029     0.971
24     29      Barrie                  -0.278     0.757    -0.313     0.731
25     11      Belleville              -0.296     0.744    -0.331     0.718
26      9      Georgetown              -0.411     0.663    -0.449     0.638
27     41      Aurora                  -0.568     0.567    -0.625     0.535
28     17      Brockville              -0.574     0.563    -0.681     0.506
29     36      Preston                 -0.630     0.533    -0.678     0.508
30     23      Whitby                  -0.654     0.520               0.467
31     44      Richmond Hill           -0.686     0.504               0.450
32     21      Ajax                    -0.747     0.474    -0.837     0.433
33     20      Cobourg-Port Hope       -0.755     0.470    -0.863     0.422
34     14      Wallaceburg             -0.802     0.448               0.391
35      7      Owen Sound              -0.815     0.443    -0.853     0.426
36      4      Leamington              -0.851     0.427    -0.952     0.386
37     43      Newmarket               -0.876     0.416    -0.980     0.375
38     32      Cornwall                -0.886     0.412    -1.029     0.357
39     19      Simcoe                  -0.999     0.368    -1.116     0.328
40     12      Trenton                 -1.012     0.363    -1.162     0.313
41     31      Orillia                 -1.168     0.311    -1.272     0.280
42     33      Lindsay                 -1.317     0.268    -1.488     0.226
43     28      Pembroke                -1.408     0.245    -1.602     0.201
44     27      Hawkesbury              -1.563     0.210    -1.957     0.141
45     16      Smith's Falls           -1.713     0.180    -2.067     0.127
46     30      Midland                 -2.031     0.131    -2.235     0.107

(a) In this table the population centers are ranked
    according to emissiveness values using adjusted data.

One conclusion from these results is that the nature of
the population centers having access to a park should be
explicitly considered in planning decisions. Take any two
population centers, for example London and Hamilton-Dundas
to the east of London. The Stage I analysis showed (Table 6)
that these two population centers are approximately equal in
emissiveness. This means that if a park were equally distant
from each of these population centers, ceteris paribus, it
would be expected to receive the same number of visits from
London that it would receive from Hamilton-Dundas. On the
other hand, consider the comparison between London and
Sarnia, a population center to the west of London. It is
seen from Table 6 that London is about three times as
emissive as Sarnia. A park equidistant from these two
population centers would therefore receive a greater share
of its visits from London. If virtually the same park were
being contemplated for location in either of the above
sites, its use by the three population centers would clearly
be maximized at the former site, i.e., at a position
equidistant from London and Hamilton-Dundas. (Of course, the
real-world analysis is not so simple since many other
population centers are involved and the ceteris paribus
clause is in general violated; the distance or location
effect must be incorporated explicitly into the analysis.)
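
These comparisons follow directly from the logarithmic indices, since the ratio of two emissiveness values is the antilogarithm of the difference between their ln(E) entries. The sketch below uses the adjusted-data ln(E) values listed for the three centers in Table 6. The dictionary and function names are introduced here for illustration only.

```python
import math

# Adjusted-data ln(E) values for three population centers (Table 6).
LN_E = {"London": 1.876, "Hamilton-Dundas": 1.943, "Sarnia": 0.778}


def emissiveness_ratio(center_a, center_b):
    """Ratio E(a) / E(b), computed from the logarithmic indices."""
    return math.exp(LN_E[center_a] - LN_E[center_b])


print(round(emissiveness_ratio("London", "Sarnia"), 2))
print(round(emissiveness_ratio("Hamilton-Dundas", "London"), 2))
```

The first ratio comes out very close to 3 and the second only slightly above 1, which matches the statements that London is about three times as emissive as Sarnia and approximately equal in emissiveness to Hamilton-Dundas.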

It has been shown that the nature of parks
significantly influences use patterns. Take any two
provincial parks, say Rondeau and Black Lake. The Stage I
analysis showed (Table 7) that Rondeau is about twice as
attractive as Black Lake. This means that, ceteris paribus,
Rondeau would draw twice as many visitors from a city as
would Black Lake, indicating that Rondeau is twice as
"popular" based upon the analysis of these data. Other
similar comparisons could be made using the results depicted
in Table 7.


Stage II Results 


The questions raised by Stage I revolve around what
makes one population center more or less emissive than
others and what makes one park more or less attractive than
others. To answer these questions multivariate analyses were
performed in Stage II to relate the emissiveness and
attractiveness indices of Tables 6 and 7 to particular
characteristics of population centers and parks,
respectively.
Emissiveness Analysis 


Due to the limited amount of data available on
population center characteristics, it was decided that
regression analysis was the appropriate method of relating
camping emissiveness indexes to population center factors.
Two regression runs were made. The first run tested the
relationship

(9) Y(i) = b0 + b1 ln P(i) + ε(i)

where

Y(i) = ln(Ê(i))
P(i) = population of center i
b0, b1 = parameters

using all forty-six population centers as observations. The
second run tested the relationship

(10) Y(i) = b0 + b1 (ln P(i)) + b2 (ln I(i)) + ε(i)

where I(i) represents average income of population center i
and b2 is an additional parameter. In both of the runs,
emissiveness indexes obtained from Stage I using adjusted
v(i,j) data were used. The BMD stepwise regression program
was employed. Table 8 contains the results of the two runs.


It is seen that population is an overwhelmingly
important and statistically significant explanatory
variable. Results of run 1 (with 46 observations) and the
first step of run 2 (with 30 observations) are similar and
indicate that in this outdoor recreation system, population
accounts for about 76 percent of the total variance in
emissiveness. Furthermore, emissiveness appears to be
directly proportional to population. From step 2 of run 2
it is seen that the income variable is unimportant.
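
The form of Equation 9 can be illustrated with an ordinary least-squares fit of ln(Ê(i)) on ln P(i). The sketch below is not the BMD stepwise program used in the Study. The function name fit_log_log and the use of NumPy are assumptions made for illustration, and the inputs are whatever population and emissiveness vectors the reader supplies.

```python
import numpy as np


def fit_log_log(population, emissiveness):
    """Ordinary least squares for Y = b0 + b1 * ln(P), with Y = ln(E).

    Returns (b0, b1, r_squared).
    """
    x = np.log(np.asarray(population, dtype=float))
    y = np.log(np.asarray(emissiveness, dtype=float))

    design = np.column_stack([np.ones_like(x), x])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)

    residuals = y - design @ coef
    total_ss = np.sum((y - y.mean()) ** 2)
    r_squared = 1.0 - (residuals @ residuals) / total_ss
    return coef[0], coef[1], r_squared
```

A slope b1 equal to 1 corresponds to emissiveness being exactly proportional to population. The run 1 estimate reported in Table 8, b1 = 0.926 with a standard error of 0.07676, lies within about one standard error of that value.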

A plot of ln(Ê(i)) versus ln(P(i)) using all 46
observations shows that the functional relationship obtained
from run 1 is not as good as one might expect. Extreme values
are fit well, but there is still much dispersion about the
predicted straight line. Examination of the residuals
uncovered no unusual departures from the assumed
distribution, and no sub-regional pattern of residuals was
apparent.


TABLE 7


TWO SETS OF CAMPING ATTRACTIVENESS INDICES
FOR 50 ONTARIO PROVINCIAL PARKS(a)


                                   Unadjusted Data
       Provincial
Rank   Park                        ln(A)        A
 1     Algonquin                   3.115     22.533
 2     Killbear Point              1.914      6.780
 3     Grundy Lake                 1.924      6.848
 4     Pinery                      1.349      3.854
 5     Oastler Lake                1.248      3.483
 6     Outlet Beach                0.861      2.366
 7     Bon Echo                    0.833      2.300
 8     Chutes                      1.020      2.773
 9     Marten River                0.990      2.691
10     Mikisew                     0.747      2.111
11     Bass Lake                   0.673      1.960
12     Presqu'ile                  0.638      1.893
13     Restoule                    0.465      1.592
14     Rondeau                     0.446      1.562
15     Arrowhead                   0.528      1.696
16     Sauble Falls                0.316      1.372
17     Sturgeon Bay                0.369      1.446
18     Serpent Mound               0.323      1.381
19     Six Mile Lake               0.282      1.326
20     Inverhuron                  0.128      1.137
21     Ipperwash                   0.214      1.239
22     Rideau River                0.332      1.394
23     Samuel de Champlain         0.287      1.332
24     Silver Lake                 0.143      1.154
25     Craigleith                  0.102      1.107
26     Balsam Lake                 0.000      1.000
27     Emily                      -0.052      0.949
28     Mara                       -0.099      0.906
29     Fitzroy                                0.982
30     Sibbald Point              -0.159      0.853
31     Driftwood                  -0.039      0.962
32     Long Point                 -0.330      0.719
33     Black Lake                 -0.233      0.792
34     Darlington                 -0.333      0.717
35     Mississagi                 -0.341      0.711
36     Killarney                  -0.345      0.708
37     Earl Rowe                  -0.303      0.739
38     Point Farms                -0.579      0.560
39     Turkey Point               -0.790      0.454
40     Lake St. Peter                         0.369
41     South Nation               -0.915      0.401
42     Carson Lake                -0.937      0.392
43     Bonnecherre                -1.301      0.272


Adjusted Data

ln(A)        A
 2.952     19.144
 1.805      6.080
 1.729      5.635
 1.356      3.881
 1.186      3.274
 0.867      2.380
 0.771      2.162
 0.747      2.111
 0.730      2.075
 0.665      1.944
 0.635      1.887
 0.582      1.790
 0.451      1.570
 0.433      1.542
 0.425      1.530
 0.367      1.443
 0.305      1.357
 0.290      1.336
 0.274      1.315
 0.232      1.261
 0.199      1.220
 0.132      1.141
 0.131      1.140
 0.080      1.083
 0.080      1.083
 0.049      1.050
-0.044      0.957
-0.102      0.903
-0.128      0.880
-0.157      0.855
-0.166      0.847
-0.190      0.827
-0.256      0.774
-0.363      0.696
-0.373      0.689
-0.376      0.687
-0.489      0.613
-0.653      0.520
-0.838      0.433
-0.895      0.409
-0.918      0.399
-1.158      0.314


       Study                       Adjusted Data      Unadjusted Data
       Code    Provincial
Rank   #       Park                ln(A)      A       ln(A)      A
44     25      Windy Lake         -1.183    0.306    -1.466    0.231
45     11      Devil's Glen       -1.215    0.297    -1.326    0.266
46     13      Wheatley           -1.215    0.297    -1.447    0.235
47     15      Holiday Beach      -1.219    0.296    -1.472    0.229
48     19      Antoine            -1.349    0.259    -1.582    0.206
49      1      Rock Point         -1.551    0.212    -1.668    0.189
50     10      Selkirk            -2.203    0.110    -2.519    0.081

(a) In this table the parks are ranked according to
    attractiveness values using adjusted data.


TABLE 8


EMISSIVENESS REGRESSION RESULTS


                                                      S.E. of
Run   Step   Parameters       Std. Error    R²        Estimate
1     1      b0 = -9.742      -             0.7677    0.5588
             b1 =  0.926      0.07676
2     1      b0 = -10.689     -             0.7563    0.5794
      2      b0 = -8.808      -
             b1 =  1.017      0.12749       0.7566    0.5898
             b2 = -0.229      1.38526

That population is such an important influence in
recreation trip-making is not surprising; this result
merely confirms what many have intuitively sensed over the
years. It may be possible, by disaggregating population in
different ways, or by including additional variables, or by
using different functional forms, to account for additional
variance in emissiveness. But it can be effectively argued
that, based on the above results, these refinements could
easily be judged unnecessary, especially in view of the
crude nature of the population estimates used in the
analysis.


Attractiveness Analysis


Data on park characteristics were somewhat more
plentiful than was the case with data on population centers.
Therefore, it was reasonable to employ AID to find those
park characteristics that were most closely associated with
differences in attractiveness. The particular park
characteristics employed in the analysis were given in
Tables 3 and 4.

For exploratory purposes, Stage II results were obtained
using both adjusted and unadjusted Stage I attractiveness
estimates. It turned out, however, that the Stage II results
were nearly identical. Thus, only results from using the
adjusted data are reported. The AID tree diagram for the
analysis using adjusted data is given as Figure 2. (It is
important to note that in this run the stopping rule used
was such that as soon as a split resulted in a subgroup with
fewer than twelve observations, no further splits were
allowed.)

Figure 2 is interpreted as follows. Group 1 (G1) is the
total population of 50 observations (i.e., n1 = 50). The
mean value of the dependent variable, the logarithms of the
Stage I attractiveness estimates, is m1 = 0.035. After
examining all possible binary splits for each possible
"independent" variable in Table 3, the maximum reduction in
the unexplained sum of squares is obtained by splitting
Group 1 on the basis of "total park acreage". All those
parks with 850 acres or less are in Group 2 (n2 = 35); all
those parks with greater than 850 total acres are in Group 3
(n3 = 15). The mean attractiveness value for Group 2 is m2 =
-0.241; the mean attractiveness value for Group 3 is m3 =
0.681. The horizontal axis of the tree diagram is scaled
accordingly.
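
The split search described above can be sketched in Python. This is only an illustration under stated assumptions: each predictor is taken to be an ordered integer code, splits are of the form "code <= cut" versus "code > cut", and the function names, the `min_size` parameter and the toy data are hypothetical rather than taken from the AID program used in the study.

```python
from typing import Dict, List, Optional, Tuple


def sum_sq(values: List[float]) -> float:
    """Sum of squared deviations about the group mean."""
    if not values:
        return 0.0
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)


def best_binary_split(
    y: List[float],
    predictors: Dict[str, List[int]],
    min_size: int = 1,
) -> Optional[Tuple[str, int, float]]:
    """Return (variable, cut, reduction) for the split that most reduces
    the unexplained sum of squares, or None if no split is admissible."""
    total = sum_sq(y)
    best = None
    for name, codes in predictors.items():
        for cut in sorted(set(codes))[:-1]:  # every cut between ordered codes
            low = [v for v, c in zip(y, codes) if c <= cut]
            high = [v for v, c in zip(y, codes) if c > cut]
            if len(low) < min_size or len(high) < min_size:
                continue
            reduction = total - (sum_sq(low) + sum_sq(high))
            if best is None or reduction > best[2]:
                best = (name, cut, reduction)
    return best


# Toy data, invented for illustration: log attractiveness and two coded variables.
y = [-1.0, -0.8, -0.9, 0.7, 0.9, 1.1]
predictors = {
    "acreage": [1, 1, 2, 3, 3, 3],
    "campsites": [1, 2, 1, 2, 1, 2],
}
split = best_binary_split(y, predictors)
```

On the toy data the search selects the acreage variable with the cut between codes 2 and 3, mirroring the way Group 1 is divided on total park acreage. Applying the same function to each resulting subgroup in turn would build up the tree step by step.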

After Step 1, additional binary splits are formed by
applying exactly the same procedure to Groups 2 and 3 that
was applied to the original set of data, Group 1. Then the
procedure is again applied to the subgroups formed at the
end of Step 2, and so on down the line. The vertical axis of
the tree diagram thus shows the groups that were formed at
each step of the analysis. For example, Groups 4 and 5 were
obtained by splitting Group 2 on the basis of "number of
campsites". On the other hand, Groups 6 and 7 were formed by
splitting Group 3 on the basis of "length of swimming
beach". Thus, while total acreage is the most influential
variable in accounting for differences in park
attractiveness, the second most influential variable is
"number of campsites" for parks with a total acreage of 850
acres or less and "length of swimming beach" for parks with
a total acreage of greater than 850 acres. This result could
be interpreted in a number of different ways. Most likely, a
"severe" interaction has been detected; that is, the
relative importance of an independent variable depends on
the levels of its predecessors.

Severe interaction occurs whenever the upper and lower
branches of an AID tree split on different variables;
"mild" interaction occurs when upper and lower branches
split on the same variable. For example, continuing down the
AID tree of Figure 2, it is seen that at high coded values
of X2, the next-best variable is X3 (length of swimming
beach). Similarly, for lower codes of X2 the next-best
variable is X3. The difference between the average value of
attractiveness in group 9 (i.e., -0.224) and the average
value of attractiveness in group 8 (i.e., 0.306) can be
construed as a measure of the "main effect" of X3 at high
levels of X2. The corresponding difference for groups 11 and
10 gives an estimate of the main effect of X3 at low levels
of X2. The difference between these two differences then
provides the conventional statistical measure of the
interaction between X2 and X3. If this difference of
differences is negligible, then there is no interaction. If,
on the other hand, the difference between average
attractiveness of groups 8 and 9 is markedly different from
the difference between the average attractiveness values for
groups 11 and 10, then the interaction is large. Under
suitable assumptions, the statistical significance of the
magnitude of this interaction can be assessed. In the AID
tree of Figure 2, the difference of the differences is equal
to 0.255, and there is thus some evidence of a mild
interaction.
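
The difference-of-differences measure can be written out as a short Python calculation. The group means below are the rounded log-attractiveness values listed in Table 9; the variable names are illustrative only.

```python
# Group means (log attractiveness) as listed in Table 9.
mean = {8: 0.306, 9: -0.224, 10: -0.233, 11: -1.013}

effect_high_x2 = mean[9] - mean[8]    # effect of X3 at high codes of X2
effect_low_x2 = mean[11] - mean[10]   # effect of X3 at low codes of X2
interaction = effect_high_x2 - effect_low_x2
```

With these rounded means the two effects are -0.530 and -0.780, so the interaction comes to about 0.25, close to the 0.255 quoted in the text; the small gap presumably reflects rounding of the tabulated means.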

Figure 2 shows that a total of six terminal groups were
formed: these are groups numbered 6, 7, 8, 9, 10 and 11.
Each of the original observations belongs to one (and only
one) of these terminal groups. Table 9 gives some relevant
information for each group. The groups are arranged
according to group means. Each terminal group is defined by
a particular combination of codes for independent variables.
For example, group number 9 of Figure 2 is seen to be
defined by X1 = (1,2), X2 = (2,3), and X3 = (2,3). No other
independent variables are required to define terminal group
9. Enumeration of all terminal groups in the above fashion
shows that even in this simple structure, the terminal
groups are defined by widely different combinations of
characteristics. Table 10 presents the parks contained in
each of these groups. Tables 9 and 10 illustrate a common
and interesting result: two terminal groups (9 and 10) have
mean attractiveness values that are approximately equal. Yet
each group is defined by a different set of characteristics.

Splits were obtained on only three of the thirteen
independent variables. Due to the relatively small sample
size and the particular stopping rule used in this analysis,
further splits were impossible to obtain. It is possible
to conclude, however, that the three variables for which
splits were obtained are the most important discriminatory
variables in this sample; it is conceivable that with a
larger sample size, other less important variables might be
introduced in later stages.


TABLE 9


TERMINAL GROUPS FORMED BY AID
CLASSIFICATION TECHNIQUE


                         Mean Attractiveness
Terminal   # Observa-    Value                 Defining
Group #    tions         Log       Transf.     Characteristics
   7           9          1.071     2.918      X1=(3), X3=(3)
   8           8          0.306     1.358      X1=(1,2), X2=(2,3), X3=(1)
   6           6          0.097     1.102      X1=(3), X3=(1,2)
   9          14         -0.224     0.799      X1=(1,2), X2=(2,3), X3=(2,3)
  10           7         -0.233     0.792      X1=(1,2), X2=(1), X3=(1)
  11           6         -1.013     0.363      X1=(1,2), X2=(1), X3=(2,3)


SOME PRACTICAL APPLICATIONS 


It was mentioned previously that results of successful
applications of the proposed two-stage procedure for
identifying important factors accounting for variations in
aggregate outdoor recreation trip-making behavior would be
useful to both planners and researchers.


An Application for Planners 


An important park planning function is the optimum
design of park facilities subject to physical, financial and
political constraints. In the design of new parks, or of
expanded versions of existing parks, the planner is faced
with choices. For any particular park he is, of course,
interested in ascertaining the visitor implications of each
of several alternative design possibilities: he would like
to know, for each design possibility, how a park would
compare with other parks in terms of attractiveness. Results
such as those provided by this analysis are well suited for
making these kinds of judgements.

TABLE 10


HOMOGENEOUS GROUPINGS OF 50 ONTARIO PROVINCIAL PARKS
AS OBTAINED BY AID ANALYSIS


            Mean
Terminal    Attractiveness
Group       Log       Transf.    Park Group
   7         1.071     2.918     Algonquin        Presqu'ile
                                 Killbear Pt.     Restoule
                                 Grundy Lake      Rondeau
                                 Pinery           Earl Rowe
                                 Bon Echo
   8         0.306     1.358     Oastler Lk       Six Mile Lk
                                 Bass Lake        Silver Lake
                                 Sauble Falls     Fitzroy
                                 Serpent Md       Black Lake
   6         0.097     1.102     Marten River     Balsam Lake
                                 Arrowhead        Mississagi
                                 Samuel de C      Killarney
   9        -0.224     0.799     Outlet Beach     Sibbald Pt.
                                 Mikisew          Long Point
                                 Inverhuron       Darlington
                                 Ipperwash        Point Farms
                                 Rideau River     Turkey Pt.
                                 Emily            Wheatley
                                 Craigleith       Selkirk
  10        -0.233     0.792     Chutes           Carson Lake
                                 Sturgeon Bay     Devils Glen
                                 Mara             Antoine
                                 South Nation
  11        -1.013     0.363     Driftwood        Windy Lake
                                 Lake St. Peter   Holiday Bch.
                                 Bonnechere       Rock Point


The AID tree shown in Figure 2 may be used to classify
new observations, i.e., new or improved parks, using only
values of the park's relevant independent variables.
Suppose, for example, that the coded values of X1, X2 and X3
are given by 1, 3 and 1, respectively, for a new park. The
park might have, for example, 700 acres, 200 campsites and a
swimming beach that was fifty feet in length. The AID tree
yields an estimate of the park's attractiveness simply by
determining to which terminal group the new observation
belongs. Beginning at the top of the AID tree, it is seen
that the left branch is taken because X1 = 1 for the new
park. At the next split the right branch is taken because X2
= 2. The next branch yields terminal group 9 because X3 =
2. The predicted value of attractiveness for this
hypothetical park is graphically seen to be approximately
-0.224 in logarithmic terms. Thus an informed estimate of
attractiveness is obtained.
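
The classification step can be sketched as a small lookup in Python. The code sets and group means are those listed in Table 9; the function name is hypothetical, the branch for group 11 assumes it is the complement of group 10 on X3, and the example call uses the codes followed in the walk-through above (X1 = 1, X2 = 2, X3 = 2).

```python
import math

# Terminal-group means (log attractiveness) from Table 9.
GROUP_MEAN = {6: 0.097, 7: 1.071, 8: 0.306, 9: -0.224, 10: -0.233, 11: -1.013}


def terminal_group(x1: int, x2: int, x3: int) -> int:
    """Walk the tree using the code sets listed in Table 9."""
    if x1 == 3:                       # parks of more than 850 acres
        return 7 if x3 == 3 else 6
    if x2 in (2, 3):                  # higher campsite codes
        return 8 if x3 == 1 else 9
    return 10 if x3 == 1 else 11      # X2 = 1


group = terminal_group(1, 2, 2)
log_attractiveness = GROUP_MEAN[group]
attractiveness = math.exp(log_attractiveness)  # back to the original scale
```

For these codes the walk ends in terminal group 9, whose mean of -0.224 corresponds to roughly 0.80 on the untransformed scale, in line with the "Transf." column of Table 9.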


If no information were available on the park's
particular characteristics as compared to the
characteristics of other parks that give rise to
differential attractiveness values, the best estimate that
could be made would be to assume the average value of the
total sample, which from Figure 2 can be seen to be equal to
0.035 (in logarithmic terms). This is, in a sense, an
"unconditional" predicted value. When the unique
characteristics of the new observation are used in the
assessment, the "conditional" prediction thereby obtained
represents a substantial refinement over an unconditional
prediction. In this particular case it was seen that a park
with the characteristics cited above is less attractive than
the average park. Translated into absolute numerical scales,
the implied difference in attractiveness magnitudes is about
0.8. Interpreted, this implies that (all other things
being equal) the new park would draw about eighty percent of
the visitors that the "average park" would draw. Planning
can be performed more effectively by making use of this
information.


An Application for Model Building 


Several approaches for relating park use to the factors
which "explain" it were alluded to in the Introduction. It
was noted that researchers have attempted to construct
models which use as dependent variables various measures of
park use and as independent variables various suspected
influencing factors. Independent variables that are specific
to population centers and to parks are, of course, generally
considered to be very relevant. But, due to the virtually
infinite number of possible combinations of variables that
can be hypothesized to characterize population center and
park effects on outdoor recreation park use, the selection
of particular variables to include in the models is a
difficult undertaking. As a result, subjective ad hoc
selection procedures are often adopted. The results provided
by this study can be viewed as providing a somewhat
objective basis for selecting these relevant variables.

Viewed in this general sort of way, the overall results
of the study can be taken as being supportive of previous
modeling efforts that have used population and park
size-related measures as independent variables. It was
found, for instance, from regressions on the E(i)'s that
population size by itself is a very good indicator of
population center emissiveness (i.e., participation) and
from AID analysis on the A(j)'s that acreage and other
size-related characteristics of parks are reliable
indicators of attractiveness (i.e., popularity). These
measures may now be used with some justification in models
of recreation trip-making, whereas before their use was
based strictly on what might be termed "informed judgment".

Concentrating on the AID analysis, the attractiveness
results, in particular, can serve to lead future research
and model-building investigations into potentially fruitful
directions. For example, regarding the AID classification
tree of Figure 2, it was pointed out that both mild and
severe interaction existed in the system under study, and
that no unique set of variables would be expected to suffice
for the modeling of visitor flows to all parks. On the basis
of these findings the researcher might proceed to develop
separate regression models for observations in each
terminal box and thereby obtain coefficients with which to
judge the relative partial effects of the associated
independent variables on each subgroup.


GENERAL CONCLUSIONS 


The objective of the analysis was to put to test a
proposed methodology for drawing certain inferences about
important factors influencing park usage. The methodology
proved to be a promising way of exploring emissiveness,
attractiveness and location - questions that have
traditionally plagued park planners and analysts.

As far as specific numerical findings are concerned,
two results become clear. Evidence was provided that
population size is by itself a useful indicator of
participation differences in the region under study. There
is thus some justification for using a population
measure as a basis for park use models.

On the other side of the coin, park size was seen to be
a most important factor in attracting trips to a particular
recreation area. It was shown that large parks attract large
numbers of people relative to small parks, all other things
being equal. Further, the number of camping sites appears to
be a very important variable for small parks, whereas length
of swimming beach is only important in large parks. It can
be concluded that size-related measures are useful as
indicators of recreation site attractiveness in modeling
efforts. The new two-stage methodology, while having many
advantages over existing methodologies that attempt to
perform similar analyses, is not without its limitations. To
properly assess its ultimate usefulness, it would be
necessary to conduct other tests on a more expanded data set
and a large number of regions. But it is possible to
systematically list some of the advantages and
disadvantages.


Advantages and Limitations 


The proposed procedure is, in a sense, economical.
First, data requirements for the first stage are quite
meager - only matrices (v(i,j)) and (d(i,j)) are required.
And if significant differences do not appear in the
estimates (E(i)) and (A(j)) after application of Stage I,
the second stage of the analysis need not be undertaken,
consequently effecting a savings in data collection costs
since population center and/or park characteristics data
would not be required.


It is clear that classification is regarded as a
central requirement of the approach tested in this report.
In this respect, the AID technique used in Stage II has many
advantages over other methods of analysis in that it
determines the joint effects of many variables by analyzing
variance between grouped data while making only a minimal
number of statistical and algebraic assumptions. Other
methods such as regression analysis require very strong
assumptions that are believed to be satisfied only
infrequently in the analysis of nonexperimental data. Other
advantages of AID, such as the detection of "mild" and
"severe" interactions, have already been mentioned.

By far the most useful feature of the methodology under
test here is the systematic extraction, from observed data
on recreation choices, of factors influencing park use. That
is, the underlying assumption is that these factors are
revealed by people who make observable choices in their
recreational pursuits. This "deductive" sort of reasoning
stands in contradistinction to the "inductive" reasoning
that has characterized many previous modeling studies.

There are, of course, many real limitations of both
stages of the analysis. In the first stage, transformation
of the model (Equation 6) into logarithmic form produces
biased estimates of its parameters. Estimation in Stage I by
Equation 8 is difficult when there are many zeros in an
observed spatial interaction matrix since the logarithm of 0
does not exist. Finally, due to possible heteroscedasticity
problems the hypothesis-testing procedures, if applied, can
be rendered invalid. The potential user of this methodology
should take these considerations into account.

With regard to the second stage of the analysis, the
AID classification technique produces results that are not
easily interpreted in the analysis of trip-making systems of
small dimension. And since the method produces a sequence of
dichotomous splits, only the best "local" split is obtained
at each step of the sequence. Thus a series of locally
optimal decompositions is obtained where each split is
contingent upon the previous one. It is possible that a
variable "far down the line" could be almost as
discriminating as one of the first. Had the program split
differently in the beginning, a different AID tree would
result (one resolves this problem by running the analysis
several times with judicious elimination of variables). The
codification of independent variables for AID analysis must
also be done judgmentally. Overly coarse coding loses
information; overly refined coding can increase computing
costs without substantially improving the results.


Recommendations for Further Research


All in all, the methodology described in this report
appears promising. Only by applying the method to other
situations and comparing results with alternative methods
can its usefulness be ascertained. The following
recommendations for further study can be given:


(1) This pilot study concentrated attention on only one
region in Canada. Much can be learned about the
generality of the results by applying the two-stage
methodology to analyze corresponding outdoor recreation
travel patterns in other regions.


(2) This pilot study concentrated attention on one visitor
type - main destination campers. It would be
interesting to compare these results with outdoor
recreation travel parameters estimated for the other
relevant visitor type (main destination day-users) in
the same region.


(3) This pilot study used data at its highest level of
aggregation - the total visitors travelling from i to j
during some long time period. It is possible to
explore, by use of the proposed methodology, travel
patterns of various population subgroups. For example,
it is theoretically possible by use of CORD Study data
to identify various characteristics, such as age and
income, for each visitor group. It is also possible to
pinpoint dates of travel. Understanding of outdoor
recreation travel patterns might be enhanced on the
basis of some of the above identifiable
characteristics. Perhaps even an analysis of individual
trip-makers could be attempted.


(4) The Stage I analysis produces biased estimates of
parameters. Work is proceeding by this author toward
the development of an estimation technique not fraught
with the many problems of using analysis of covariance
on a transformed version of the postulated model. When
this work is completed, the new methodology should be
used to at least re-estimate the parameters documented
in this report.


(5) The AID technique used as a statistical tool in Stage II
calls for large numbers of observations. Its usefulness
in the outdoor recreation context is therefore
jeopardized. Nevertheless it is possible and desirable
to increase the data base to a much larger size by
incorporating data pertaining to parks under the
jurisdiction of other agencies (e.g., various
Conservation Authorities). In this regard, work
should proceed on coordinating data collection efforts
with these agencies.


(6) The two-stage methodology presented in this report
attempts to, first, uncover the structure of some
existing data and, second, interpret this structure so
that we may come to an understanding of how it might
have come about. Other methodologies, some of which
were mentioned in the Introduction, purport to answer
some of the same questions. (To this author's knowledge
no other procedure addresses all of the same
questions.) A useful exercise would involve a
comparison of corresponding results obtained from this
study with those obtained from applying other
methodologies to the same or similar data.


APPENDIX 


MAIN DESTINATION/STOPOVER ADJUSTMENT FUNCTION 


Since the camper data provided by ODLF and used in this
study do not distinguish main destination and stopover use,
a simple adjustment procedure was applied to the v(i,j) data
in order to separate these two types of uses. Basically, it
is hypothesized that stopover use, as a percentage of total
use, increases as the distance between population center and
park increases. Therefore, v(i,j) observations associated
with large distance measures would have a greater stopover
component than observations that are associated with small
distances. A function expressing this idea is:


(A1)   v'(i,j) = v(i,j) exp(a d(i,j))


where v(i,j) is the observed number of total camper trips
from i to j, d(i,j) is the associated distance and v'(i,j)
is the estimate of the main destination component of v(i,j).
The parameter a must be empirically estimated. Note that,
for a negative value of a, if d(i,j) is small, v'(i,j) is
close to v(i,j), and as d(i,j) gets larger v'(i,j)
approaches 0.
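
As a minimal sketch of how Equation A1 behaves, the adjustment can be written as a one-line Python function. The function name and the trip counts and distances are hypothetical; the value a = -0.004 is the estimate reported later in this appendix.

```python
import math


def main_destination_trips(v: float, d: float, a: float) -> float:
    """Equation A1: v'(i,j) = v(i,j) * exp(a * d(i,j))."""
    return v * math.exp(a * d)


# Hypothetical observations: 100 camper trips at two different distances (miles).
near = main_destination_trips(100.0, 10.0, -0.004)
far = main_destination_trips(100.0, 400.0, -0.004)
```

At 10 miles almost all of the 100 trips are retained as main-destination trips (about 96), while at 400 miles only about 20 are, which is the distance-decay behaviour the adjustment is meant to capture.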

Data from the CORD Study surveys were used to estimate
a. Population center to park data for twelve parks were
employed in the analysis; a total sample of 163 observations
was used. It was found that a = -0.004 gave the best
results, and was significant at the 0.01 level.
Unfortunately, the R² value associated with this estimate
was quite low (R² = 0.40). The estimated value of a = -0.004
does, however, suggest a "half-life" of about 200 miles;
that is, when d(i,j) = 200 in Equation A1, then v'(i,j) is
approximately equal to v(i,j)/2 and approximately one-half
of the observed number of campers are presumed to be
main-destination users, the other half being stopovers.
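
The "half-life" figure can be checked with a short calculation. This sketch simply solves exp(a d) = 1/2 for d using the reported a = -0.004; the variable names are illustrative.

```python
import math

a = -0.004                          # estimate reported in the text
half_life = math.log(0.5) / a       # distance at which exp(a * d) = 1/2
share_at_200 = math.exp(a * 200.0)  # main-destination share at 200 miles
```

The exact half-life implied by a = -0.004 is roughly 173 miles, and at 200 miles the retained share is roughly 0.45, so the "about 200 miles" and "approximately one-half" in the text are round figures.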

Each of the 1300 (i.e., 26 x 50) v(i,j) observations was
adjusted using Equation A1 with a = -0.004. Due to the
relatively low R² value, there was no overwhelming evidence
that this procedure provides realistic estimates of main
destination users; there are undoubtedly many other factors
besides distance which might be included in the adjustment
model. Yet this procedure can be defended on conceptual
grounds as well as on the basis that, given the significance
of the regression results, some adjustment based on these
results was more realistic than no adjustment at all.

In the trip data there were many v(i,j) observations
that equalled 0. The prevalence of 0's presents a problem in
the log-linear Stage I analysis, which uses logarithms of
v(i,j) observations as dependent variables. There is no
clear-cut resolution to this problem and a compromise
procedure was employed to eliminate the possibility of
taking the logarithm of zero. Specifically, the Stage I
analysis using adjusted (i.e., v'(i,j)) and unadjusted
(i.e., v(i,j)) data employed ln(v'(i,j) + 1) and ln(v(i,j) +
1) as dependent variables.
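
The zero-handling compromise can be illustrated with a small Python sketch. The function name and the trip matrix are hypothetical; only the ln(v + 1) transformation itself comes from the text.

```python
import math
from typing import List


def log_plus_one(trips: List[List[float]]) -> List[List[float]]:
    """Apply ln(v + 1) elementwise so that zero flows map to 0 instead of failing."""
    return [[math.log(v + 1.0) for v in row] for row in trips]


# Hypothetical 2 x 3 origin-by-park trip matrix containing zeros.
v = [[0.0, 12.0, 3.0],
     [7.0, 0.0, 0.0]]
y = log_plus_one(v)
```

Cells with zero trips become exactly 0 on the transformed scale, so every origin-park pair can enter the log-linear Stage I estimation.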
CORD STUDY TECHNICAL NOTE 11


A MODEL OF VISITOR FLOWS CONSIDERING 
A BASIC PARTICIPATION FUNCTION 
AND AN "ALTERNATIVE FACTOR": 
SIMULATION AND PARAMETER ESTIMATION 


J. Beaman, H.K. Cheung, and N.H. Do


ABSTRACT 


This paper presents what may be called a generalization
of the Cesario model, designed to show what behavioural
structure may be embedded in the parameter that Cesario has
called city emissiveness. A function is introduced that is
said to be a way of defining Cesario's emissiveness. This
function is shown to be plausible in that it reduces to
emissiveness in the way one might think of it in the
ordinary Cesario model if a city is served by only one site
(an isolated site). Also, the function is constructed in
such a way that people do not simply double the number of
visits they make to parks because a second park is
introduced into a system that already has one park, even if
the new park is as attractive as the original park and at
the same distance from the given city.

The estimation problems resulting from the form of the
generalization are described. The authors report that
original estimation attempts were made which resulted in the
discovery of a degeneracy problem in estimation. The nature
of this problem is described and two approaches to
determining the parameters of the generalized model, which
in the paper are called inherent emissivenesses of cities,
are presented. The one method involves determining
coefficients similar to the original Cesario emissivenesses
but this time corrected for availability of supply. The
other approach is analogous to that explored by Cesario when
he tried to explain his emissiveness values by aggregate
socio-economic information on cities.

It is not claimed that the modification to the Cesario
model improves the model's overall effectiveness in
explaining park use. It is pointed out that no more total
variance in origin-destination flow data is explained by
this generalization than by the original model. Rather, it
is suggested that by eliminating effects of supply on
Cesario's emissiveness coefficients it is made possible to
explain what are called the inherent emissiveness
coefficients by using socio-economic characteristics. These
parameters are indicated as the ones that should be known
when the Cesario model is used to estimate park use in an
altered system.

In the way of general critique, it is pointed out that
the Cesario model could be described as not being adequately
successful in explaining people's general origin-destination
flow behaviour. The generalized model is noted as being
equally deficient since only the internal structure of the
Cesario model has been modified. Some indications are given
about why both these models might be deficient so
researchers can have a better feel for where the model
proposed is applicable and in what other directions research
might be focused.


INTRODUCTION 


Cesario has presented a gravity model formulation to
explain observed outdoor recreation travel patterns between
cities (origins) and parks (destinations) for overnight
camping. The basic concepts behind this model are that: (1)
associated with each city is an inherent "emissiveness",
E(o), which is a (relative) measure of the degree to which
people in that city will participate; (2) associated with
each park is an inherent "attractiveness", A(d), which is a
(relative) measure of the degree to which people will visit
that park; and (3) visits between any origin and
destination decrease with the distance between them. The
complete formulation of the model and procedures for
estimating the E(o)'s and A(d)'s are given in CORD Study TN
4.

Cesario suggests that the emissiveness and
attractiveness parameters computed using his model can be
analyzed by explaining them by a function of the
socio-economic characteristics of the population in origin
areas and a function of park characteristics, respectively.
Now, without getting into the issue of whether Cesario meant
by this suggestion that the emissiveness coefficients
calculated were dependent only on the socio-economic
characteristics of the populations, one may note that it is
reasonable to propose that the emissiveness of a city
depends on the amount of supply to which it is exposed. It
is this relationship that is of concern in this paper when
the authors proceed to generalize the Cesario model in such
a way as to explicitly introduce a "supply effect" into
Cesario's emissiveness coefficient.

There are a number of factors that can be
conceptualized as influencing people's use of parks; the
ones studied here can be called the "alternative" factor
and the factor of "supply generating demand". One may
consider the following example as an aid to understanding
what prompts the authors to suggest that there is a need to
study the likely influence of alternatives and "the level of
supply" on people's behaviour. Suppose a city which is
supplied by a single park for camping is given access to
another site, with the second park equally as attractive as
the first and adjacent to, or at an equal distance but in
the opposite direction from the city as, the first park. It
is possible to conceive that total visits (demand) for the
two parks will be greater than total visits when there was
only one park. But, on the other hand, the second site
offers the same "good" or camping experience with a given
attractiveness at the same price (distance, time, camping
fee, etc.) as the first park, so "consumption" should not
change. Does the original demand merely divide between the
two parks? If not, and it increases, does one have the
paradoxical situation of supply generating demand?

Of course any paradox in the above example is explained
away if one supposes that the alternative park provides
variety, which stimulates demand. Further, supply may affect
people's awareness or knowledge of the activity, camping,
eventually resulting in participation being related to level
of supply as indicated in Figure 1. Supply factors
frequently influence demand for goods in this manner (see TN

When supply is small one might see that for a slightly
larger supply there is a much greater demand. From a dynamic
and behavioural perspective, at some point awareness,
variety, etc. are such that demand "takes off" in relation
to supply. Each new facility that is added further serves to
convince people that an activity is the thing to do. That
is, it is the thing to do up until saturation is reached.
Then again, little demand expansion is to be expected when
new supply is introduced into a system.

One may note that the type of "s-curve model" shown in
Figure 1 is commonly used to explain the dynamic growth in
sales when new products are introduced. But here the
concern is not with a dynamic formulation: the concern is
with the new equilibrium that will be reached once a new
supply has been introduced into a system and people have
"totally" adapted to the new configuration.


MODEL SPECIFICATION 


Consider, for example, a centre supplied by two sites,
with a new site equally as attractive as an original site
and, say, at an equal distance in the opposite direction. It
is possible to think that after the second site was created
and its use reached equilibrium, total attendance at the two
parks would not be the same as it was at the one park before
creation of the second site. In fact one could expect an
increase in total use as suggested in Figure 1. What is
important about the example just introduced is that, in
classical supply-demand terms, the new site offers the same
product at the same travel-price, etc., so one might say
that consumption should not change. However, the west Texas
reservoir creation situation cited in Clawson-Knetsch (see
Reference 12) or arguments based on increased utility of
trips when a variety of destinations is available suggest
that the original use does not merely divide between the two
parks when the second park is created. Total attendance at
parks is expected to increase: creating supply is expected
to "create" demand.

Deriving the model discussed here is a convenient way
to expose the premises on which the SGD (supply generates
"demand") model is based. Accept that for an isolated site,
the only site serving an origin with a certain experience
for which there is no substitute, visitor flows are
specified by Equation 1 below:


(1) V(o,d) = F(D(o,d)) A(d) AE(o)


WHERE


V(o,d) is the flow of some fairly homogeneous type of
       visitors from an origin, o, to a destination, d;

D(o,d) is the "distance" from o to d measured in appropriate
       time-distance units;

F( )   is a function expressing the way use varies with
       "distance";

A(d)   is the attractiveness of the destination, d; and

AE(o)  is the emissiveness of the origin, o.


Equation 1 is a special case of a more general equation
discussed by Cesario (see TN 4). It implies that the amount
of visitor flow from a city to the only destination of a
given kind that serves it for main-destination trips is
dependent on the attractivity of the destination area (see
TN 9 and TN 33 on the issue of destination area
attractivity) and the distance to the destination, and all
this in the context of a certain type of person going to the
given park for a given purpose (see TN 33 for more general
concerns).

But, one may ask, what happens when there are two parks
of differing or similar attractivities at differing or
similar distances from a city that service the city in a
similar way for a given type of main-destination visits (e.g.
week-end recreational camping visits)? It is suggested that
the total participation that will be generated when new
sites are entered into a system is not generally as great as
it would be if every site could be treated as an only site,
so that flows computed using Equation 1 could simply be
added. Rather it is proposed that introducing a new site
into a system will (at equilibrium) usually cause total use
to rise by less than the amount that the new site would
receive if it were the only site, but total use will be
more than there was when only the original site was there.

The ideas already introduced make it convenient to
state:

Assumption 1   It is possible and meaningful to speak of a
               city's isolated absolute emissiveness, AE(o),
               to a site of unit attractivity at zero
               distance.


In particular, one should note that since the distance
functions used here are of the form exp(-aD(o,d)) there is
no problem with use becoming infinite as distance goes to
zero (as would be the case with D(o,d) to some power).
Actually, one does not expect use of parks to increase very
rapidly as one goes from being one quarter of a mile from a
park to one eighth of a mile. Nor does one expect use to
become "infinite" because a person lives in a park. As well,
for large distances there are problems with the "distance to
a power" function that do not arise when an exponential
function is used (the nature of the problem is indicated in
TN 14). In this regard, behaviourally the exponential
function used makes more sense than using distance raised to
some negative power to explain number of trips based on
distance travelled.
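
The contrast between the two distance functions can be
sketched numerically. The decay coefficient a = 0.05 and the
power b = 2 below are illustrative assumptions only; neither
value comes from the Study.

```python
import math

def exponential_decay(distance, a=0.05):
    """exp(-a * D): finite (equal to 1) at zero distance. a is assumed."""
    return math.exp(-a * distance)

def power_decay(distance, b=2.0):
    """D ** (-b): grows without bound as distance approaches zero. b is assumed."""
    return distance ** (-b)

# Moving from a quarter of a mile to an eighth of a mile from the park.
exp_ratio = exponential_decay(0.125) / exponential_decay(0.25)
pow_ratio = power_decay(0.125) / power_decay(0.25)

print(f"exponential form: use rises by a factor of {exp_ratio:.4f}")
print(f"power form:       use rises by a factor of {pow_ratio:.1f}")
print(f"exponential form at zero distance: {exponential_decay(0.0)}")
```

Under these assumed values the exponential form changes use
by well under one percent over that eighth of a mile, while
the power form multiplies it several times over and is
undefined at zero distance.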

Assumption 2 following suggests what isolated absolute
emissiveness really means.


Assumption 2   The isolated absolute emissiveness of a city
               to its isolated site is a parameter that
               expresses the aggregate effect of a number of
               factors on the level of participation in an
               activity that the city would generate to its
               isolated site of unit attractivity at zero
               distance and it may be defined by one of the
               following increasingly specific equations:


(2A) AE(o) = F(city size, city age-sex distribution
               information, ...)


(2B) AE(o) = C(1,o) POP(o) G(% age-sex distribution,
               % income distribution, etc.)


(2C) AE(o) = C(2,o) (N(o) U + Σ B(i,j) n(i,j))


WHERE now, as in TN 6, N(o) refers to the population of o
and n(i,j) is the number of people in o who have level j of
socio-economic variable i.

In Equations 2B and 2C above the constants C(1,o) and
C(2,o) can be considered to reflect city uniqueness that
exists once total population and a population's socio-
economic distribution have been considered. Also the reason
for having C(1,o) and C(2,o) in the above relates to
problems of defining absolute scales and to identification
problems in estimation, both of which are commented on
subsequently.

In particular, Equation 2C for which a derivation is
given in TN 6 implies that AE(o) has a simple form that
involves the total potential for participation in an
activity being proportional to the number of people in an
origin area and dependent on the socio-economic
characteristics of that population (being poor, aged, etc.).
In fact, one may suggest that the age, income, etc. effects
implied by Equation 2C are corrections that may be applied
to any city (see TN 12) while city uniqueness is embodied in
C(2,o).
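
As a minimal sketch of how Equation 2C might be evaluated,
the function below takes U to be a base participation rate
and treats the B(i,j) as additive corrections per person.
The city, the rate and the effect values are all
hypothetical and are used only to show the arithmetic.

```python
def isolated_absolute_emissiveness(c2, population, base_rate, effects, counts):
    """Equation 2C: AE(o) = C(2,o) * (N(o) * U + sum of B(i,j) * n(i,j))."""
    correction = sum(effects[key] * counts[key] for key in counts)
    return c2 * (population * base_rate + correction)

# Hypothetical city of 10,000 people with a base rate U = 0.20.
# Keys are (socio-economic variable i, level j); values are assumed.
effects = {("age", "65+"): -0.10, ("income", "low"): -0.05}   # B(i,j)
counts = {("age", "65+"): 1500, ("income", "low"): 2000}      # n(i,j)

ae = isolated_absolute_emissiveness(
    c2=1.0, population=10_000, base_rate=0.20, effects=effects, counts=counts
)
print(f"AE(o) = {ae:.1f}")
```

Setting c2 to a value other than one rescales the whole
result, which is how city uniqueness enters through C(2,o).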

The preceding assumption is important in accepting
Assumption 3 which simply restates Equation 1, taking into
account the effect of distance to travel to a site and site
attractivity:


Assumption 3   If a city is serviced by a single site for a
               given activity that has no substitutes (see
               TN 33) then total emissiveness of a city is
               concentrated on a single site so that actual
               visitor flows can be calculated if distance
               and the attractiveness of the site are
               introduced as indicated in Equation 2:


(2) V(o,d) = exp(-rD(o,d)) A(d) AE(o)

           = exp(-rD(o,d)) A(d) C(1,o) POP(o) G(o)

WHERE r      is a constant which may be called the impedance
             of distance (TN 14),

      A(d)   is the attractiveness of site d,

      G(o)   is a correction for population distribution, and

      C(1,o) is the per capita isolated site emissiveness
             in the city.


Now it has already been stressed that the primary
concern here is generalizing Equation 2 to an expression
that is appropriate when more than one site is considered.
Yet the preceding suggests the need to present an expression
that reduces to the isolated site equation (Equation 2) when
only an isolated site is considered. Also, there is the
problem raised earlier of not having the use of each one of
several sites be the same as if they were isolated sites
serving a city when they all serve a city. So what is needed
is that an alternative factor of sorts be defined which
expresses how origin emissiveness increases due to
alternative destinations being available (on the effect of
alternatives see TN 1, 3, 33 and sources cited there). In
particular, it is proposed that a form of the type given
below is appropriate:


Assumption 4   Attendance at a site from an origin is defined
               by Equation 3 if it is accepted that
               destinations are such that it is (1)
               meaningful to say that they have an
               attractivity A(d), that is not influenced by
               other sites being in the system (this is
               relaxed in TN 33), (2) that single purpose
               main destination visits are made to the
               destinations and (3) capacity of destinations
               is large enough that it does not influence
               use levels by affecting attractivity.


(3A) V(o,d) = exp(-rD(o,d)) A(d) (PP(o) / Σ P(o,d))
              (Σ (P(o,d)/PP(o))**SE)**SM POP(o) CG(o)


WHERE
all sums are over d, and r, D(o,d), A(d) and
POP(o) are as defined before,

P(o,d) is the product A(d)exp(-rD(o,d)), and

PP(o) is the maximum of the P(o,d) over all d associated
with an o, and CG(o) = C(1,o)G(o).


Two alternative ways of writing Equation 3A which are
more compact and of importance later are:


(3B) V(o,d) = exp(-rD(o,d)) A(d) N(o) M(o) AIU(o)


WHERE
AIU(o) = absolute isolated use PP(o)POP(o)CG(o) which is the
         flow that would occur to each site or sites with
         the highest P(o,d) if that site were the only
         isolated site serving o,

M(o) = (Σ (P(o,d)/PP(o))**SE)**SM is the supply-
         generated-"demand" multiplier which expresses how
         much the absolute isolated use is inflated due to
         having more than one site near an origin, and

N(o) = 1/Σ P(o,d) is a prorating ratio which normalizes
         the total use generated, M(o)AIU(o), according to the
         magnitude of P(o,d) = exp(-rD(o,d))A(d).


From the subscripts involved in Equation 3B as
indicated in Equation 3C, it is possible to see that one has
a Cesario-type equation for use of a site from different
origins as determined by site attractiveness and origin
emissiveness parameters (see TN 4 and References cited
there).


(3C) V(o,d) = exp(-rD(o,d)) A(d) E(o)
     WHERE E(o) = N(o)M(o)AIU(o)
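
Equations 3A to 3C can be written as a short routine for one
origin. This is a sketch only: the function name, argument
names and the numbers in the usage lines are assumptions
made for illustration, not values from the Study.

```python
import math

def site_flows(distances, attractivities, r, se, sm, pop, cg):
    """Flows V(o,d) from one origin o to every site d, per Equations 3A-3C."""
    # P(o,d) = A(d) * exp(-r * D(o,d))
    p = [a * math.exp(-r * dist) for a, dist in zip(attractivities, distances)]
    pp = max(p)                                          # PP(o)
    prorate = 1.0 / sum(p)                               # N(o) of Equation 3B
    multiplier = sum((x / pp) ** se for x in p) ** sm    # M(o)
    aiu = pp * pop * cg                                  # AIU(o)
    emissiveness = prorate * multiplier * aiu            # E(o) of Equation 3C
    return [x * emissiveness for x in p]

# One isolated site: the result should agree with Equation 2.
isolated = site_flows([10.0], [2.0], r=0.05, se=2.0, sm=0.5, pop=1000, cg=0.1)

# A second, identical site at the same distance.
paired = site_flows([10.0, 10.0], [2.0, 2.0], r=0.05, se=2.0, sm=0.5,
                    pop=1000, cg=0.1)

print(f"isolated site use: {isolated[0]:.2f}")
print(f"use at each of two equal sites: {paired[0]:.2f}")
print(f"total use with two sites: {sum(paired):.2f}")
```

With a single site the prorating ratio and the multiplier
cancel to one, so the routine returns exp(-rD(o,d)) A(d)
POP(o) CG(o), the isolated site flow.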


Though the issue is not pursued here (see TN 33), the
preceding discussion can be reworded to give two further
generalizations of the Cesario model. One involves
introducing a destination alternative factor. Beaman (TN 9)
presents empirical evidence that such a function involving
the distribution of sites around a given site may be
justified. The other generalization has to do with
introducing considerations of other activities
(substitutability).

A matter which has not been stressed yet in this paper
and which is not discussed subsequently is the importance of
recognizing what type of people making what kind of use of
parks a given model is appropriate for. In application it is
obviously necessary to have data in which attractiveness of
a park to fishermen is not confused with attractiveness to
"recreational" campers. Also, as stressed elsewhere, weekend
and holiday attractiveness should not be confused with week
day use attractiveness for recreational camping (see TN 8).
Furthermore attractiveness to day-users, non-main
destination users etc. should not be confused (see e.g. user
definitions in TN 30).


MODEL JUSTIFICATION 


In the way of justification for the equations
introduced one can note that when an isolated site is
considered one sees that from Equation 3, Equation 4C, for
use of an isolated site, is easily obtained. Since PP(o) =
P(o,d):


(4A) V(o,d) = exp(-rD(o,d))A(d)(PP(o) / Σ PP(o))
              (Σ (PP(o)/PP(o))**SE)**SM POP(o)CG(o)


WHERE the sums have only one term, the one given for the
single origin destination pair (o,d).


(4B)        = exp(-rD(o,d))A(d)(1)(Σ (1)**SE)**SM POP(o)CG(o)


(4C)        = exp(-rD(o,d))A(d) POP(o)CG(o)


Equation 4C is the equation for the use of an isolated site
with attractivity A(d) from an origin o served only by d
(see Assumption 3).

It is also easy to see that if one considers several
sites Equation 3 implies that the total flow from a city
TF(o) = Σ V(o,d) is such that all sites receive less than
or no more flow than they would when they are isolated sites
if SE ≥ 1 and SM ≤ 1.


Specifically:

(5) TF(o) ≤ Σ VI(o,d)
since

(6) V(o,d) ≤ VI(o,d)


Where it is easily seen that Equation 6 holds since the
factor which inflates actual outflow from a city is less
than or equal to the factor that is implicit in the right
side of Equation 5. The multipliers of concern satisfy the
following relationship:

(6A) M(o) = (Σ (P(o,d)/PP(o))**SE)**SM ≤
            Σ (P(o,d)/PP(o))
            for SE ≥ 1 and SM ≤ 1.


This is because by the definition of PP(o) and P(o,d) their
ratio is no greater than one so that even without taking the
root SM:


(6B) Σ (P(o,d)/PP(o))**SE ≤ Σ (P(o,d)/PP(o))


Equality holds for SE = SM = 1. Though there may be a reason
to consider SE < 1 and/or SM > 1 as indicated in the review
of the chapter, this possibility is not allowed here. One
may wonder why the root SM was introduced. The power SM is
actually only considered to be less than one here so that if
several sites have P(o,d) = PP(o), one does not have equal
increase or decrease in flow for each of these sites that is
included in or excluded from a system. For example, consider
two sites:


(6C) (Σ (1)**SE)**SM = (2)**SM < Σ 1 = 2 if SM .NE. 1


The doubling of emissiveness indicated by the left side
of Equation 6C being 2 if SM equals one illustrates the
general concern. As long as all new sites that are
introduced into a system are introduced so that their P(o,d)
= PP(o), each new site receives as many visits as the site d
which originally had PP(o) = P(o,d) if SM equals 1. But
introducing the root SM changes the model so that use does
not "multiply" with the number of sites in the way just
indicated.
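
The bound in Equation 6A and the two-site case of Equation
6C can be checked numerically. The ratios P(o,d)/PP(o) and
the SE, SM pairs below are arbitrary assumed values chosen
only to exercise the inequality.

```python
def multiplier(ratios, se, sm):
    """M(o) = (sum of (P(o,d)/PP(o))**SE)**SM, with ratios = P(o,d)/PP(o)."""
    return sum(q ** se for q in ratios) ** sm

ratios = [1.0, 0.8, 0.5, 0.2]     # assumed P(o,d)/PP(o); the largest is 1
upper = sum(ratios)               # right side of Equation 6A

results = []
for se, sm in [(1.0, 1.0), (2.0, 1.0), (1.0, 0.5), (2.0, 0.5)]:
    m = multiplier(ratios, se, sm)
    results.append((se, sm, m))
    print(f"SE={se} SM={sm}: M(o) = {m:.3f}, bound = {upper:.3f}")

# Equation 6C: two sites that both have P(o,d) = PP(o).
doubling = multiplier([1.0, 1.0], se=2.0, sm=1.0)   # SM = 1: use "multiplies"
damped = multiplier([1.0, 1.0], se=2.0, sm=0.5)     # SM < 1: less than double
print(f"two equal sites: SM=1 gives {doubling:.3f}, SM=1/2 gives {damped:.3f}")
```

Only the case SE = SM = 1 reaches the bound; every other
pair tried here gives a smaller multiplier.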


In inductive terms, Equation 3 defines a system in
which the total emissiveness of an origin, reduced for
distance effects, is prorated among the various destinations
according to the flow that they would receive if they were
isolated sites. But they receive a smaller total flow than
would be obtained if each site were an isolated site (unless
SE = SM = 1 or for special cases if SM = 1). In this
context, Equation 3 is interesting to consider when
examining the effect of creating a new site.

Consider what happens when a system has one site, d =
1, at a distance D(o,d) from a city; this site has an
attractivity of Y, SM = 1/2 and SE = 2. When a second site d
= 2 with attractivity Y and also at a distance D(o,d) from
the city is introduced into the system, say in the opposite
direction from o as the first, Equation 3 implies that the
new use at both d = 1 and 2 is:

(7) V( , ,N) = ((1**SE + 1**SE)**SM/(1+1)) V( , ,O)
             = ((2)**(1/2)/2) V( , ,O)
             = 0.707 V( , ,O)


WHERE V( , ,N) indicates the new system use at d = 1 or
2, and V( , ,O) refers to old system use at d = 1.

Thus, the ratio of total attendance at the kind of
sites considered in the new system to attendance in the old
system is:


(8) 2V( , ,N)/V( , ,O) = 2(0.707) = 1.414


About 40% more visitors are generated, but the same
"good" is offered at the same price and consumption is up by
40%: supply is generating "demand" (participation), and
it is reasonable to suggest that the "supply has generated
demand".


Related to this it is interesting to consider the cost-
benefit associated with any increased satisfaction of
people: are there any psychological benefits that would not
have occurred had no new site been introduced? And, if there
are psychological benefits, how would they have differed if
the new site had been as attractive but had been located at
such a distance that (P(o,d)/PP(o)) = 1/2 so only a much
smaller supply generated demand would have occurred (12%)?
Clearly, there may be advantages in not subsidizing visits
that do not benefit people very much.
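
The two cases just discussed (a second site with P(o,d) =
PP(o), and a second site with P(o,d)/PP(o) = 1/2) can be
reproduced with SE = 2 and SM = 1/2 as in the text. The
function name and its return layout are assumptions made for
this sketch.

```python
def use_relative_to_old(ratios, se, sm):
    """Use at each site, and in total, as a multiple of the old single-site use.

    ratios holds P(o,d)/PP(o) for every site in the new system; the old
    system is assumed to contain only the site whose ratio is 1.
    """
    m = sum(q ** se for q in ratios) ** sm           # M(o)
    shares = [q / sum(ratios) for q in ratios]       # prorating by P(o,d)
    per_site = [share * m for share in shares]
    return per_site, m

SE, SM = 2.0, 0.5

equal_sites, equal_total = use_relative_to_old([1.0, 1.0], SE, SM)
far_site, far_total = use_relative_to_old([1.0, 0.5], SE, SM)

print(f"equal sites: each {equal_sites[0]:.3f}, total {equal_total:.3f}")
print(f"ratio 1/2:   sites {far_site[0]:.3f} and {far_site[1]:.3f}, "
      f"total {far_total:.3f}")
```

The first case gives 0.707 of the old use at each site and
1.414 in total; the second gives a total of about 1.118,
the 12% increase cited above.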


ESTIMATION 


Equation 3 is the kind of equation discussed by
Cesario. In the terminology of this paper it is the Cesario
type model:


(10) V(o,d) = F(D(o,d)) A(d) E(o)


Because of the identity in form between Equation 3C and
Equation 10, the attractivities of the sites, the A(d)'s,
are numbers which one may believe can be estimated using
Cesario's approach and one may also expect that a constant,
K, and the impedance of distance coefficient, r, associated
with the distance function can be estimated. Caution about
whether the r and A(d)'s will be "easily" or "accurately"
estimated may relate to (1) a bias depending on logarithmic
transformations and (2) heteroscedasticity considerations
(see TN 19). Still, parameters can at least be estimated as
Cesario has suggested.

Having read the last paragraph the reader may be asking
himself why the equation has appeared with C(1,o), C(2,o)
and now there is reference to K. The reason is that what
could be called the "absolute" A( )'s and E( )'s which
appear in Equation 10 are not what are estimated by Cesario.
Because there is a problem in knowing on what "absolute"
scale to measure attractiveness of a park or city
emissiveness, these are measured on relative scales so that
if AA(d) and EE(o) are the estimates obtained:


A(d) = some constant times AA(d) = K(1)AA(d)


E(o) = some constant times EE(o) = K(2)EE(o)


Actually, both AA(d)'s and EE(o)'s are estimated in such a
way that the product of all AA(d)'s equals one and the same
applies to the EE(o)'s. This constraint or some similar one
is necessary if one is to be able to get estimates (e.g. see
Reference 29).

Now, the K(1) and K(2) introduced are critical in
understanding other estimation problems. The constant
estimated in the regression, K, is, as can be seen below, a
number that reflects the values of both K(1) and K(2). The
relation that shows this is:


V(o,d) = F(D(o,d)) K(1)AA(d) K(2)EE(o)
       = K F(D(o,d)) AA(d) EE(o)


Without information to define K(1) or K(2) the other
component of the estimated K cannot be determined.
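
The scale problem can be illustrated with made-up numbers.
In the sketch below the "absolute" A(d) and E(o) values are
hypothetical, and dividing by the geometric mean is simply
one way (assumed here) of imposing the constraint that the
product of the relative values equals one.

```python
import math

def normalize_to_unit_product(values):
    """Rescale values so their product is one; return (scaled, constant)."""
    geometric_mean = math.exp(sum(math.log(v) for v in values) / len(values))
    return [v / geometric_mean for v in values], geometric_mean

absolute_a = [2.0, 8.0]              # hypothetical "absolute" A(d)
absolute_e = [50.0, 200.0, 800.0]    # hypothetical "absolute" E(o)

aa, k1 = normalize_to_unit_product(absolute_a)   # A(d) = K(1) AA(d)
ee, k2 = normalize_to_unit_product(absolute_e)   # E(o) = K(2) EE(o)
k = k1 * k2                                      # what the regression constant reflects

print(f"AA(d) = {aa}, EE(o) = {ee}, K = {k:.1f}")

# Doubling every A(d) and halving every E(o) leaves AA, EE and K unchanged,
# so K(1) and K(2) cannot be separated from the flows alone.
aa2, k1_alt = normalize_to_unit_product([2.0 * a for a in absolute_a])
ee2, k2_alt = normalize_to_unit_product([0.5 * e for e in absolute_e])
k_alt = k1_alt * k2_alt
```

The product A(d)E(o) equals K AA(d) EE(o) for every pair, so
the flows are reproduced exactly by the relative values.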

Given the preceding considerations the estimation
procedure endorsed here for obtaining the parameters of the
generalized Cesario model is essentially the same two step
procedure proposed by Cesario. However, there are
complications which arise because of the SE and SM
parameters. Regardless, the first stage estimation is by the
analysis of covariance (see TN 4), to produce estimates of
K, r, the A(d)'s and E(o)'s.

In the second step of estimation the emissivities of
the Cesario formulation become the focus of attention. One
might think that he has the information shown in Figure 2,
which appears to show M times N observations to be used in
estimating M + 3 parameters: the CG(o)'s, K(2), SE and SM in
Equation 4 below.


(4) EE(o) = E(o)/K(2)

          = (1/K(2)) CG(o) POP(o) *
            (PP(o)/ Σ P(o,d)) *
            (Σ (P(o,d)/PP(o))**SE)**SM


But now it is necessary to consider whether the
equation for which parameters are to be determined is
identified: whether it is mathematically possible to
determine unique parameter estimates. For the original
Equation 4 there are M blocks of N equations each as shown
in Figure 2. Thus it appears that M + 3 parameters are to be
estimated from around M x N observations. But consider
taking the natural logarithms of both sides of Equation 4
and moving all known terms to the left-hand side. Then one
has:


(5) DD(o) = SM ln(Σ (P(o,d)/PP(o))**SE) +
            ln CG(o) + ln(1/K(2))


WHERE DD(o) is the sum of all known terms or one writes


(6) DD(o) = SM ln(Σ (P(o,d)/PP(o))**SE) +
            U(o) + KK(2) + e(o)


WHERE U(o) = ln CG(o), KK(2) = ln(1/K(2)) and e(o) reflects
the presence of error.


                      FIGURE 2

             THE RELATION BETWEEN E( )'s
               AND THE DATA FROM WHICH
                THEY CAN BE ESTIMATED


          | V(1,1) = ... A(1)E(1) |
          |    .                  |
Block 1   |    .                  |  N Equations
          |    .                  |
          | V(1,N) = ... A(N)E(1) |
          | V(2,1) = ... A(1)E(2) |
          |    .                  |
Block 2   |    .                  |  N Equations
          |    .                  |
          | V(2,N) = ... A(N)E(2) |
          | V(M,1) = ... A(1)E(M) |
          |    .                  |
Block M   |    .                  |  N Equations
          |    .                  |
          | V(M,N) = ... A(N)E(M) |
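
The log transformation that leads from Equation 4 to
Equation 5 can be verified with a small computation. Here
DD(o) is taken to be ln EE(o) less the logs of POP(o) and of
the prorating term PP(o)/Σ P(o,d), which is one reading of
"moving all known terms to the left-hand side". All
parameter values are invented for the check.

```python
import math

def dd_from_estimate(ee, pop, p):
    """DD(o): ln EE(o) with the known terms of Equation 4 moved to the left."""
    pp = max(p)
    return math.log(ee) - math.log(pop) - math.log(pp / sum(p))

def dd_from_parameters(p, se, sm, cg, k2):
    """Right side of Equation 5 (no error term)."""
    pp = max(p)
    supply_term = sm * math.log(sum((x / pp) ** se for x in p))
    return supply_term + math.log(cg) + math.log(1.0 / k2)

# Hypothetical origin served by two sites.
p = [0.6, 0.3]                                  # P(o,d)
se, sm, cg, k2, pop = 2.0, 0.5, 0.1, 200.0, 50_000

# EE(o) generated exactly from Equation 4.
pp = max(p)
ee = (1.0 / k2) * cg * pop * (pp / sum(p)) * sum((x / pp) ** se for x in p) ** sm

left = dd_from_estimate(ee, pop, p)
right = dd_from_parameters(p, se, sm, cg, k2)
print(f"DD(o) from EE(o): {left:.6f}; from parameters: {right:.6f}")
```

Each origin contributes one such DD(o), which is why only M
pieces of information are available in the second step.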


So, when one moves to estimation of Equation 5, it can
be seen that each of the M blocks in Figure 2 contains
equations which could be transformed using first estimation
cycle estimates to give equations like Equation 6. Each
block essentially contains different observations of the
same true value of DD(o) as a function of SE, SM, KK(2) and
the U(o)'s. Thus one can see that E(1) or equivalently DD(1)
is common to all equations in block 1 of Figure 2, E(2) is
common to all equations of block 2, etc.

So, one might as well only use the M estimated E(o)'s,
the EE(o)'s, as the dependent variables, in order to
estimate the unknown parameters for all origins. The unknown
parameters are as just indicated SE, SM, K(2), and the
U(o)'s or in terms of Equation 5 they are SE, SM, K(2), and
the CG(o)'s. Regardless, as already stressed there are M + 3
parameters to be estimated. Now, even though the EE(o)'s
were such that Σ ln EE(o) = 0 and thus have M-1 degrees of
freedom, the DD(o)'s which are the dependent variable in
Equation 6 do not satisfy the same linear constraint and can
be considered to be M pieces of information allowing M
parameters to be estimated. But it has been suggested that M
+ 3 parameters are to be estimated. From M observations this
is not possible unless something is done.

Firstly, in proceeding to obtain estimates it is
necessary to recognize that if KK(2) and G(o)'s are
considered in the same analysis of variance, in the way that
Cesario computed a constant and city effects, a constraint
must be placed on the G(o)'s to remove a linear relation
between them and the "mean term" KK(2). So consider that the
constraint is applied so that the estimated U(o)'s have a
sum of zero. Then there are M + 2 parameters to estimate.
But applying the constraint is an indication that it is not
the absolute CG(o)'s, or in logarithmic form absolute U(o)'s
that are being estimated but:


(7) UU(o) = K(3) U(o)


So, also one sees that one will not get an estimate of ln
K(2) but of K(3) ln K(2).


Still with the constraint on the U(o)'s there are two
too many parameters to be estimated. So the problem is how
to get estimates. One way is to eliminate the need to
estimate two of the U(o)'s. One basis for doing this is to
say that some cities are so similar that they are accepted
as having the same U(o)'s. For example one may assume that
U(i) = U(j) and U(k) = U(r) where i, j, k and r indicate at
least 3 different origins (i could equal k).

Estimation in which the strategy just described was
used to achieve estimates of parameters was tested using a
special non-linear estimation program. This is commented on
in the discussion section of the paper.

As long as only M-3 U(o)'s are introduced one can
estimate the G(o) up to a multiplicative constant. Still,
the ln CG(o) = U(o) are not the ultimate object of interest.
The planner or researcher ultimately wants to be able to
explain U(o)'s or the corresponding CG(o)'s by observable
information (e.g. socio-economic). Viewed in this way one
can see why there need not be interest in estimates of the
M-3 CG(o)'s but there should be interest in parameters that
show how CG(o)'s for cities relate to socio-economic
information about the residents of the cities.

A second alternative can in fact be pursued which is
analogous to Cesario Stage II estimation. If one accepts
that the U(o)'s in Equation 6 can be written as:


(8) U(o) = ln(AE(o)/POP(o))

WHERE by Equation 2C

(9) AE(o) = C(2,o) (U N(o) + Σ B(i,j) n(i,j))

Taking U N(o) out of the sum and using the Taylor series
expansion of the natural log (1 + X) being approximately X
for X much less than one, one has:


(10) U(o) = -ln POP(o) + ln N(o) + ln(C(2,o) U) +
            ln(1 + Σ (B(i,j) n(i,j))/(U N(o)))


(11) U(o) ≈ ln(U) + ln(C(2,o)) + Σ (B(i,j) n(i,j))/(U N(o))


Substituting the approximation for U(o) into Equation 6 one
gets the equation for which estimation may be carried out:


(12) DD(o) = SM ln(Σ (P(o,d)/PP(o))**SE) +
             KK(2) + ln(U) + ln(C(2,o)) +
             Σ (B(i,j) n(i,j))/(U N(o))

Now, there may be some concern with what was done above
because the sum of the (B(i,j)n(i,j))/(U N(o)) might not be
small. But, experience in working on TN 16 and TN 12 has
shown that the sum will be small. However, if this is a
problem there is an alternative formulation where the mean
participation rate, y mean, is used instead of U and the sum
of concern then is only a correction for a population
difference from the average population for which the B(i,j)
were estimated. The sum in this alternative formulation
given by Equation 17 of TN 6 will only be large in the rare
case that a population differs drastically from the
population for which the B(i,j) were estimated.

Depending on whether or not the U and B(i,j) are to be
treated as parameters to be estimated or known values used
to arrive at an equation in which SE, SM, KK(2) and
C(2,o)'s are the unknowns, there will be different matters
of concern. If the B(i,j)'s are to be estimated, it is
reasonable to follow the strategy that Cesario implicitly
did in his emissiveness analysis of considering C(2,o) = a
constant for all o. Then, one need only be concerned that
M-3 of the B(i,j)'s can be estimated along with (1) a
constant that has a value dependent on ln C(2,o), ln U and
KK(2); (2) SE; and (3) SM. On the other hand if U and B(i,j)
values are determined exogenously and are used along with
known n(i,j) and N(o) to make a correction for population
distribution, one has the problem discussed earlier with
respect to estimating the CG(o)'s or their logs, the U(o)'s.
As indicated in that discussion, identification can be
achieved by specifying that C(2,i) = C(2,j) and C(2,k) =
C(2,r) while the C(2,o)'s are transformed in scale by
applying the estimation constraint that the sum of the
estimated ln(C(2,o)) must equal zero.


RESULTS AND DISCUSSION 


The model proposed here preserves the elements of the
Cesario approach, but extends it to include an alternative
factor and other parameters to "explain" how people respond
to change in supply.

There was no question that the "Stage I" emissiveness
and attractiveness parameters could be estimated. But there
was enough concern about whether Stage II estimation would
be successful that the estimation test alluded to earlier
was carried out. A simple example estimation was run using a
two park, five city system with sufficient constraints on
the CG(o) so that the parameters in Equation 5 were
identifiable. The values of the dependent variable DD(o) of
Equation 6 were computed based on "correct" values for SM,
SE, KK(2) and the CG(o). In practice, of course, the DD(o)
would have a stochastic element but concern was not with the
sensitivity of the parameters to error in DD(o). Using the
predetermined DD(o) as dependent variables and an assumed
initial value of SE, Equation 5 was estimated by an
iterative least squares regression.

The iterative procedure used is straightforward. After
assuming an initial value for SE, SM and the U(o)'s were
computed by linear regression. The initial value of SE was
then corrected according to a "gradient" criterion which
moved the first estimate of SE towards the value which
minimizes the sum of squared residuals. This new value of SE
was used in a second step to produce new values of SM and
the U(o)'s; SE was again re-computed, and so on, until
"convergence" to a minimum sum of squared residuals. The
principles involved were essentially those on which PLEAS is
based.

After 3 to 5 iterations in a variety of different tests,
for different initial values of SE, the original assumed
values of parameters were recovered. Thus it was shown that
the structure is such that the model is estimable, leading to
a unique solution, at least for the particular case
considered and when the kinds of assumptions specified
earlier can be made.
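
A recovery test of this kind can be sketched for a two park,
five city system. The sketch is not the program used in the
Study: the gradient correction of SE is replaced by a plain
grid search over SE, all cities are assumed to share one
U(o) so that U(o) + KK(2) collapses into a single constant,
and the park ratios and "correct" parameter values are
invented.

```python
import math

def fit_sm_given_se(ratios_by_origin, dd, se):
    """For a fixed SE, regress DD(o) on ln(sum of ratio**SE) by least squares."""
    x = [math.log(sum(q ** se for q in ratios)) for ratios in ratios_by_origin]
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(dd) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, dd))
    sm = sxy / sxx                      # slope: estimate of SM
    const = y_bar - sm * x_bar          # intercept: common U(o) + KK(2)
    ssr = sum((yi - (sm * xi + const)) ** 2 for xi, yi in zip(x, dd))
    return sm, const, ssr

def estimate(ratios_by_origin, dd, se_grid):
    """Return (SE, SM, constant, SSR) with the smallest SSR over the SE grid."""
    best = None
    for se in se_grid:
        sm, const, ssr = fit_sm_given_se(ratios_by_origin, dd, se)
        if best is None or ssr < best[3]:
            best = (se, sm, const, ssr)
    return best

# P(o,d)/PP(o) for two parks as seen from five cities (assumed values).
RATIOS = [[1.0, 0.9], [1.0, 0.7], [1.0, 0.5], [1.0, 0.3], [1.0, 0.1]]
TRUE_SE, TRUE_SM, TRUE_CONST = 2.0, 0.5, 0.3

# Error-free DD(o) computed from the "correct" values, as in the test described.
DD = [TRUE_SM * math.log(sum(q ** TRUE_SE for q in r)) + TRUE_CONST
      for r in RATIOS]

SE_GRID = [k / 100 for k in range(100, 401)]     # SE from 1.00 to 4.00
se_hat, sm_hat, const_hat, ssr = estimate(RATIOS, DD, SE_GRID)
print(f"SE = {se_hat:.2f}, SM = {sm_hat:.4f}, constant = {const_hat:.4f}")
```

With error-free DD(o) the search returns the values used to
generate the data, which mirrors the finding that the
structure is estimable under suitable constraints.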

Certainly, it is possible to specify parameter values
for which estimation cannot be carried out (SE = SM = 1).
But except for this rather unlikely case, and possibly some
others like it, it is not to be expected that estimation
problems will arise except when the "DD(o)'s + error" are so
variable that no "reasonable" parameter values will explain
them.

Thus what is proposed in this paper is a modification
of the Cesario estimation approach that allows one to take
into account people's response to alternative parks that
they can visit that are around them. In this model one
explicitly recognizes that the emissiveness estimated in
Cesario's first step of estimation should not be analyzed
without considering the configuration of supply around a
city. The formulation is such that one has an explicit
function defined which allows one to take into account the
attractivity of alternative destinations and their
geographic configuration, when one wants to calculate what
will happen when a new park is added to a system or an
existing park is removed. Actually a similar operation can
be carried out using the model introduced in TN 1. There
Cheung even presents an example of how a system is altered
when a new park is introduced. He has computed new origin
destination flows based on changing his alternative factor.
Incidentally Cheung's alternative factor has much in common
with the alternative factor derived here and others that
have been defined (see TN 30 and TN 3).

The reader may wish to note that originally the
degeneracy problem encountered in Stage II estimation (which
was described in the Estimation section of the paper) was
not recognized. When Bacon (see Reference 3) worked on the
problem, origin-destination flows were generated, and in
Stage II estimation with 1300 flows no answer was found as
to why attempts to estimate 27 parameters for 26 cities
always failed: the matrices which were to be inverted to
estimate the parameters were of rank 25 or less. It was not
obvious that with all the origin-destination flows one
really only had 26 "independent" pieces of information on
which to estimate parameters.


It is not being suggested that the model defined gives
a better fit to data than the Cesario model. Obviously the
fit to given data is determined by the first cycle of
estimation. What is claimed is that the second cycle of
estimation results in inherent emissiveness values should
not be used in making estimates of what will happen if a
system of parks is changed. Cesario's attempt to estimate
the relationship of his emissiveness values to the aggregate
socio-economic characteristics was not successful. This is
probably because more variance is associated with the
alternative factor component of Cesario's E's than with the
inherent emissiveness factors, the CG(o)'s, which depend on
socio-economic characteristics.

It would be quite possible to take a given system of
parks and carry out a simulation exercise based on parameter
estimates to find out how much of the variation in the Stage
I emissiveness is related to the configuration of parks
around the origins considered, and how much is related to
socio-economic characteristics of the inhabitants of these
origins. Simulations along this line have shown that once
size of population has been considered, variation from other
socio-economic characteristics may be expected to produce no
more than about 5 to 10 percent difference in the amount of
participation the different cities will generate (CG's do
not vary much but C(2,o)'s may). The point is that the
configuration of parks around cities may account for a much
larger variation in the amount of participation that cities
will generate than do socio-economic variables.


In practical terms the last paragraph indicates that
the spread of the EE(o) values about the line shown is to be
expected. It is to be expected because the distribution of
supply around various cities is affecting the values that
EE(o)'s have as well as these being influenced by N(o) and
differences in the socio-economic characteristics of cities.


These authors believe that in terms of the model
presented, the important research step that must be taken is
not the consideration of the effect of income or other
variables on EE(o)'s, which is what Cesario suggested (see TN
4), but the identification of the influence of supply on
EE(o)'s and then consideration of the importance of C(2,o)
or C(i,o) factors of city uniqueness in explaining
behaviour.

It would be nice to be able to report that computer
runs have been made using the data which Cesario used that
confirm the validity of "The Generalized Cesario Model".
However, this is NOT EXACTLY the case. Weighted regression
runs (TN 19) to estimate more accurate emissiveness
parameters than those presented in TN 4 have been processed.
The E( )'s obtained have been analyzed to determine if they
are in accord with the formulation for an origin:


E( ) is equivalent to K * (prorating ratio)
                        * (supply generates demand effect)
                        * (population)


E( ) is equal to K * X(2,o) * (X(3,o)**SM) * POP(o)


Actually, it was convenient to determine if the
prorating ratio and population have an exponent of one.
Because, as indicated above, the supply generates demand
effect has an exponent SM, this was estimated. The results
were that Cesario's emissiveness values were not compatible
with the generalization in the Note, but rather with the
generalization in which SE is less than one, not greater
than one, and in which the maximum potential to visit a
park, PP(o), is an important variable. One model
developed had an R² of 0.80 when population alone was
introduced. When the other two variables were included R²
was 0.95. When these other two variables (PP(o) and X(3,o)
with SE < 1) were introduced, each alone (given that the
other and population had already been entered) explained a
significant amount of variance. However, this model is not
discussed because its definition, and the rationale behind
it, would be a deviation from the theme of this paper. It is
planned to present these results in a separate non-CORD
Study paper.

Nevertheless, the results of estimating parameters for
SE = 1.2 and 1.4 were invalid, because of the collinearity
between X(2,o) and X(3,o). The correlation for SE = 1.2 was
0.985, while the correlation for SE = 1.4 was 0.955. To
obtain estimates of parameters, either X(2,o) or X(3,o) was
eliminated from the equation. When this was done, it was
found that the coefficients B(2) and (-B(3)) did not have
values between zero and one. That was to be expected
because B(2) should be one and X(2,o) and X(3,o) have about
the same variance and negative correlation. Therefore, 1 >
B(3) > 0. This means that if B(2) is estimated, it should
be TOO SMALL by the value of SM; and if SM is estimated it
should be TOO SMALL by -1, because its value will include
-B(2). Actually, B(2) values were in the -2 range, and B(3)
values were in the +2 range. Population had the coefficient
of 1.07 (as determined by Cesario).
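
The effect of dropping one of two highly correlated
regressors can be sketched numerically. The example is an
illustration only: the data are synthetic, and the names b2,
sm, x2 and x3 are assumptions standing in for B(2), SM and
the logarithms of X(2,o) and X(3,o). It shows that when x3
is left out, the coefficient estimated for x2 absorbs sm
times the regression slope of x3 on x2, which is close to
-sm when the two have about the same variance and a strong
negative correlation.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 26
b2, sm = 1.0, 0.6          # assumed "true" exponents (hypothetical values)

# Synthetic regressors with about equal variance and a strong negative
# correlation, loosely mimicking the situation described in the text.
x2 = rng.normal(size=n)
x3 = -x2 + 0.15 * rng.normal(size=n)
y = b2 * x2 + sm * x3      # noise-free log-emissiveness, for clarity

def slope(x, y):
    """OLS slope of y on x (with an intercept)."""
    xc, yc = x - x.mean(), y - y.mean()
    return float(xc @ yc / (xc @ xc))

b2_alone = slope(x2, y)                # x3 omitted from the equation
expected = b2 + sm * slope(x2, x3)     # omitted-variable identity

print("correlation of x2 and x3:", np.corrcoef(x2, x3)[0, 1])
print("B(2) estimated alone:", b2_alone, "identity gives:", expected)
```

In this sketch the coefficient estimated alone falls short
of its assumed value by roughly sm, which is the direction
of bias argued for in the paragraph above.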

The values of B(2) and B(3) suggest that the structure
of the particular Generalized Cesario Model used was not
appropriate to the data. Because the two regression
coefficients relating emissiveness to X(2,o) and X(3,o) were
too large, it seems clear that:

1. being located in park-like land, there was not much need
to use "park-influenced emissiveness"; and

2. the "configuration" of parks around a city reflects: a) a
history of trying to meet needs, or b) little need to meet
needs because of a situation such as that suggested in 1.


The regression results with the highest R² values show
the plausibility of these latter hypotheses. In the
preceding equation for X(3,o), when one lets SE be less than
1, the number of parks with much potential (parks which
would be used if others are full, etc.) is emphasized.
Introducing the highest potential PP(o) as a variable serves
as a means of identifying origins in high quality park
country other than by X(3,o), which can have contradictory
meaning because potentials are normalized to a PP(o). This
can be poor for the best of parks.

In particular, it can be seen from the comments on
B(2) and B(3) that the values are not in the ranges
suggested! Still, as the results indicate, the regression
coefficients are highly significant and explain highly
significant amounts of variance.

The kind of research suggested above has not proceeded
beyond the stage just described because as work on TN 19 and
TN 35 progressed it became clear that it was necessary to
use weighted regression to get more accurate estimates of
the EE(o)'s than originally obtained by Cesario. These were
obtained late in 1974. However, getting the new estimates of
the EE(o)'s, which were much more accurate than ordinary
least squares estimates, only raised a new problem, and that
was that the Cesario model should be rejected as
structurally invalid for the analysis of the data to which
it was being applied. Certain obvious problems with the data
used include (1) using combined data for week days, weekend
days and holidays (see TN 8) and (2) problems in defining if
a visitor was really a main destination visitor (see TN 30).
Also there is a problem, commented on subsequently, in that
the Cesario data are only data on part of the parks in a
system.

In summary, recognizing the problems already cited
meant that EE(o)'s and park attractivities estimated by
Cesario might not have clear meanings. Thus further
sophisticated analysis of the EE(o)'s was not justified.
Ideally a new analysis will be carried out on a new large
data set on a truly complete system of parks and with good
definitions and data collection practices being used in
assembling these data (see TN 21 and Volumes I and II for
details on the problems with CORD Study Park User Surveys).
Then, one will be justified in more elaborate empirical
analyses of the EE(o)'s than that presented earlier.

However, the preceding discussion evades the issue of
how appropriate the Cesario model is to data to which it has
been applied. Empirical results have been derived. Applying
the Cesario model to estimate parameters that explain
visitor flows for Ontario (TN 4) and then using the results
to simulate origin-destination flows has shown that whereas
the Cesario model (when used with real data) results in an
R-squared in the order of 0.8, it should result in a much
higher R-squared, as indicated by the R-squared values
obtained when estimating parameters for simulated Ontario
visitor flows. For the Ontario Parks on which estimation was
originally done, parameter estimation on simulated data for
Ontario Parks showed that the R-squared for estimation
should be in excess of 0.99. The "generalization" suggested
may explain part of the variance not explained by the
relation between population and emissiveness, but it does
not explain any of the 20% of variance which was left
unexplained after the Cesario model was applied. This
obvious failure in terms of explanation points up the
structural deficiency of the Cesario model (for tests of
such models' structure see TN 19 and TN 35).

One may reasonably ask what could be wrong with a model
as general as the Cesario model. A simple answer is that the
model does not allow for asymmetries in behaviour. The kind
of situation which might be creating problems is shown in
Figure 3, where people do not go to a given park from a
certain city because it is readily available as supply to
other locations. People in city 3 have a park which from
their perspective appears more attractive than park A, which
appears attractive to cities 1 and 2. The other source of
problems, as suggested earlier, is almost certainly
aggregation over different types of use (see TN 3, TN 8 and
TN 40, etc.). Obviously if the data on camping combine
continuous users or holiday users with weekend users,
different functions should be used to explain the behaviour
of these people. It is not enough to simply deal with
campers.

At this point it remains unknown whether introducing
this further consideration would result in a drastic
improvement in the R-squared of the Cesario model, but it is
these kinds of questions which must be taken up to determine
the appropriateness of the Cesario model to sufficiently
disaggregated data.

Previous parts of this discussion section have
suggested ways of testing the validity of the model proposed
here. However, one of the most direct validity tests has not
been mentioned. The B(i,j)'s, which can be estimated if it
is assumed C(2,o) is equal to a constant for all o's,
should agree with the B(i,j)'s obtained by an analysis of
variance of national survey information on people's use of
parks. CORD Study national survey information provides an
abundance of data on parks use and even the similar Canadian
Government Travel Bureau surveys provide some data. So if
there is success in estimating B(i,j)'s for the Cesario
model when it is applied to disaggregated data for which it
is more structurally valid than it is for the Ontario data
(TN 4), then there is an abundance of "independent"
estimates of these B(i,j)'s so that comparisons can be made,
resulting in a test of model validity.

FIGURE 3
THE RELATION OF THREE CITIES TO TWO PARKS THAT ILLUSTRATES A SITUATION
WHERE THE CESARIO MODEL WOULD NOT GIVE GOOD USE ESTIMATES*

Above 1, 2 and 3 are cities, while A and B are parks each with attractivity X.

* It is possible to further generalize the Cesario Model so that
the attractivity of A to people from 3 is "corrected" so that
it is less attractive than B because of the "competition" effect
from cities 1 and 2. This is not discussed here, and it involves
major changes to Cesario's original formulation that have nothing
to do with the generalization of the model introduced here.

Finally, only once in the preceding discussion has the
importance of system definition been introduced. Yet, for
Stage II analysis, defining what parks are considered by
people from a given origin for a given type of trip on a
given type of day (week day, weekend day or holiday) is
crucial. With data on any number of origins and destinations
one can proceed with Stage I analysis, but if one does not
have attractivities for all parks affecting use from a given
origin then such factors as the following cannot be defined:


( Σ (d ∈ D) (PP(o)/P(o,d))**SE )**SM


(Actually, except for simulations, one may argue that
SE and SM are not system parameters but should change over
time or with changes in supply. This matter is of
importance; still, it is not examined in this paper. A paper
must end somewhere.) No more would need to be said if
specifying "the" system were a simple matter of enumerating
certain types of facilities. Unfortunately, the problem of
substitutability should be considered, for reasons suggested
by Veal (see Reference 5) and also discussed in TN 10 and 33
(see also TN 32 and TN 37). Concerns about how the decision
to go to a given place is made are elaborated in TN 33,
where substitutability is introduced into the Cesario model
in two different ways. One has to do with substitution at
destinations and attractivity, and the other with
substitutability at an origin and emissiveness.


CONCLUSION 


The comments of the last few paragraphs show that much
of the apparent conceptual elegance of the "generalization"
of the Cesario model is counterbalanced by problems that
have been ignored or are not dealt with in this paper.
Still, the authors believe that the questions raised in this
paper and the ways of understanding visitor flows represent
useful progress in developing more meaningful models of
behaviour.

This paper has shown how the Cesario model can be
generalized so that inherent emissiveness can be extracted
from the emissiveness parameters that Cesario originally
showed how to calculate. This means that if the
generalization is even approximately structurally
appropriate to a given situation, system model parameters
can be calculated which can validly be used in estimating
the consequences of, for example, deleting a given park from
a system. Cesario's Stage I emissiveness values should not
be used because their value is conditional on the system for
which parameters are estimated. Thus a critical step in
making the Cesario model useful to planners has been taken.
It also seems fair to claim that a useful theoretical
contribution has been made in clarifying what the
emissiveness parameters in the Cesario model really measure
in a particular case.

If the paper prompts some other researcher to pursue
research that was suggested it will have served another
useful purpose. For example, it seems very important that
some research be done to clarify how much Stage I
emissiveness depends on supply configuration in a variety of
circumstances as opposed to aggregate socio-economic
characteristics associated with a city.


REVIEW


Gordon Ewing 


The six papers in this section discuss what are
commonly referred to as trip distribution models. The models
range in complexity from a variant of the most simple
gravity model to one which is amongst the most generalised
in the present literature.

The first three papers by Cheung et al. involve
different elaborations of the simple gravity model. In the
simplest, "A Method for Predicting Enroute Overnight Park
Use" (TN 18), the form of the basic gravity model,
V(i,j) = k M(i)**α M(j)**β / D(i,j)**γ, is retained, but
with the terms in the numerator redefined. Where M(i)
normally refers to the size of the origin i, it is replaced
by T, the volume of traffic on a section of highway leading
to j; and M(j), destination size, is redefined as the number
of campsites in park j. The other two papers by Cheung
replace destination mass, M(j), by a measure of j's
attractiveness, T(j), which is exogenously defined as a
function of several park characteristics. In addition, a
term measuring the strength of competition from alternative
destinations completes the modified gravity model. As with
the attractiveness terms, the "alternative" term is measured
exogenously and simply included in the model as an extra
independent variable. The distinction to notice between the
other two papers is that "A Day Use Visitation Model" posits
that the independent variables, with the exception of
distance, are additive in their effect on V(i,j), whilst "An
Application of Mathematical Models to Compare Two Potential
Park Sites" (TN 7) assumes the same variables combine
multiplicatively to affect V(i,j). As is common in trip
distribution modelling studies, there is no discussion of
the theory that might give rise to these alternative
composition rules. Population growth, change in family
composition, population "aging", etc. are not considered in
TN 7, although they should be.
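
For readers unfamiliar with how the basic gravity model is
usually calibrated, the following is a minimal sketch. It
assumes the multiplicative form is fitted by ordinary least
squares after taking logarithms, which is the common
practice but is not claimed to be the exact procedure of any
of the papers reviewed. The function name fit_gravity and
all data values are hypothetical.

```python
import numpy as np

def fit_gravity(V, Mi, Mj, D):
    """Fit ln V = ln k + a ln Mi + b ln Mj - g ln D by least squares."""
    X = np.column_stack([np.ones(len(V)), np.log(Mi), np.log(Mj), -np.log(D)])
    coef, *_ = np.linalg.lstsq(X, np.log(V), rcond=None)
    ln_k, a, b, g = coef
    return float(np.exp(ln_k)), float(a), float(b), float(g)

# Synthetic flows generated from known (hypothetical) parameter values.
rng = np.random.default_rng(2)
n = 60
Mi = rng.uniform(1_000, 100_000, n)   # origin size
Mj = rng.uniform(20, 400, n)          # destination size
D = rng.uniform(5, 300, n)            # distance
V = 0.002 * Mi**1.0 * Mj**0.5 / D**2.0

k, a, b, g = fit_gravity(V, Mi, Mj, D)
print(k, a, b, g)
```

Because the synthetic flows contain no error, the fit
recovers the generating values; with real data the same
regression yields estimates together with an R² on the
logarithmic scale.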

Whilst the model in Cesario's TN 4 is a variant of the
simple gravity model, the innovative feature is that origin
emissivity, E(i), and park attractiveness, A(j), now become
parameters to be estimated in calibrating the model. The
implications of the innovation are important. In calibrating
the traditional gravity model with several origin and
destination variables, it is never clear whether poor
calibration results reflect the absence of important
variables or the incorrect structure of the model or some
combination. Removing the need to pre-specify the presumed
important variables means that poor calibration is more
clearly attributable to an improperly structured model.

It also transpires that the ability to estimate E(i)
and A(j) values in what appears to be a simple gravity
model, V(i,j) = E(i) A(j) f(D(i,j)), enables these estimates
to be used as dependent variables in a second round of
parameter estimation in a more complex distribution model.
Evidence for this is to be found in the Beaman et al.
paper, TN 11. Indeed, although Cesario's concern is not to
calibrate a generalised distribution model, the method he
describes clearly provides the impetus for the remaining two
papers in this section, which involve much more general
models.

Both "A Work Plan for the Development of a Mathematical
Model to Predict and Explain Overnight Use of Parks" by
MacDonald, Netherton and Cesario and the TN 11 paper contain
terms not found in the other papers. The major difference
between the two papers is that the former is concerned only
with the formulation of an appropriate mathematical model,
whereas the Beaman et al. paper considers the mathematical
problem of how to estimate parameters in the model. Put
simply, the two papers are about the interpretation of
emissivity terms as estimated in Cesario's paper. They argue
that an origin's emissivity is firstly a function of its
size and socio-economic composition. In addition, since E(i)
is unique to each origin, they argue that it contains within
it an estimate of the spatial supply of facilities, which is
also unique to any origin. In particular, they see this
supply as having two separate and opposite effects. One is
referred to as the alternative factor, defined in terms of
the sum of all parks' attractivenesses discounted for
distance. This is similar to the alternative factor in two
of the Cheung et al. papers, except that Cheung had to
ignore the effect of alternatives' attractiveness for lack
of information. In all cases the size of the alternative
factor is seen as having a negative effect on the patronage
of any one park, in the sense that these alternatives
constitute competition. However, at the same time that all n
- 1 alternatives provide competition for the nth park, all n
parks, and in particular their attractiveness as mediated by
distance, can be thought to have a supply-generating-demand
effect. That is to say, the more a particular spatially
distributed facility is provided, the more use there is
likely to be of such facilities. Thus, for example, given a
latent demand for cross-country skiing, the more facilities
that are provided, the more participation is likely, up to a
point. TN 11 discusses the problem of separating these two
effects embedded in the emissivity term for each origin.

Turning to the individual papers, each has features
which merit comment. Cheung's "Day-Use Model", which
provided the impetus for many succeeding Technical Notes, is
characteristic of earlier interaction models in that the
form of the model is simplified to make statistical
estimation straightforward. Thus, unlike later work, the
attractiveness of destinations is computed exogenously and
weights on the distance terms are determined by trial and
error. In particular, the unusual procedure of having three
different distance exponents depending on a destination's
distance is used. The rationale is that the overall R² value
in the calibrated model is maximised using these three
parameters. However, it is of note that a single exponent
for all distances of either 1.5 or 2 results in only a 0.01
reduction in R². Statistically it is very probable that the
increment of 2 parameters in the model accounts for this
small increase, rather than the variable distance exponent
having any behavioral significance. This variable exponent
on a destination's distance contrasts with the treatment of
distances to alternatives, which are all assigned an
exponent of 0.5 in the alternative factor. No rationale for
this inconsistency is suggested.

Turning to the attractivity factor in the model and in
particular to the function used to calculate its value,
T(j) = Σ(e) S(e) Σ(m) R(m) Q(m), its form deserves comment.
The relative popularity rating of activity e, S(e), is
measured as a direct function of a Canada-wide participation
rate in the activity. The importance of facility m, R(m), in
enabling activity e to be enjoyed is calculated from the
correlation between a park's attendance and the "amount of"
facility m at the park. It would seem that there is an
element of "double counting" in the above equation. For
example, if horse riding has a low popularity rating (S(e)),
it seems likely that horse rental at a park will do little
to increase its attendance figures. This in turn will
produce a low importance rating (R(m)) for horse rental.
R(m) is thus clearly dependent on S(e) in this case, whereas
the implication of the equation is that the terms measure
separate and independent factors contributing to park
attractiveness.

One other question about the attractivity function is
how the "rank" scores (Q(m)) were arrived at for the amounts
of certain facilities. For example, why does a 9-hole golf
course rate 8.5, and an 18-hole course rate 11, or why do 6
showers rate 8 and 8 showers rate 10.5? Why, also, are rank
scores then used in calculating T(j) as if they had interval
scale properties?

It is noteworthy that the T(j) values calculated
contribute one-fifth of one percent to the explained
variance of V(i,j), the dependent variable in the model.


As regards the model itself, three points stand out.
Firstly, 84% of the 91% of explained variance in V(i,j) is
attributable to P(i)/g(D(i,j)), the population of the origin
mediated by distance. This dominance of the origin's mass
term (population) in interaction models where other non-mass
terms are included is a familiar pattern. Indeed it is
difficult to cite any published work where origin mass is
not the major contributor to the variance explained, along
with distance. On the face of it, it might seem that trip
distribution modelling is a subject that requires little
further investigation, if only 10% or 20% of variance is
left unexplained by the simplest gravity model. However, a
large value of R² in this case depends on there being a
large variance of origin mass which produces a large
variation in use figures. If an area were studied where
origin masses were more uniform, variance attributable to
origin mass would be much less. In such a case, R² would be
greatly reduced if the trip distribution model were not
properly specified. In other words, when origin mass has a
large variance, just about any interaction model will give a
high R² provided origin mass is one of the independent
variables. But where the variance of origin mass is small,
any but the properly specified interaction model will result
in a low R² value.
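
The argument can be illustrated with a small simulation.
This is a sketch under stated assumptions, not a reanalysis
of Cheung's data: flows are generated from population,
attractiveness and distance, and a deliberately misspecified
model that omits attractiveness is fitted twice, once with
widely varying origin populations and once with nearly
uniform ones. All names and values are hypothetical.

```python
import numpy as np

def r_squared(X, y):
    """R-squared of an OLS fit of y on the columns of X plus an intercept."""
    A = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    dev = y - y.mean()
    return float(1.0 - (resid @ resid) / (dev @ dev))

rng = np.random.default_rng(3)
n = 500
ln_D = np.log(rng.uniform(10, 200, n))   # distance
ln_A = rng.normal(0.0, 1.0, n)           # attractiveness, omitted from the fit

def misspecified_r2(sd_ln_P):
    """Fit ln V on ln P and ln D only, for a given spread of origin mass."""
    ln_P = rng.normal(9.0, sd_ln_P, n)   # origin mass (population)
    ln_V = ln_P + ln_A - 2.0 * ln_D      # "true" flows include attractiveness
    return r_squared(np.column_stack([ln_P, ln_D]), ln_V)

r2_wide = misspecified_r2(3.0)     # origin masses vary widely
r2_narrow = misspecified_r2(0.1)   # origin masses nearly uniform
print(r2_wide, r2_narrow)
```

The same omitted variable costs far more R² when origin
masses are nearly uniform, because there is then little
variance for the mass term to account for.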

The second feature of the model which is common to most
empirical calibrations of interaction models is that the
objective function used in estimating parameters is the
minimisation of the residual sum of squares of a linear
regression model, or in other words the maximisation of R².
It has been argued elsewhere (see Reference 38) that R² is
often quite insensitive to significant parameter changes. An
example of this in the present paper is given in Table 5.
There is only a 3% variation in R² as the distance exponent
varies between 1.5 and 3, and indeed between 1.5 and 2.5 the
range in R² is only 1%. Yet these exponents represent a
sizeable range in terms of the typical values found in a
wide variety of gravity model studies. It has been suggested
that whilst R² is a valuable measure, it can be used in
combination with other measures, such as mean predicted
distance. This latter has been found to be much more
sensitive to parameter changes. Thus an alternative
procedure would be to seek parameter estimates which
minimised the difference between mean observed and predicted
travel distances, subject to a constraint on the degree that
R² could deviate from its maximum value.
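
The mean-distance measure is simple to compute. The sketch
below assumes it is the flow-weighted mean of the
origin-destination distances; the function names and the
four data points are hypothetical and serve only to show how
the gap between predicted and observed mean distance moves
as the distance exponent changes.

```python
import numpy as np

def mean_trip_distance(flows, distances):
    """Flow-weighted mean travel distance over origin-destination pairs."""
    flows = np.asarray(flows, dtype=float)
    distances = np.asarray(distances, dtype=float)
    return float((flows * distances).sum() / flows.sum())

# Hypothetical distances and observed flows for four origin-destination
# pairs; the flows were generated with a distance exponent of 2.
D = np.array([10.0, 20.0, 40.0, 80.0])
V_obs = np.array([400.0, 100.0, 25.0, 6.25])

def predicted(exponent):
    """Simple gravity prediction, scaled to the observed total."""
    raw = D ** -exponent
    return V_obs.sum() * raw / raw.sum()

for g in (1.5, 2.0, 2.5):
    gap = mean_trip_distance(predicted(g), D) - mean_trip_distance(V_obs, D)
    print(g, round(gap, 3))
```

The gap is positive when the exponent is too small, zero at
the generating exponent and negative when it is too large,
so it gives a signed indication of the direction of the
error that R² alone does not.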

A third feature of Cheung's interaction model which is
unfortunately common to most interaction studies is that no
test of the homoscedasticity of the independent variables is
provided. The effect of a variable violating this assumption
can be to artificially increase R². Consider origin
population, since that is the most important variable
contributing to R². If, for example, there are many small
origins and one or a few very large places, with only a few
intermediate sized centres, which is a not uncommon urban
size distribution, this distribution is heteroscedastic and
will undoubtedly result in an appreciably higher R² value
than if P(i) was homoscedastic. Although Cheung does not
provide information on the size distribution of the origin
areas, Table 8 gives a sample of origin area sizes. Fourteen
of the fifteen in that table range from 1,000 to 8,000 in
population, whilst the fifteenth, Saskatoon, has a
population of 117,000. This last value seriously violates
the homoscedasticity assumption, and it might have been
interesting to calculate R² with and without Saskatoon
included in the analysis.
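
The comparison suggested here can be sketched as follows.
The populations follow the ranges quoted from Table 8, but
the visit counts are invented for illustration, so the
numbers printed say nothing about Cheung's actual results;
the point is only how much a single very large origin can
raise R².

```python
import numpy as np

def r2_simple(x, y):
    """R-squared of a simple linear regression of y on x."""
    return float(np.corrcoef(x, y)[0, 1] ** 2)

# Fourteen small origins plus one large one. Visit counts are invented:
# proportional to population, with an alternating error for small origins.
pop_small = 1000.0 + 500.0 * np.arange(14)             # 1,000 to 7,500
error = np.where(np.arange(14) % 2 == 0, 60.0, -60.0)
visits_small = 0.05 * pop_small + error
pop_all = np.append(pop_small, 117_000.0)               # add the large origin
visits_all = np.append(visits_small, 0.05 * 117_000.0)

r2_with = r2_simple(pop_all, visits_all)
r2_without = r2_simple(pop_small, visits_small)
print(r2_with, r2_without)
```

With the large origin included R² is close to one, although
the fit among the fourteen small origins is unchanged.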

One final question relates to the reliability of data
based on a 56% voluntary return of completed questionnaires
by visitors leaving parks. Whether this group is
representative of the universe or of those given the
questionnaires on entry to the park is unknown.


Many of the questions addressed to the first paper can
obviously be repeated for "An Application of Mathematical
Models to Compare Two Potential Park Sites". A very similar
equation is calibrated, but no indication of the value of
R² is possible. This is a little distressing in that the
accuracy of the predictions for the future being made in
this paper depends in large part on the accuracy of the
model. However, even if R² were high, the question of
heteroscedasticity still leaves in doubt the reliability of
estimated parameters and the validity of the model as
structured.

The accuracy of prediction of the future use of parks
depends first on the validity of the interaction model and
the accurate estimation of its parameters. Secondly, it
depends on how accurate estimates of future use levels of
parks are. In this paper, the latter are estimated by
extrapolating forward figures based on an average of recent
years. One obvious question is whether present trends are
likely to continue. This depends both on the future rate of
population increase remaining steady, particularly amongst
the age groups most active in park use, as well as on the
future recreational preferences and constraints of the
population not changing. The stability of the latter in
particular is questionable, although it is difficult to see
how changing preferences or constraints and their effects
can be reliably predicted. One possible approach, not
mentioned by Cheung, would be to calculate whether the rate
of change in park use has itself been changing over recent
years. For example, a three-year running mean of the change
rates in Table 4 might show whether or not the change rate
was steady.
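
A three-year running mean of this kind takes only a few
lines. The change rates below are hypothetical stand-ins,
since Table 4 is not reproduced here, and the function name
running_mean is likewise an assumption made for the example.

```python
import numpy as np

def running_mean(values, window=3):
    """Running mean over consecutive windows of the given length."""
    values = np.asarray(values, dtype=float)
    return np.convolve(values, np.ones(window) / window, mode="valid")

# Hypothetical year-over-year change rates in park use (not from Table 4).
change_rates = [0.12, 0.10, 0.08, 0.07, 0.05, 0.03]
smoothed = running_mean(change_rates)
print(smoothed)
```

A smoothed series that falls or rises steadily, as it does
for these invented figures, would indicate that the rate of
change is itself changing and that straight extrapolation of
recent averages is suspect.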

One final feature of the paper deserving mention is the
treatment of site attractiveness. Although park
attractiveness is not part of the model (for reasons
explained by Cheung), a method is used to estimate the
attractiveness of two existing sites and this is then used
as a surrogate for the projected parks, prediction of whose
future use is the goal of the paper. However, the grounds on
which the parallel is drawn between the two existing and
future sites are not indicated. In general, two alternative
methods exist for estimating the attractiveness of an as yet
non-existent park. One (possibly rare) procedure would be
to do as Cheung did and find an existing park whose
characteristics were close to those of the proposed park,
both in quantity and quality. The other would be to relate
park attractiveness to a set of measurable park
characteristics, so that only a knowledge of the latter
quantities in any future park would be necessary to
accurately define its attractiveness.

In "A Method for Predicting Enroute Overnight Park
Use", a high R² is obtained after a logarithmic
transformation of the data and is not a measure of goodness
of fit between the untransformed predicted and observed data
in Table 2. R² in the latter is 0.93. Also the percentage
error in prediction of the three largest use figures is only
about 8%, whereas the mean percentage error for the rest of
the data is 21%. This indicates that after logarithmic
transformation, the three largest values were much more
accurately predicted than the average, which suggests that
even after logarithmic transformation the homoscedasticity
assumption is still not met.

It is interesting to note that origin size, in the form
of traffic volume on the nearest arterial highway, and
distance to that highway explain relatively little variance
in enroute camper volume compared to campsite capacity. It
is rare indeed to find either the origin or destination mass
term not contributing a large amount to the variance
explained. However, the fact that distance contributed so
little to the variance explained might provide a clue as to
why highway traffic volume also explained very little,
vis-a-vis campsite capacity. It is most improbable that
distance, per se, has little effect on patronage: so many
recreation studies indicate otherwise. However, if the
range of campsite-to-highway distances is quite narrow it is
possible that, since people are not infinitely sensitive to
distance, the differences in distance are not large enough
to have any appreciable effect on patronage. Likewise, if
the range of highway traffic volumes (or at least camper
traffic volumes) is small, they may give the appearance of
having little influence on campsite attendance, even though
a larger range of values might show them contributing
appreciably to R². Since the original data are not provided
in the paper, the latter must remain speculative. However,
if the regression results are taken at face value, and there
is assumed to be appreciable variation in highway traffic
volumes and distances, the appropriate planning response in
developing parks to cater to enroute campers would be to
give little attention to arterial traffic volume and
distance to highway and consider only park size. In other
words parks could be located anywhere and only their size
would be a consideration, if the results of the regression
model were taken at face value. Such an interpretation is so
seriously at odds with common sense that one is inclined at
least to accept that either arterial traffic volume or
distance or both are important, but that this study simply
lacks a sufficient range on these variables to measure their
effects. Certainly, the rarity of finding either "origin
mass" or distance to be of relatively minor importance in
explaining visitation levels is such that the findings
should not readily be accepted without further enquiry.

Cesario's paper breaks new ground in trip distribution
studies for reasons stated earlier in this review. The paper
also should be seen as part of a continuing series of papers
by Cesario on this topic (see Reference 9). In this regard,
it should be noted that his paper in the Journal of Regional
Science, January, 1974, describes a solution to the problem
of biased parameter estimates which he indicated to be in
progress in his recommendation No. 4.

Basically, Cesario's analysis of covariance model is
used to estimate one set of parameters (E(i), i = 1, 2, ...,
n) which are unique to each origin and another set (A(j), j
= 1, 2, ..., m) which are unique to each destination. The
major area of debate relates to the interpretation of these
parameters. Cesario's opinion in this paper appears to be
that "the emissiveness of population centre i, E(i),
reflects its relative (to other population centers)
propensity to emit trips under identical circumstances -
that is, if it were the case that all centres were
confronted by the same availability or "supply" of
recreation opportunities". He provides no rationalisation of
this assertion. In contrast, the paper by Beaman interprets
E(i) more generally as a composite measure of several
factors unique to origin i, one of which is its unique
spatial supply of park facilities. It is, of course, unique
in that no other origin has the same spatial relationship to
the set of parks under study. This latter, broader
interpretation of E(i) would seem to be the more defensible
one at the present stage of knowledge, until such time as
empirical evidence is provided to show that the spatial
supply effect is negligible.

Cesario's interpretation of attractiveness of park j,
A(j), is that it "reflects its relative (to other parks)
ability to attract trips under identical circumstances -
i.e., if it were the case that all parks were equally
accessible from each population centre". Since the effect of
distance between i and j is separated out in the model, it
is certainly acceptable to define "identical circumstances"
as all parks being equally accessible from centre i.
However, in a vein similar to the argument about
emissiveness, the reader may be inclined to suggest that
contained within A(j) is the competition which park j faces
from the other parks. However, the competition which park j
faces varies with the origin i from which it is being
evaluated and is a constant for all parks as viewed from
origin i. Therefore the competition effect can properly be
assigned to the origin rather than the destination. In other
words, E(i) may contain a combined "supply of and
competition of alternatives" effect. This appears most
clearly in the MacDonald et al. paper (see TN 30) in their
model where:


(1) V(i,j) = k S(i)**a P(i)**B ( SUM A(k)/C(i,k)**Y )**g *
             ( A(j)/C(i,j)**Y ) / ( SUM A(k)/C(i,k)**Y )


(all summations in Equations 1 through 3 are over the m
parks). In this expression


( SUM A(k)/C(i,k)**Y )**g


refers to the supply-generating-demand effect, while


1 / ( SUM A(k)/C(i,k)**Y )


refers to j's competition from all parks as measured from
centre i. Clearly both expressions are unique to origin i
and therefore can be thought of as being contained in E(i)
along with S(i)**a and P(i)**B, so that the above equation
could be rewritten in a form similar to Cesario's (see
Equation 2 below in this review) with E(i) defined as in
Equation 3 of this review.

The suggestion by Cesario (unpublished) that k in his
Equation 5 can be taken to implicitly refer to the
"competing opportunity" effect is clearly incorrect, if the
level of spatial competition varies with the location of the
origin with respect to the competing parks. If the latter is
true, then the competition effect varies with origin i, and
is presumably contained in the estimate of E(i).

In the second stage of his analysis, Cesario seeks to
explain the estimates obtained for all E(i) and A(j) values.
A regression analysis shows 77% of the variance in E(i)
values to be accounted for by origin population. The fact
that origin mass terms are very important in "explaining"
trip flows is well documented in many gravity model studies.
Cesario suggests that so great is the contribution of
population that it is not worth trying to refine the
predictive equation by including other variables or using
different functional forms. The counterargument is that so
long as absolute flows are used as the dependent variable in
a trip distribution study, origin mass, be it population,
manufactured goods or whatever, will inevitably account for
a large proportion of trip flow variance. The relation is
almost as inevitable as the results of a regression study
showing that the variance in the number of deaths in
population centres is largely explained by their population
size. Both beg a question. In the latter, the concern should
presumably be to explain variance in the mortality rate of
certain age groups rather than absolute mortality figures.
And in trip distribution studies, the concern should surely
be to explain variance in the per capita rate of flow
between origins and destinations. Explaining absolute flows
and emissiveness would appear in general to be a fairly
trivial problem; explaining flow and emissiveness rates
almost certainly is not. The dependent variable in this case
would be V(i,j)/P(i). The more subtle possible influences of
socioeconomic variables which are dwarfed in the present
study by the effect of P(i) on V(i,j) and E(i) would have a
chance to be revealed, if indeed they have an effect on the
emissivity rate. Another independent variable which might
well be included in an analysis of emissivity rates is:


( SUM A(k)/C(i,k)**Y )**(g-1)


which incorporates both the origin-specific supply-
generating-demand and competition effects defined in
Equation 1 above.

Despite the above comments, the paper marks a
significant methodological advance in the trip distribution
literature. It was the first to estimate values of A(j) and
E(i) independent of ad hoc assumptions about relevant
variables. And from it useful progress in trip distribution
analysis has been made, as exemplified by the two remaining
papers in this section by MacDonald et al., and Beaman, to
which we now turn.

"A Work Plan for the Development of a Mathematical 
Model to Predict and Explain Overnight Use of Parks" by
MacDonald et al. is generically similar to the model
discussed in the last paper of this section by Beaman. Both
are significant extensions of the model discussed in
Cesario's paper, although estimation of parameters in the
former two depends on an initial estimation of
attractiveness and emissivity parameters using the analysis
of covariance method described in Cesario's paper. Although
parameter estimation is not discussed in the paper by
MacDonald et al., Equation 9 could be rewritten as:


(2) V(i,j) = E(i) A(j) / C(i,j)**Y


WHERE E(i), i = 1, 2, ..., n and A(j), j = 1, 2, ..., m and
Y are parameters to be empirically estimated by the method
discussed in Cesario's paper, and E(i) is defined as


(3) E(i) = k S(i)**a P(i)**B ( SUM A(k)/C(i,k)**Y )**(g-1)


Given estimates of E(i) and A(j), the latter equation can be
solved for values of a, B, Y and g.
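
The algebra linking Equations 1, 2 and 3 can be checked
numerically. The sketch below is illustrative only: the
function names, the two-origin, three-park data and every
parameter value are hypothetical, and S(i) is treated simply
as a positive origin term because it is not defined in this
review. It evaluates Equation 1 directly and compares the
result with Equation 2 using E(i) from Equation 3.

```python
import math


def accessibility(i, A, C, Y):
    """SUM over k of A(k) / C(i,k)**Y, as seen from origin i."""
    return sum(A[k] / C[i][k] ** Y for k in range(len(A)))


def visits_eq1(i, j, k_const, S, P, A, C, a, B, Y, g):
    """Equation 1: supply-generating-demand term times park j's share."""
    acc = accessibility(i, A, C, Y)
    share = (A[j] / C[i][j] ** Y) / acc
    return k_const * S[i] ** a * P[i] ** B * acc ** g * share


def emissiveness(i, k_const, S, P, A, C, a, B, Y, g):
    """Equation 3: E(i) = k S(i)**a P(i)**B (SUM A(k)/C(i,k)**Y)**(g-1)."""
    acc = accessibility(i, A, C, Y)
    return k_const * S[i] ** a * P[i] ** B * acc ** (g - 1)


def visits_eq2(i, j, E_i, A, C, Y):
    """Equation 2: V(i,j) = E(i) A(j) / C(i,j)**Y."""
    return E_i * A[j] / C[i][j] ** Y


# Hypothetical data: 2 origins, 3 parks.
A = [2.0, 5.0, 1.0]                           # park attractiveness A(k)
C = [[10.0, 40.0, 25.0], [30.0, 15.0, 60.0]]  # distances C(i,k)
S = [1.2, 0.8]                                # origin term S(i)
P = [50000.0, 12000.0]                        # origin population P(i)
params = dict(k_const=0.001, a=0.5, B=1.0, Y=2.0, g=0.7)

E0 = emissiveness(0, S=S, P=P, A=A, C=C, **params)
v_eq1 = visits_eq1(0, 1, S=S, P=P, A=A, C=C, **params)
v_eq2 = visits_eq2(0, 1, E0, A, C, params["Y"])
print(math.isclose(v_eq1, v_eq2))
```

The two routes agree because dividing the accessibility sum
raised to g by the same sum leaves it raised to g-1, which is
the origin-specific factor absorbed into E(i).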

Unlike the Cheung models, where attractiveness is
estimated by a predetermined function of specified site
variables, attractiveness in the MacDonald et al., Cesario
and Beaman et al. papers is empirically estimable from
trip distribution data. In addition, the distance exponent
is estimated by a least squares minimising routine, rather
than by the trial and error method used by Cheung.

MacDonald et al. include in their model 1 a supply-
generating-demand effect defined as a non-linear function of
the sum of all parks' attractiveness/distance ratios. In
model 2, this term is replaced by one defining the
probability of a potential camper from i visiting any of the
parks at all. The theoretical underpinnings and derivation
of this probability term are much stronger than for the
equivalent term in model 1. However, the derivation of this
probability relies on the rather strong, and in this case
doubtful, assumption that the probability of visiting any
particular park from origin i is independent of the
probability of visiting any other park. It would seem more
likely that the higher the probability of one park being
visited, the lower the probability of another being visited,
in which case the probabilities would not be independent
(but see TN 19). The consequences of not conforming to this
assumption deserve to be explored before it is used to
define one of the terms of model 2.

With specific regard to the enroute model, which is a
special case of the main destination model, the discussion
does not make clear what the definition is of alternative
parks for an enroute stop. If the definition were all parks
as viewed from a particular link on the highway network,
C(i,j) would only reasonably be defined as the distance
between park j and the nearest point on an arterial highway
for parks on that link. For all other parks, C(i,j) would be
the arterial distance plus side-road distance between link i
and park j. If only parks adjacent to link i are considered
as alternatives, then the definition of C(i,j) given in the
paper is reasonable. It also bears emphasis that the enroute
camper model assumes that on all links the percentage of
campers who are looking for an overnight stopping place is a
constant.


An issue raised indirectly in the MacDonald et al.
paper is that no model exists to predict touring camper
choices. It seems probable that many vacation campers have
regions rather than sites as destinations, and although the
task of modelling spatial choice that involves (first) a
region and (second) a route and stopping places may seem
formidable, the prediction of park use levels may depend on
such a model to an appreciable extent.


Many of the features of the model have already been
commented on with respect to the paper by MacDonald et al.
The basic issue involved in the Beaman et al. model is to
define a model which is valid for any number of
destinations, including a single destination. At the same
time the model requires the two features discussed earlier,
the supply-generating-demand effect of destinations and
their competition effect. Beaman et al. try to satisfy
these conditions in Equation 3, which simplifies to Equation
2 in the special case of the single destination, or isolated
site as it is described in the paper. The expressions in
Equation 3 that may be least familiar to the reader are also
those that simplify to the value 1 when there is only one
destination. Specifically they are PP(o) / SUM P(o,d), which
refers to the "competition from alternatives" effect, and
( SUM (P(o,d)/PP(o))**SE )**SM, which refers to the "supply
generating demand" effect. The justification for the first
expression, which is not completely clear from the text, is
that the denominator is a measure of competition defined in
terms of the sum of all sites' attractiveness/distance
ratios, identical to the denominator of Equations 9 and 10
of MacDonald et al.'s paper. The numerator, which is simply
a constant for any one origin, is required for the isolated
site case, in which the numerator and denominator become
identical and cancel each other. The justification for the
second expression is as follows. The exponent SM is required
to define the positive non-linear effect of supply on use,
but must be less than or equal to 1, to satisfy an earlier
condition in the paper that with more than one site, total
visitation will exceed visitation to a single one of these
sites but will be less than the sum of visitations to each
site, if each were the only site.

In passing, the latter part of this condition seems
unnecessarily rigid. It seems possible that the sum of
visits to, say, two sites might exceed the visitation they
would receive if each were an isolated site. In any event,
relaxation of the assumption would simply entail setting no
limits on the value of SM. SE can be interpreted
behaviorally as implying that sites with a smaller
attractiveness/distance ratio contribute even less to the
supply-generating-demand effect than their relative size
would suggest. Once again this assumption could be readily
relaxed without any change to the structure of the
expression. The reason for PP(o) in the denominator is to
ensure that one of the terms in the summation will always
equal 1, so that the summation will always be greater than 1
for the case of two or more sites, and equal 1 exactly for
the case of the isolated site, no matter what values SE and
SM take on. This satisfies the basic assumption that there
will be extra demand stimulated by the provision of an extra
site, as well as the assumption that the supply generating
demand effect is unity in the case of a single site.

Besides the question of solubility (which is not at
issue in this paper) the main question that arises is
whether or not a more elegant, less complex formulation
could achieve the same effect as this model. Certain
elements, such as PP(o) in the numerator of the first
expression discussed above, are disturbing. In the case of
PP(o), the effect is to add a different constant to the
model for each origin. Without some behavioral
interpretation of the role of this term in the general
model, its only function seems to be to satisfy conditions
for the isolated site model in Equation 2.

However, it is not clear why a simple variant of the
more straightforward model proposed by MacDonald et al.
(Equation 9 in their paper) would not more simply accomplish
all that TN 11 Equation 3 does. Specifically, using TN 11
notation:


(4) V(o,d) = Pop(o) G(o) ( SUM(d) P(o,d) )**φ *
             ( A(d)**[-R(D(o,d))] / SUM(d) A(d)**[-R(D(o,d))] )


will in the case of the isolated site simplify to:

(5) V(o,d) = Pop(o) G(o) ( A(d)**[-R(D(o,d))] )**φ

The difference is that in the isolated site case the supply
generating demand effect will vary non-linearly with the
value of that site's attractiveness/distance ratio. The two
components of Equation 4 can be defined as:


(6) NN(o) = Pop(o) G(o) ( SUM(d) P(o,d) )**φ


WHERE NN(o) = the number of actual park visitors generated
by origin o; and


(7) p(o,d) = A(d)**[-R(D(o,d))] / SUM(d) A(d)**[-R(D(o,d))]


WHERE p(o,d) = the probability of a park visitor from o
choosing to visit d from the set of available alternative
parks; and by definition the SUM, over d, of p(o,d) = 1. In
the isolated case, p(o,d) = 1, i.e., all park visitors from
o visit d. Therefore, in words, the model estimates the
number of park patrons generated from o as a function of o's
characteristics as well as of its overall access to parks
and then assigns each park d a proportion of o's total
visitors based on that park's proportion of the total sum of
parks' attractiveness/distance ratios.
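
The two-stage reading of Equations 6 and 7 can be sketched as
follows. This is only an illustration under stated
assumptions: the attractiveness/distance ratios P(o,d) are
supplied directly as hypothetical numbers rather than computed
from A(d) and D(o,d), the exponent is passed as `exponent`,
and the function names and all values are invented for the
example.

```python
import math


def visitors_generated(pop, g, ratios, exponent):
    """Equation 6: NN(o) = Pop(o) G(o) (SUM over d of P(o,d))**exponent."""
    return pop * g * sum(ratios) ** exponent


def choice_probabilities(ratios):
    """Equation 7: each park's share of the summed ratios for origin o."""
    total = sum(ratios)
    return [r / total for r in ratios]


def predicted_visits(pop, g, ratios, exponent):
    """Equation 4: V(o,d) = NN(o) * p(o,d) for every park d."""
    nn = visitors_generated(pop, g, ratios, exponent)
    return [nn * p for p in choice_probabilities(ratios)]


# Hypothetical origin with three parks, then with one isolated park.
several = predicted_visits(pop=1000.0, g=0.1, ratios=[4.0, 2.0, 2.0], exponent=0.5)
isolated = predicted_visits(pop=1000.0, g=0.1, ratios=[4.0], exponent=0.5)
print(several, isolated)
```

With a single park the probability is 1 and the prediction
reduces to Pop(o) G(o) P(o,d) raised to the exponent, which is
the isolated-site form of Equation 5.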

In any case, provided that estimation problems can be
overcome, as is implied, both the Beaman and MacDonald et
al. models constitute important efforts to produce a more
general and realistic trip distribution model.


CHAPTER III


ATTRACTIVENESS ANALYSIS 


INTRODUCTION 


Attractiveness analysis began early in the CORD Study, 
as did work on visitor flow models, but initially this 
research was not closely tied to visitor flow modelling: 
attractivity of a park was treated as an exogenous variable
by Cheung in TN tile In fact, while the work plan for an 
overnight-use model was being produced, and while Cesario 
was making his initial proposal to develop a model, Ross was 
developing attractivity indices for twelve parks in 
Saskatchewan. To develop measures giving an ordinal
indication of the relative attractiveness of different
parks, he proposed a procedure that has been used to analyze
the attractiveness of shopping centres. His scheme was
accepted and is reported in TN 2.

From the articles in Chapter I, the reader will know not
only that Cheung developed attractiveness measures for
Saskatchewan parks, but also that the Cesario modelling
effort produced a methodology for calculating park
attractiveness. The Cesario model has been used to calculate
attractiveness measures for the same Saskatchewan parks for
which Cheung and Ross have calculated attractiveness values.
So, three different procedures have been used to calculate
attractiveness measures for certain Saskatchewan parks. The
results achieved are compared in CORD Study TN 9, which
raises the question as to whether Cheung, Cesario, and Ross
were measuring the same thing. But the matter of how to
compare the Cheung, Ross, and Cesario measures of
attractivity has been a cause of some controversy. CORD
Study TN 28 was developed by Cheung because he felt that a
comparison should be made on the basis of the
"effectiveness" of various attractiveness measures in
models. Cesario has objected to Cheung's use of his measure
in models for which it was not designed. He argues that
unless his measure can be appropriately transformed for use
in other models it should be used only in the type for which
it was developed. But this introduction is not the place for
discussions: "best", "effectiveness" and "appropriateness"
have to do with the proper structuring of models, a problem
which is taken up in CORD Study TN 19 and 35 in Chapter VII.

One paper in this chapter, although related to
attractiveness analysis, is not directly related to the
other papers. In the years 1971-3, the Planning Division of
Parks Canada carried out Wild River Surveys to collect data
that would be used to designate rivers with National Park
potential. The first methodological report produced (based
on data from the 1971 survey) was Quantitative Comparison of
River Landscapes, which described the application of a
technique developed by Luna B. Leopold to select wild rivers
for designation as National Wild Rivers. Leopold's technique
did not focus on perceptions but on determining certain
properties of the resource base. This analysis, and other
reports about the wild rivers studied, were produced
"outside" the CORD Study.

The only CORD Study paper having to do with the Wild
River Survey is TN 27, which reports on how data collected
in a 1972 Wild River Study have been used to develop
predictions of how expert canoeists will perceive the
attractiveness of various sites on wild rivers. The major
point made in the paper is methodological; nevertheless, it
is included in this chapter. The paper also has practical
implications for campsite design, roadway scenic quality
evaluation, and a number of other tasks which involve the
evaluation of landscape.


CORD STUDY TECHNICAL NOTE 2


ATTRACTIVITY INDICES


J. H. C. Ross


ABSTRACT


In this study, a methodology is presented to estimate the
attractivity of parks to a given type of user from data
concerning the spatial interactions of individuals who
patronize these sites. The model proposed is
utilized to produce an ordinal scale defining the relative
attractivity of the alternative sites in the system being
studied.

The 1969 CORD Study Park User Survey data were used for
this analysis. Actually, only information collected at 12
Saskatchewan parks on 3,254 individual trips was used out
of a larger data set. Documentation of the data collection
and processing is not provided but other sources are cited
where this information is available.

Using the data on origin-destination flows just
referred to, an attractivity ranking of the twelve parks was
established. The consistency of the decisions which led to
this ranking was found to be significantly non-random,
although there is a degree of confusion evident in how
different people appeared to rate the parks. Reasons for
this confusion are introduced that suggest that much of the
confusion is attributable to the nature of the data and to
inhomogeneity of user purpose, rather than being a result of
problems with the methodology.

A postscript contains comments on the kind of research
to which the original research has led.


INTRODUCTION 


     With all the difficulties involved, it seems
     entirely possible to develop specific, and rather
     objective, rating scales for different outdoor
     recreation areas and for major different uses of
     each. These scales would have great utility in
     planning, other research, and administration. The
     talents and knowledge of different kinds of
     specialists might well be used in devising and
     testing such rating scales. (Reference 6)


A general dissatisfaction with the more traditional
methods of deriving such scales has led the author to
develop an approach which avoids the strong metric
assumptions necessary for the derivation of site attraction
scales by the gravity model approach (see Reference 18) yet,
at the same time, avoids arbitrarily assigning attraction
values solely on the basis of the number and capacity of
site facilities. (See References in TN 9 and 16.)


METHODOLOGY


The model of human behavior proposed is concerned with
attractiveness, A(j), which is a measure of an inherent
property of site j, and distance D(r,j), a measure of the
difficulty of travelling from the residence of visitor r to
site j. It is assumed that the effects of increasing
distance are such that the "degree of impedance" always
increases as distance increases - although the function
relating impedance and distance need not be defined (see TN
14). This assumption really means that for any distance
D(r,j), travelling further is seen as being more costly and
less desirable unless something about the destination
reached defrays (compensates for) the extra travel. Wolfe
(Reference 24) implies that this assumption is not always
valid, but it is accepted here. Now, in one-dimensional
space, consider an individual residing at point r as shown
in Figure 1. The assumptions stated imply that choosing to
patronize a site at point 2 means that the site is more
attractive than Site 1. If the individual had gone to Site 1
no similar statement could be made about the relative
attraction of Site 1 compared to Site 2. For the visitor,
distance rather than site attractivity may have been the
"force" resulting in a visit to 1 rather than some more
attractive site.


FIGURE 1


TWO-DIMENSIONAL SPACE


        * Site 3


R ------------ Site 1 ------------ Site 2

        *
        Site 4

The extension of this reasoning to two-dimensional space
involves the assumption that each individual perceives a
given distance to be of the same magnitude regardless of the
distance he must travel. If this assumption is made one may
state that a person implicitly judges the site he selects to
be more attractive than any alternative site which is closer
to his origin than the selected site. One can make no such
judgments with regard to the relative attraction of sites
which are more distant than that which an individual
selected. So, in Figure 1, if one accepts that the
individual's residence is located at point r, while the
points 1 - 4 represent alternative sites, a number of
inferences can be made. For example, if the individual
chooses to visit Site 4 it can be assumed that he judges
that site to be more attractive than any other site closer
to his residence.


The Comparison Matrix 


Now, if for a number of parties we consider that one
individual made the decision to visit a particular site, the
number of times that any site i can be inferred to be more
attractive than any other site j can be recorded to form a
site-by-site comparison matrix C (Figure 2). In this matrix
the i,jth entry is the number of times that site i was
chosen over site j. In order to prepare such a matrix for
further analysis, one must, taking each sample individual in
turn, calculate the distance from his residence to each of
the alternative sites. Then, denoting the site visited as i,
C(i,j) is incremented by one for every site j which is
closer to the individual's residence than site i.
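
The tallying procedure just described can be sketched in a few
lines. This is an illustration under assumptions not made in
the paper: residences and sites are given as hypothetical
plane coordinates, distance is taken as the straight line
between them, and the function name and the sample data are
invented for the example.

```python
import math


def comparison_matrix(visits, sites):
    """Build C, where C[i][j] counts visits to i by people living closer to j.

    visits: list of (residence_xy, index_of_site_visited)
    sites:  list of site coordinates (x, y)
    """
    n = len(sites)
    C = [[0] * n for _ in range(n)]
    for home, i in visits:
        # Distance from this visitor's residence to every alternative site.
        d = [math.dist(home, site) for site in sites]
        for j in range(n):
            # Site i was chosen although site j was closer: i preferred to j.
            if j != i and d[j] < d[i]:
                C[i][j] += 1
    return C


# Hypothetical layout loosely following Figure 1 (indices 0-3 = Sites 1-4).
sites = [(1.0, 0.0), (2.0, 0.0), (1.5, 1.0), (0.5, -0.5)]
visits = [((0.0, 0.0), 1),   # lives at the origin, visited Site 2
          ((0.0, 0.0), 0)]   # lives at the origin, visited Site 1
C = comparison_matrix(visits, sites)
print(C)
```

Sites farther away than the one visited add nothing to the
tally, which mirrors the point that no inference can be drawn
about more distant alternatives.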


FIGURE 2


COMPARISON MATRIX: DISTANCE = 200 MILES


Site*    1    2    3    4    5    6    7    8    9   10   11   12
  1      0    0    1   61    2    2    1    1    5    8    4   14
  2      6    0    3    5    5    5    1    5    7    6    1   21
  3     13    4    0   14   47   10    3   10   10   14    3   10
  4     11    0    8    0    9   10    4    6    7   19    5    8
  5      7    0    9   14    0    2    1    9    1    7    1    6
  6      5    0    5    5    8    0    2    4    4    8    2    4
  7     42    3    4   10    7   13    0    1   51   44   40   43
  8     20    1    2   20    4    1    1    0    2   19    0   18
  9     56    2    5   53   50    7    1   43    0   55    0   55
 10    203    3    0    ?    1    1    1    2    8    0    1    9
 11      6    6    1    4    2    5    5    0  146   99    0    5
 12    191    1    4   30    6    7    3    7    4   33    4    0


* The names of sites 1 to 12 are given in Table 1.

From the comparison matrix C, a proportions matrix P
may be determined. The entries in this matrix give the
proportion of times that any site i was judged to have a
higher attraction than site j. An entry in the P matrix is
defined as:


P(i,j) = C(i,j) / (C(i,j) + C(j,i))


WHERE P(i,j) = the proportion of times site i was chosen
               over site j,
      C(i,j) = the number of times site i was chosen over
               site j, and
      C(j,i) = the number of times site j was chosen over
               site i.


Because situations arise in which site i and site j
were never compared, the fact that the comparison is missing
must be noted. Throughout this paper, the value -1.00 is
used to denote this.
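
As a sketch of this step, the function below converts a
comparison matrix into a proportions matrix, using -1.00 for
pairs that were never compared. The function and constant
names are illustrative. The small example takes the Figure 2
entries for sites 1, 8 and 11 (in that order); leaving the
diagonal at 0.00 follows the layout of Figure 3.

```python
MISSING = -1.00


def proportions_matrix(C):
    """P[i][j] = C[i][j] / (C[i][j] + C[j][i]); MISSING if never compared."""
    n = len(C)
    P = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                continue  # diagonal stays at 0.00, as printed in Figure 3
            total = C[i][j] + C[j][i]
            P[i][j] = C[i][j] / total if total else MISSING
    return P


# Entries of Figure 2 for sites 1, 8 and 11.
C = [[0, 1, 4],
     [20, 0, 0],
     [6, 0, 0]]
P = proportions_matrix(C)
for row in P:
    print([round(p, 2) for p in row])
```

Sites 8 and 11 were never compared in either direction, so
both of their mutual entries take the missing-value code.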

In an ideal situation, when all individuals have
similar perceptions as to which sites are most favourable,
the elements of P, P(i,j), take only values of 1 and 0. In
the real world, however, especially when the sites being
compared are very similar in attraction, the value of P(i,j)
will be intermediate between 1 and 0.

Psychophysicists, in attempting to derive measurement
scales from similar paired comparison matrices, have
developed a number of approaches. Many of these are based on
the "Law of Comparative Judgment" developed by Thurstone
(see Reference 22). If we were to use Thurstone's approach,
we would have to assume that each entry in the table could
be viewed as a normal deviation from the column mean. But,
due to the fact that the individual subject's inferred
decision that one site was more attractive than another was
conditional on the spatial arrangement of the sites, one
cannot regard the P(i,j) as normally distributed variables.
Specifically, the large numbers of zeros and ones in the
matrix present problems because the probability of getting
either under Thurstone's assumptions is infinitely small.

As an alternative approach which avoids these problems,
regard any P(i,j) greater than 1/2 as indicating a majority
preferring i, and any value less than 1/2 as a majority
preference for j. When this is done, one can construct a
scale which is consistent with these majority judgments by
finding the average value of each row of the P matrix. These
values are found by summing the valid entries in each row
and dividing by the number of valid entries in that row.
The result is defined to be the index of attraction:


A(i) = SUM e(i,j)P(i,j) / SUM e(i,j)


WHERE A(i)   = the attraction index of site i,
      P(i,j) = the proportion of times site i was chosen over
               site j, and
      e(i,j) = 1 if P(i,j) is not equal to -1.00, 0 otherwise,
and the sum is on j.


Values of the index of attraction calculated using the
matrix presented as Figure 3 are presented as Figure 4.
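
The row-averaging rule can be sketched as below. One point is
an assumption rather than something stated in the text: the
diagonal entry P(i,i), printed as 0.00 in Figure 3, is left
out of the average here, although the definition of e(i,j) as
written would count it. The flag `include_diagonal` and the
function name are illustrative.

```python
MISSING = -1.00


def attraction_index(P, include_diagonal=False):
    """A(i) = mean of the valid (non-missing) entries in row i of P."""
    index = []
    for i, row in enumerate(P):
        valid = [p for j, p in enumerate(row)
                 if p != MISSING and (include_diagonal or j != i)]
        # A row with no valid comparisons has no defined index.
        index.append(sum(valid) / len(valid) if valid else MISSING)
    return index


# Hypothetical three-site proportions matrix with one missing pair.
P = [[0.00, 0.05, 0.40],
     [0.95, 0.00, MISSING],
     [0.60, MISSING, 0.00]]
A = attraction_index(P)
print([round(a, 3) for a in A])
```

Setting `include_diagonal=True` reproduces the literal reading
of the formula and lowers every index, since each row then
gains one extra entry of 0.00.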


Index of Confusion 


A further measure, the index of confusion, may be
calculated. It is defined as the proportion of valid
entries in each row of the P matrix which indicate unanimous
decisions. The preceding implies that it is calculated by
dividing the number of entries in each row by the number of
valid entries in the row, and subtracting the result from
1.00. Values of this index based on the P matrix presented
as Figure 3 are shown in Figure 4.


Consistency 


Consider three sites, i, j, and k. If A(i) is greater
than A(j), and A(j) is greater than A(k), then A(i) must
clearly be greater than A(k). Such a triplet is termed a
transitive or non-circular triad. The weak transitivity of
the P matrix is defined as the degree to which:


IF P(i,j) > .50 and P(j,k) > .50, then P(i,k) > .50 for all
i, j, and k.


If the P matrix is incomplete the standard formulae for
determining the consistency of the matrix cannot be applied
directly. Instead, the n(n-1)(n-2)/6 possible triads must be
examined separately. When the P matrix is incomplete, four
different outcomes are possible when inspecting a triad.
These are:


1. the triad may be non-circular,

2. the triad may be circular or intransitive,

3. the triad may be incomplete but of the form P(i,j) >
   .5 and P(i,k) > .5 - if this is the case it must be
   transitive, and

4. the triad may be unknown because two elements are
   absent (-1 occurs twice).
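
The four outcomes listed above can be sketched as a
classification routine. Several choices here are assumptions
and not part of the paper: a tie at exactly .50 is treated as
a preference for the second site of the pair; an incomplete
triad in which one site loses both known comparisons is
counted with outcome 3, since it too cannot be circular; and
an incomplete triad whose two known preferences form a chain
is reported as unknown. All names are illustrative.

```python
from itertools import combinations

MISSING = -1.00


def beats(P, a, b):
    """True if a is preferred to b, False if not, None if never compared."""
    if P[a][b] == MISSING:
        return None
    return P[a][b] > 0.5


def classify_triad(P, i, j, k):
    """Classify the triad (i, j, k) into one of the four outcomes."""
    # Preferences taken in order round the cycle i -> j -> k -> i.
    around = [beats(P, i, j), beats(P, j, k), beats(P, k, i)]
    known = [r for r in around if r is not None]
    if len(known) == 3:
        # Circular only if all three preferences run the same way round.
        return "circular" if len(set(known)) == 1 else "non-circular"
    if len(known) == 2 and known[0] != known[1]:
        # The shared site wins both or loses both: no cycle is possible.
        return "transitive-incomplete"
    return "unknown"


def count_triads(P):
    """Tally outcomes over all n(n-1)(n-2)/6 triads of the matrix."""
    counts = {"non-circular": 0, "circular": 0,
              "transitive-incomplete": 0, "unknown": 0}
    for i, j, k in combinations(range(len(P)), 3):
        counts[classify_triad(P, i, j, k)] += 1
    return counts


# Hypothetical example: site 0 beats 1, 1 beats 2, but 2 beats 0.
circular_P = [[0.0, 0.8, 0.4],
              [0.2, 0.0, 0.7],
              [0.6, 0.3, 0.0]]
print(count_triads(circular_P))
```

For the twelve sites of this study the routine would inspect
12 x 11 x 10 / 6 = 220 triads.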


Kendall's Coefficient of Consistency (see Reference 15) is
defined as:


K = 1.0 - (NC/MC)


WHERE NC = the number of circular triads observed, and
      MC = the maximum number of circular triads possible.


FIGURE 3


PROPORTIONS MATRIX: DISTANCE 200 MILES


Site*      1      2      3      4      5      6

  1     0.00   0.00   0.07   0.85   0.22   0.29
  2     1.00   0.00   0.43   1.00   1.00   1.00
  3     0.93   0.57   0.00   0.64   0.84   0.67
  4     0.15   0.00   0.36   0.00   0.39   0.67
  5     0.78   0.00   0.16   0.61   0.00   0.20
  6     0.71   0.00   0.33   0.33   0.80   0.00
  7     0.98   0.75   0.57   0.71   0.88   0.87
  8     0.95   0.17   0.17   0.77   0.31   0.20
  9     0.92   0.22   0.33   0.88   0.98   0.64
 10     0.96   0.33   0.00   0.92   0.13   0.11
 11     0.60   0.86   0.25   0.44   0.67   0.71
 12     0.93   0.05   0.29   0.79   0.50   0.64

Site*      7      8      9     10     11     12

  1     0.02   0.05   0.08   0.04   0.40   0.07
  2     0.25   0.83   0.78   0.67   0.14   0.95
  3     0.43   0.83   0.67   1.00   0.75   0.71
  4     0.29   0.23   0.12   0.08   0.56   0.21
  5     0.13   0.69   0.02   0.88   0.33   0.50
  6     0.13   0.80   0.36   0.89   0.29   0.36
  7     0.00   0.50   0.98   0.98   0.89   0.93
  8     0.50   0.00   0.04   0.90  -1.00   0.72
  9     0.02   0.96   0.00   0.87   0.00   0.93
 10     0.02   0.10   0.13   0.00   0.01   0.21
 11     0.11  -1.00   1.00   0.99   0.00   0.56
 12     0.07   0.28   0.07   0.79   0.44   0.00

* -1.00 indicates a missing value.


NOTE: The names of sites 1 to 12 are given in Table 1.


Reasoning that the maximum degree of inconsistency should be. 
observed judgments between any sites are made at random, the 
following procedure has been adoptede Create a dummy P- 
matrix's, by replacing all the valid entries below the. 
diagonal with a uniformly distributed random number between. 
FIGURE 4


REORDERED PROPORTIONS MATRIX


Site*     7      2      3     11      9      8      6     12      5      4     10      1

   7   0.00   0.75   0.57   0.89   0.98   0.50   0.87   0.93   0.88   0.71   0.98   0.98
   2   0.25   0.00   0.43   0.14   0.78   0.83   1.00   0.95   1.00   1.00   0.67   1.00
   3   0.43   0.57   0.00   0.75   0.67   0.83   0.67   0.71   0.84   0.64   1.00   0.93
  11   0.11   0.86   0.25   0.00   1.00  -1.00   0.71   0.56   0.67   0.44   0.99   0.60
   9   0.02   0.22   0.33   0.00   0.00   0.96   0.64   0.93   0.98   0.88   0.87   0.92
   8   0.50   0.17   0.17  -1.00   0.04   0.00   0.20   0.72   0.31   0.77   0.90   0.95
   6   0.13   0.00   0.33   0.29   0.36   0.80   0.00   0.36   0.80   0.33   0.89   0.71
  12   0.07   0.05   0.29   0.44   0.07   0.28   0.64   0.00   0.50   0.79   0.79   0.93
   5   0.13   0.00   0.16   0.33   0.02   0.69   0.20   0.50   0.00   0.61   0.88   0.78
   4   0.29   0.00   0.36   0.56   0.12   0.23   0.67   0.21   0.39   0.00   0.08   0.15
  10   0.02   0.33   0.00   0.01   0.13   0.10   0.11   0.21   0.13   0.92   0.00   0.96
   1   0.02   0.00   0.07   0.40   0.08   0.05   0.29   0.07   0.22   0.85   0.04   0.00

* -1.00 indicates a missing value.


(and including) 1.00 and 0.00. P(i,j)'s above the diagonal
are replaced by the complement of the below-diagonal number.
That is to say:

     P'(i,j) = X,     P'(j,i) = 1.00 - X


WHERE P'(i,j) = the i,jth entry in the dummy
                proportions matrix P', and


      X = a uniformly distributed random number between
          1.00 and 0.00.


The number of intransitive triads in this dummy matrix is
counted and stored. The operation is then repeated, in our
case twenty times, and a running sum and sum of squares of
the number of circular triads is kept. This allows the mean
and standard deviation of the number of circular triads to
be calculated. The mean is taken as a guide to the number of
intransitive triads which usually occur. The standard
deviation allows one to place confidence limits on the
difference between this number and the number of circular
triads observed, thus permitting one to test the hypothesis
that fewer triads have occurred because there are real site
preferences. Verifying whether or not the observed number of
triads is significantly less than the number expected by
chance is the statistical test adopted to show that the
results of a given analysis reflect a real preference
structure.
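
The procedure just described can be sketched in a few lines of Python. This is only an illustration, not the program used in the study: the function names are hypothetical, a pair is read as a preference when its proportion exceeds 0.5, triads containing a missing value (-1.00) or an exact tie are skipped, and the sample standard deviation is used. None of these details is stated in the text.

```python
import itertools
import random
import statistics

MISSING = -1.0  # the figures in this note flag a missing comparison as -1.00


def count_circular_triads(p):
    """Count triads of sites whose pairwise preferences form a cycle.

    p[i][j] is the proportion of judgments favouring site i over site j.
    Assumption: a triad is skipped if any of its pairs is missing or tied.
    """
    circular = 0
    for i, j, k in itertools.combinations(range(len(p)), 3):
        pairs = (p[i][j], p[j][k], p[k][i])
        if any(v == MISSING or v == 0.5 for v in pairs):
            continue
        # i over j, j over k, k over i is one cycle; all three reversed is the other.
        if all(v > 0.5 for v in pairs) or all(v < 0.5 for v in pairs):
            circular += 1
    return circular


def random_dummy_matrix(p, rng):
    """Build a dummy matrix P': X below the diagonal, 1 - X above it."""
    n = len(p)
    dummy = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i):
            if p[i][j] == MISSING:
                dummy[i][j] = dummy[j][i] = MISSING  # keep missing cells missing
            else:
                x = rng.uniform(0.0, 1.0)
                dummy[i][j] = x
                dummy[j][i] = 1.0 - x
    return dummy


def simulate(p, runs=20, seed=0):
    """Mean and standard deviation of circular triads over `runs` dummy matrices."""
    rng = random.Random(seed)
    counts = [count_circular_triads(random_dummy_matrix(p, rng)) for _ in range(runs)]
    return statistics.mean(counts), statistics.stdev(counts)
```

Calling `simulate(p)` on an observed proportions matrix gives the two quantities against which the observed count from `count_circular_triads(p)` is compared.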


Visual Interpretation of the P Matrix 


Although the proportions matrix may be interpreted
visually, it is helpful to reorder the rows and columns of
the matrix in such a way as to place the highest scoring
sites at the top and left of the matrix. Figure 4 presents
the data of Figure 3 reordered from high to low site
attractiveness as given in Table 1. Ideally, if the matrix
were perfectly transitive, and all entries unanimous, all
values to the right and above the diagonal would be ones and
all to the left and below zeros; however, such a situation
seldom arises.

Several useful observations can be made from an
inspection of the reordered matrix. The inconsistent
judgments can be identified as the values greater than .5
below the diagonal, or as values less than .5 above it.
Similarly, the degree of confusion about the attractivity of
any site may be readily observed. So, although the display
of the P matrix in this fashion adds no new information, it
is useful in pinpointing sites which do not appear to fit
into the general pattern of the system being studied.
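
As a rough illustration of this reordering and inspection, the sketch below permutes a small matrix by attraction index and lists the below-diagonal cells that exceed .5. The function names are hypothetical; the three-site excerpt uses the entries for sites 2, 3 and 7 from Figure 3 and their attraction indices from Table 1.

```python
def reorder(p, order):
    """Return p with rows and columns permuted into `order` (0-based indices)."""
    return [[p[i][j] for j in order] for i in order]


def inconsistent_cells(p_ordered, missing=-1.0):
    """List (row, col, value) for below-diagonal cells with value > 0.5."""
    return [(r, c, p_ordered[r][c])
            for r in range(len(p_ordered))
            for c in range(r)
            if p_ordered[r][c] != missing and p_ordered[r][c] > 0.5]


# Excerpt for sites 2, 3 and 7 (in that input order).
p = [[0.00, 0.43, 0.25],
     [0.57, 0.00, 0.43],
     [0.75, 0.57, 0.00]]
attraction = [0.732, 0.730, 0.821]

# Highest attraction first, as in Figure 4.
order = sorted(range(len(attraction)), key=lambda s: -attraction[s])
flagged = inconsistent_cells(reorder(p, order))
```

Here `flagged` picks out the single cell in which site 3 is preferred to site 2 by 57 percent of judgments even though site 2 has the marginally higher attraction index.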


DATA BASE AND PREPARATION 


The study was based on Saskatchewan data collected as
part of the CORD Study Park User Surveys (see the Data
Documentation Volume). The data received by the author were
in the form of lists of users at twelve parks in
Saskatchewan. Each individual record contained the user's
origin and the date he visited the park. These data were
tabulated by hand to get origin-destination flows. These
flow data were then keypunched by punching the code number
of each park, the coordinates of each locatable origin from
which visits to that park were made, and the number of such
visits. In summary, the data analyzed consisted of one card
TABLE 1


INDICES OF ATTRACTION AND CONFUSION: 200 MILES


Maximum Distance = 200 Miles


                                  Index of
Site   Site Name          Attraction   Confusion

  1    Buffalo Pound         .190         .909
  2    Cypress Hills         .732         .545
  3    Duck Mountain         .730         .909
  4    Echo Valley           .277         .909
  5    Good Spirit           .390         .909
  6    Greenwater            .456         .909
  7    Prince Albert         .821         .727
  8    Moose Mountain        .473         .900
  9    Pike Lake             .614         .727
 10    Rowan's Ravine        .266         .727
 11    Battlefords           .619         .900
 12    Besant                .439         .727


for each origin; each card contained the identity number of
the origin, the origin name, its coordinates, and the number
of visitors from that origin to each of the twelve parks.
(For more detail on the data organization see Reference 25.)

One of the flow patterns actually observed is
reproduced as Figure 5. Similar material was prepared for
one-way travel with upper limits of 220 and 867 miles (for
these results see Reference 25). One may find it interesting
to note that Figure 5 is based on a total of 3,254 trips.


Results Of The Study 


The attractivity indices program was run twice, the
first run disallowing any trip exceeding 200 miles in length
(one-way), and the second, which is not reported here,
omitting those longer than 225 miles. The magnitude of the
distance limit imposed is of course arbitrary, but about 200
miles one-way as an upper limit on day trips appears
reasonable to the author. Regardless, the imposition of this
limit invalidated less than three percent of all trips.

Inspection of the comparison matrix C (introduced
earlier in Figure 2) reveals that the number of visitors
which may be inferred to have compared any two sites ranges
from 0 in the case of Sites 8 and 11 to 241 in the case of
Sites 4 and 10. Due to the method of calculating the
proportions matrix P (Figure 3) by dividing P(i,j) by
P(i,j) + P(j,i), the varying numbers in the cells of the
comparison matrix need not prompt undue concern.

FIGURE 5


SASKATCHEWAN PARKS

It is evident from the proportions matrix P that almost
one-third of the entries suggest that users deviate from
unanimity by as much as 25% in showing their preferences.
This indicates that there is no consensus as to whether one
site is more attractive than another. Indeed, only 12 of the
66 comparisons are unanimous. Examination of the confusion
indices (see Table 1) indicates that there are differing
degrees of confusion for different sites. There is clearly
no site about which there is no confusion.

The attraction indices (Table 1) ideally have a range
of 1 (for a site which was never judged to be less
attractive than any other site) to 0 (for a site which was
never judged to be better than any other site). These
indices, although regarded as only ordinal in nature, allow
one to rank the sites. This was done in Figure 4, using the
information from Figure 3. As described earlier, it is in
terms of Figure 4 that one may understand a great deal about
the preference structure discovered.

It remains to assess the proportions matrix to
determine the statistical significance of the results of the
analysis. There were 17 circular triads counted, and after
20 simulations the average number of circular triads
observed in the randomly generated P matrix was 53, with a
standard deviation of 6.09 (this was true with both 200 and
225 mile travel limits). Thus it may be concluded that since
the observed number of circular triads was 5.9 standard
deviations from the mean, the results obtained were
significantly non-random and there was a high degree of
agreement between the sample subjects regarding the
attraction of sites.
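
The figure of 5.9 standard deviations follows directly from the three numbers quoted above, as this short check shows (the variable names are illustrative only):

```python
observed_triads = 17     # circular triads counted in the observed P matrix
mean_random = 53         # mean over the 20 randomly generated matrices
sd_random = 6.09         # standard deviation over the same 20 runs

# Distance of the observed count below the random mean, in standard deviations.
z = (mean_random - observed_triads) / sd_random
print(round(z, 2))  # 5.91
```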


DISCUSSION 


At present (1971) there is no way of assessing
the correctness or accuracy of the results of this study,
apart from reviewing the logic of the approach and the
accuracy of the computer program used to analyze the data.
(This is being written from a 1973 perspective, and a review
of the methodology outlined above has been carried out. See
TN 9 for details.)

It is the author's opinion that the apparent
difficulty of making judgments as to whether one site is
more attractive than another (as revealed by the lack of
unanimity in the P matrix) is rather disappointing, although
if all twelve parks are considered to have similar levels of
attractivity it is logical to find a relatively high degree
of confusion between them. In a previous study (see
Reference 13) eighty percent of the choices resulted in
unanimous decisions. However, a great deal of the problem
may be traced to the nature of the data set itself. The
classification "day-user" may not result in a sufficiently
homogeneous class of users as it includes fishermen,
picnickers and swimmers, to name but a few, all of whom may
perceive the attractiveness of a particular site
differently. Also, in the case of this study, there are many
provincial, local, regional and private recreation areas
which provide alternate destinations for day trips. No
comparable visitation data for these sites were available.


Furthermore, we have no way of knowing whether or not
the individuals who visited any given site felt afterwards
that the attraction of the site had justified the expense of
the trip to it. In other words, the assumption of a
well-informed user implicit in most models of recreation
behaviour may not be valid. Ideally, this approach is best
applied to situations in which it may be assumed that the
individual has sufficient knowledge about all the
alternative sites closer to his origin than the site he
selects to make a rational decision as to which offers the
greatest satisfaction - presumably that which he patronizes
most often. With the present data one does not have this
information, but only knows that a trip was made to a
specific site on at least one occasion.

Finally, the question of whether or not the entire
benefit the individual receives from his day trip can be
attributed to the site he chose to visit is one which has
been addressed here.


CONCLUSION 


The method employed above to analyze the Saskatchewan
day use data has provided a ranking of twelve Saskatchewan
parks. The consistency of the decisions which led to this
ranking has been found to be significantly non-random,
although there is a certain degree of confusion evident in
the ordering of the pairwise comparisons. Still, having some
behaviourally based attractivity assessment for parks is
better than relying on intuition. This is particularly true
when it is recognized that possible reasons for the apparent
high confusion about various parks' attractivity have been
pointed out as probably attributable to the nature of the
data. If the conjectures about why the levels of confusion
were so high are correct, then the method offers even more
promise than is evident from the results achieved.


ATTRACTIVITY INDICES - POSTSCRIPT


The main objectives of the attractivity indices model
just presented were to determine a unique rank ordering of
service sites with respect to their inherent attractions and
to establish the degree of consistency present in the
individual inferred judgments upon which the ranking is
based. Following the establishment of consistent indices of
site attraction, investigation of a data set may proceed
either by modelling of the spatial choices inferred, or by the
statistical explanation of the "pure" attraction indices
which have been revealed. Each of these approaches is
discussed below.


The Concept of Preference Surfaces


The concept of preference surfaces is based on the
assumptions that an individual in need of a service (in this
case recreation) attempts to maximise some function of the
attractiveness of each possible alternative and the distance
to that alternative, and that each individual faces what may
be regarded as a unique array of spatial alternatives. The
suggested thought process that each individual employs is a
mental search routine whereby each alternative spatial
opportunity is compared against all other available
opportunities. He then chooses that alternative which will
maximise his benefit. If, for example, we postulate a
preference function of:


     U(i,j) = A(j)/D(i,j)


WHERE U(i,j) = the net benefit which would accrue
               to a visitor from origin i who chose
               to patronize site j,

      A(j)   = the attraction of site j, and

      D(i,j) = the distance between i and j,

we define a preference surface. Such a surface would be
similar to that shown in Figure 6. Given such a surface, one
can proceed to inspect actual choices made to determine the
extent to which the inferred choices were consistent with
the postulated surface - that is to say, the proportion of
times that any subject i chose to patronize that site j
which yielded the highest value of U. One should note that
in using a preference surface model it is assumed that a
subject will be indifferent to the various spatial
opportunities which lie on the same "contour" of the
surface, e.g. those opportunities (i,j), (k,l), and (m,n) for
which U(i,j) = U(k,l) = U(m,n).
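
A minimal sketch of how such a preference function might be evaluated for one origin is given below. The attraction and distance values are invented for illustration (they are not study data), and the function names are hypothetical.

```python
def utility(attraction, distance):
    """U(i,j) = A(j) / D(i,j) for one origin-site pair."""
    return attraction / distance


def best_site(attractions, distances):
    """Return the site j with the highest U(i,j) for a single origin i.

    attractions: dict site -> A(j)
    distances:   dict site -> D(i,j) measured from the origin
    """
    return max(attractions, key=lambda j: utility(attractions[j], distances[j]))


# Hypothetical alternatives: x and y lie on the same contour, z lies higher.
attractions = {"x": 10.0, "y": 20.0, "z": 30.0}
distances = {"x": 30.0, "y": 60.0, "z": 60.0}

choice = best_site(attractions, distances)  # "z"
```

Comparing `choice` with the site actually visited, origin by origin, gives the proportion of observed choices that are consistent with the postulated surface.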

Initial investigation of the degree to which the
observed recreation patterns in the Saskatchewan day use
data used to develop the results presented earlier in the
paper are explained by various surfaces has been carried out
at the University of Western Ontario. Results obtained
(based on only 600 individuals) appear promising. (See
Reference 13.)


ESTIMATION OF INFERRED SITE ATTRACTION INDICES 
FROM OBSERVED SITE CHARACTERISTICS 


Various attempts to quantitatively relate the number 
and quality of facilities to derived attraction indices have 
FIGURE 6


HYPOTHETICAL PREFERENCE SURFACE

Height at any point (i,j) = U(i,j) = A/D,
WHERE A = the attraction index of alternative sites;
      D = distance to each alternative.


NOTE: The origin of the surface is always located at the
individual's home.


An individual would be indifferent to alternatives x and y
(because the utility of both is .33), due to the fact that
the added attraction of y is compensated for by its greater
distance. He would, however, favour z over either x or y.
It should also be noted that an individual's discriminatory
powers are not perfect, and that each of the infinite number
of contours which may be drawn on this surface really
represents only the bisector of a zone of indifference.
been reported in the literature (see the literature review
in TN 9). Although several studies have reported a high
degree of explanation, the combinatorial rules followed have
generally been specific to each data set, thus rendering the
formulation of general rules difficult. One of the major
problems appears to be that there is little evidence that
the relationship between the supply of facilities and user
response to them is a simple, relatively invariant one -
although in certain cases (such as that treated by Wennergen
and Neilson in Reference 23 in their analysis of demand for
fishing) it may very well be the case. In more complex
situations it would seem that response to an increased
number of any particular type of recreation facilities on
one site should be a function of the relative availability
of that facility on all alternative sites, and of what a
person (party) really wants to do, rather than solely a
function of what is at the site in question.

In view of the difficulties involved in estimating the
influence of increased diversity of facilities on consumers,
it is believed that, at this time, the investigation of the
relationship between site variables and attraction scores
should be approached cautiously, possibly through the use of
non-metric techniques. (See TN 4 on the use of metric
techniques in recreation analysis problems.)
CORD STUDY TECHNICAL NOTE 9


THREE METHODS FOR MEASURING
THE ATTRACTIVENESS OF A PARK -
A COMPARISON


J. Beaman


ABSTRACT 


Three methodologies designed to measure the
attractiveness of outdoor recreation areas are discussed in
this paper. The discussion is on the attractiveness of 12
Saskatchewan parks. A good rank correlation agreement
between the Ross ordinal measure of attractivity and the
Cesario interval measure, but only a weak correlation
between Cheung's interval attractivity measure and the other
measures, is demonstrated.

Regression analysis results indicate that Cesario
measured the attractiveness of a site plus possibly what is
around it, while Cheung measured only the attractiveness of
a particular site. The regression relation derived between
the Cheung and Cesario attractivity measures was used to
calculate a corrected Cheung measure. But this "corrected"
Cheung "destination area attractivity measure" had a lower
rank correlation with Cesario's measure than Cheung's
uncorrected measure.

The paper ends with the very tentative conclusions
that: (1) Ross' and Cesario's indices of attractiveness can
reasonably be considered to measure the same thing (though
Ross in ordinal and Cesario in interval scale terms); and
(2) Cheung measures something different from Ross and
Cesario: specifically, Cheung does not measure the
attractiveness of "general" destination areas but develops a
site-specific measure.


INTRODUCTION 


The relationship between the use and the attractiveness
of parks has intrigued many outdoor recreation researchers
and, consequently, a number of sophisticated studies on park
attractiveness have been generated. It is generally agreed
among outdoor recreation researchers and planners that
park-use levels are affected to a large extent by the
attractiveness of parks. This general agreement evolved as
the result of the simple observation that one park receives
relatively more visitors than another (over the same time
period) from a population centre, indicating that the
difference in distance of the two parks from the population
centre is clearly not the only factor leading to the
different levels of visitation.

If attractiveness is a valid concept, knowledge of 
attractiveness factors will enable a planner to make 
rational decisions about providing, preserving or developing 
these factors to make parks attractive, at least to a 
certain group (market). Thus there are both practical and
academic motives for conducting attractiveness studies.

It would be tedious to continually reiterate that
Cheung, Ross and Cesario, whose measures of park
attractiveness are considered, recognized the issue of
attractiveness to whom, for what. But if one is to avoid
needless qualms about the ideas presented here, it must be
recognized that the Cheung, Ross and Cesario attractiveness
measures discussed are for a fairly well defined type of
park user. In other words, in what follows there is no
endorsement of the idea that attractiveness of a site has
meaning except in the context of a "reference group" of
users, here main-destination day-users. Admittedly this
"group" is not homogeneous. A more elaborate analysis should
define weekday and weekend attractivities separately and
consider attractiveness in relation to main trip purpose
(e.g. picnicking, fishing, etc.). However, even the
reference to main-destination day-users is not continually
repeated because reading and re-reading it becomes tedious.


REVIEW OF LITERATURE ON PARK ATTRACTIVENESS STUDIES 


More than a decade ago, Clawson and Knetsch (see
Reference 6) suggested the possibility of developing
specific and "rather objective" rating scales to measure the
attractiveness of outdoor recreation areas. Subsequently,
various schemes to assess attractiveness were devised and
reported. One type of attractiveness measure involved a
particular characteristic of the site under consideration.
For instance, Grubb and Goodwin (see Reference 14) and the
Corps of Engineers (see Reference 8) used the surface area
of a reservoir as a variable reflecting its attractiveness.
(See also Reference 23.) Extending the concept of using a
physical characteristic of a site as a variable reflecting
site attraction, Cesario and others (see Reference 3) used
several site characteristics to construct an attraction
index for parks.

Taking a different approach, Van Doren (see Reference
12) conducted an "evaluation" of site attractiveness. In a
camper study, he devised a camping attractiveness index for
each of fifty Michigan state parks based on fifty-five
variables related to (a) outdoor recreation activities, (b)
natural environmental resources, and (c) camping facilities
and services. Ellis used and developed Van Doren's approach
to obtain measures of attractiveness that he employed in
systems models. He also developed a measure of his own (see
Reference 9, 102). Recently, Ellis and Ker (Reference 11)
used the Van Doren approach to study the attractiveness of
skiing areas, while Auger (Reference 1) used it to study the
attractiveness of Quebec parks.

Both of the approaches described to this point have
something in common, in that they relate park attractiveness
to the characteristics of a park rather than deducing a
park's attractiveness from the behavior of its users. Cheung
followed the lead of Clawson and the others cited and
suggested that the attractiveness of a park for
main-destination day-use is a function of the following form
(for details, see TN 1):


(1)  T(j) = Σ_e S(e) Σ_m R(m) Q(m)


WHERE:
     T(j) = attractiveness of park j,
     S(e) = relative popularity rating of activity e,
     R(m) = relative importance rating of facility m,
     Q(m) = rank or score of facility m, according to
            its quantity or quality.
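
To make the structure of Equation 1 concrete, the following sketch evaluates it for one park. It assumes that the inner sum runs over the facilities serving each activity, which the text does not spell out (TN 1 gives the details), and all ratings and scores are invented for illustration.

```python
def cheung_attractiveness(popularity, facilities):
    """T(j) = sum over e of S(e) * sum over m of R(m) * Q(m), for one park j.

    popularity: dict activity e -> S(e)
    facilities: dict activity e -> list of (R(m), Q(m)) pairs
                (assumed grouping of facilities by activity)
    """
    return sum(s_e * sum(r * q for r, q in facilities[e])
               for e, s_e in popularity.items())


# Hypothetical ratings for a park offering two activities.
popularity = {"swimming": 0.6, "picnicking": 0.4}
facilities = {
    "swimming": [(0.7, 3), (0.3, 2)],   # e.g. beach and change house
    "picnicking": [(1.0, 4)],           # e.g. picnic tables
}

t_j = cheung_attractiveness(popularity, facilities)  # 0.6*2.7 + 0.4*4.0 = 3.22
```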


In defining the measure it is accepted "that although
not everybody will judge a facility in the same way, or
perceive the same site factors as being the determinants of
site attraction for day-visitors, there is a general
consensus which can be reached" (quoted from unpublished
material by Cheung). Indeed, Clawson and Knetsch (see
Reference 7) have remarked, "Individual tastes vary greatly,
yet there is some consensus as to what is good and what is
fair, and there would often be general agreement as to what
is poor".

In contrast to procedures that define park
attractiveness based on park characteristics, Ross (see TN
2) suggested that the attractiveness of a site may be
defined by studying consumer preferences, if certain
conditions are satisfied. Basically, Ross considered that
people show a preference when they travel to a site that is
further than some alternative that would "serve their
purpose". Employing logic to make deductions from his
assumptions, Ross devised a three-step scheme to derive
attractiveness measures for the same Saskatchewan parks for
which Cheung had derived attractiveness figures.

The Ross and Cheung attractiveness measures are two of
the three measures compared here. The third measure
discussed is based on park users' behavior. Cesario (see
References 4, 5 and TN 4) hypothesized that the number of
visits, V(i,j), made from an origin (i) to a destination
(j) is a function of a number of explanatory variables
related to (1) some characteristics of the origin (i), (2)
some characteristics of the destination (j), and (3) some
costs due to the spatial separation of (i) and (j). He
proposed:


(2)  V(i,j) = k E(i) A(j) f(C(i,j)) e(i,j)


WHERE:
     V(i,j)    = number of visits made from origin (i)
                 to destination (j) during a time period,
     k         = a proportionality constant,
     E(i)      = emissiveness, or origin effect, of (i),
     A(j)      = attractiveness, or destination effect, of (j),
     f(C(i,j)) = a function of the cost, C(i,j), of getting to
                 destination (j) from origin (i),
     e(i,j)    = an error term.
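
The multiplicative form of Equation 2 can be illustrated as follows. The cost function f is not specified in this note, so the sketch assumes a power function f(C) = C^(-b) with a hypothetical exponent b, and it leaves out the error term; the input values are invented.

```python
def predicted_visits(k, emissiveness, attractiveness, cost, cost_exponent=2.0):
    """V(i,j) = k * E(i) * A(j) * f(C(i,j)), with the error term omitted.

    Assumption: f(C) = C ** (-cost_exponent); the source leaves f unspecified.
    """
    return k * emissiveness * attractiveness * cost ** (-cost_exponent)


# Hypothetical origin and destination.
visits = predicted_visits(k=1.0, emissiveness=1000.0, attractiveness=2.0, cost=10.0)
# 1.0 * 1000 * 2.0 * 10**-2 = 20 visits
```

Because the model is multiplicative, doubling A(j) doubles the predicted visits from every origin, which is what allows attractiveness to be estimated as a destination effect separate from distance.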


The fact that Cesario discussed how to derive the
relationship between his attractiveness measure and park
characteristics should not be taken to indicate that this
measure is like the Cheung measure. Cheung defines the
relation between attractiveness and park characteristics,
while Cesario suggests that after people's response to a
park has been estimated by obtaining an attractiveness value
for a particular type of use, similar attractiveness values
can be analyzed to see how people's responses are related to
park characteristics. (TN 29 suggests a related strategy
for assessing people's response to the supply of facilities
available to them.)


THE ATTRACTIVITY MEASURES ANALYZED 


As noted earlier, Cheung and Ross obtained
main-destination day-use visitation attractivity indices for
the same twelve Saskatchewan parks. Because their day-use
attractiveness measures were available, Cesario calculated
his measures for the same parks, with the assumption that
the emissiveness of origins was proportional to the number
of people resident in the origin areas. (On why this could
or should be done see TN 4.) Thus, the attractivity measures
presented in Table 1 were available for analysis.


Comparison of the attractivity measures should not be
made using the Pearsonian correlation coefficient because
Ross' measure is ordinal; so analysis was begun by computing
the rank correlation coefficients shown in Table 2.
TABLE 1


THE ROSS, CESARIO, AND CHEUNG ATTRACTIVENESS
INDICES FOR TWELVE SASKATCHEWAN PARKS


Park               T(r)   Rank    T(f)   Rank     T(c)   Rank

Buffalo Pound      .190    12    0.155    12     96.12     5
Cypress Hill       .732     2    3.083     2     45.26    11
Duck Mountain      .730     3    2.609     4    126.40     1
Echo Valley        .277    10    0.477    10    112.05     3
Good Spirit        .390     9    0.666     9     76.56     8
Green Water        .456     7    0.898     6     61.46     9
Prince Albert      .821     1    1.416     5     88.75     7
Moose Mountain     .473     6    3.846     1    113.11     2
Pike Lake          .614     5    0.704     7     96.10     6
Rowan's Ravine     .266    11    0.683     8     59.01    10
Battleford's       .619     4    2.886     3    104.28     4
Besant             .439     8    0.373    11     26.60    12

T(r) - Ross' attractiveness indices

T(f) - Cesario's attractiveness indices

T(c) - Cheung's attractiveness indices


DISCUSSION 


When Table 1 is examined, it becomes clear that a major
reason for the high rank correlation of .77 between
Cesario's attractiveness measures and those of Ross is that
six parks have almost the same ranks on the two measures.
Actually, the ranks of six parks agree fairly well when the
Cheung and Cesario attractivity measures are compared, but
in this case an overall rank correlation of only .25 is
observed.
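
The text does not say which rank correlation coefficient was computed. As a check under the assumption that it is Spearman's coefficient, the sketch below applies the standard formula for untied ranks to the Ross and Cesario ranks as read from Table 1; the function name is illustrative.

```python
def spearman_rho(ranks_a, ranks_b):
    """Spearman rank correlation for two untied rankings of the same items."""
    n = len(ranks_a)
    d_squared = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1 - 6 * d_squared / (n * (n * n - 1))


# Ranks in Table 1 order (Buffalo Pound ... Besant).
ross_ranks = [12, 2, 3, 10, 9, 7, 1, 6, 5, 11, 4, 8]
cesario_ranks = [12, 2, 4, 10, 9, 6, 5, 1, 7, 8, 3, 11]

rho = spearman_rho(ross_ranks, cesario_ranks)
print(round(rho, 2))  # 0.77
```

The squared rank differences sum to 66, and four parks (Prince Albert, Moose Mountain, Rowan's Ravine and Besant) account for 59 of that total.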

However, having seen how well the Cesario and Ross
attractivity measures agree, one should note that the Cheung
measure possibly should not exactly agree with the Cesario
attractivity measure. Cesario claims that his emissiveness
measure includes an alternative factor component and because
of the symmetry of his equation it is plausible to suggest
that attractivity is also affected by a destination
alternative factor. Cheung's measure, on the other hand,
deals only with a given site. It is possible that Cesario's
measure reflects both positive and negative effects of the
area around a park. (This point is elaborated in TN 33.)
This suggests that the Cesario attractivity measure, T(f),
TABLE 2


RANK CORRELATION COEFFICIENTS
BETWEEN THE ROSS, CESARIO, AND CHEUNG
ATTRACTIVENESS INDICES FOR 12 SASKATCHEWAN PARKS


          T(r)     T(f)     T(c)

T(r)       x       .77      [illegible]
T(f)                x       .10
T(c)                         x

T(r) - Ross attractiveness indices
T(f) - Cesario attractiveness indices
T(c) - Cheung attractiveness indices


could be a function of Cheung's attractivity measure, T(c),
and Cheung's (1972) alternative factor A(c):


(3)  T(f) = f(T(c), A(c))

which might be:

(4)  T(c)/T(f) = C0 + C1 A(c)


The possible existence of such a relation is
demonstrated by the results presented in Table 3. In
particular, it is interesting to note that by using Equation
5,


(5)  T(f) = T(c)/(27A(c) - 126)


based on parameters for Regression (1) of Table 3, one
predicts the twelve "corrected" values of Cheung's
attractivity measures given in Table 4. But when rank
correlations are carried out using the "corrected" Cheung
measure, one notes that the correlation reported in Table 4
is observed. The correction has not improved the rank
correlation between the Cheung and Cesario attractivity
measures as might have been expected.
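
A worked instance of the correction may help. The sketch uses the Regression (1) parameters from Table 3 (C0 = -126, C1 = 27) and the Cheung value for Buffalo Pound from Table 1. The alternative factor A(c) = 15 is an assumption: the note does not list the A(c) values, and 15 is simply the whole number of neighbouring sites that reproduces the Table 4 entry for that park.

```python
def corrected_cheung(t_c, a_c, c0=-126.0, c1=27.0):
    """Estimate T(f) from T(c) via T(c)/T(f) = C0 + C1*A(c), Regression (1)."""
    return t_c / (c0 + c1 * a_c)


# Buffalo Pound: T(c) = 96.12; A(c) = 15 is back-calculated, not from the source.
estimate = corrected_cheung(96.12, 15)
print(round(estimate, 3))  # 0.345
```

Note that the denominator is negative whenever A(c) is below 126/27 (about 4.7), which is how a negative "corrected" value such as the one shown for park 2 in Table 4 can arise.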


Even though the results raise questions without 
providing impressive evidence for a given answer, the 
relation defined by Regression (1) and, in fact, the 
relation implied by any significant regression coefficient 
TABLE 3


RATIO OF CHEUNG'S ATTRACTIVENESS MEASURE, T(c),
TO CESARIO'S ATTRACTIVENESS MEASURE, T(f),
EXPLAINED BY A DESTINATION ALTERNATIVE FACTOR,
A: T(c)/T(f) = C0 + C1 A


Regr.    C0     C1    A = Alternative            F-test           R2
Num.                  Factor defined by          Significance
                                                 Level

  1     -126    27    No. of alternative         10%              0.30
                      sites within 100
                      miles of the park
                      under consideration

  2      -49   144    1/D**1/2 as used by        Not              0.18
                      Cheung 1972                Significant


of the alternative factor means that when a site is
"imbedded" in a collection of other sites, its "true" or
"absolute" (Cheung type) attractivity is not estimated by
the Cesario attractiveness measure. In fact, a significant
coefficient suggests that persons may be making decisions
based first on selecting a region and then selecting a site
within a region (rather than considering all sites as
discrete). It is possible to believe that persons may view a
number of parks in an area as equally attractive (in
Cheung's sense). They may view similar parks in the same
area much as they view a single park with a number of
different day-use areas. In this case, visitor flows to an
area, not to each site, would be what reflect regional
attractivity. In fact, a sophisticated analysis of total
flow might give an overall attractivity number that could be
"decomposed" using flows to each one of several sites in an
area to get absolute Cheung-type site-specific attractivity
measures. (See TN 33.)


CONCLUSION 


The preceding may have glossed over what may prove to
be the most important point made in this article - that
Cheung (see TN 1), following the lead of others, found it
appropriate to suggest that the use of a site will be
influenced by the number of sites around it and their
TABLE 4


RANKINGS OF ATTRACTIVENESS
OF TWELVE SASKATCHEWAN PARKS
BASED ON CESARIO'S MEASURE*
AND AN ESTIMATE OF IT
BASED ON THE RELATIONSHIP:


T(f) = T(c)/(-126 + 27A(c))


         Rank of                Rank of
Park      T(f)       T(f)      Est. T(f)    Est. T(f)

  1        12       0.155          9          0.345
  2         2       3.083         12         -2.514
  3         4       2.609          2          2.006
  4        10       0.477          5          0.778
  5         9       0.666          8          0.532
  6         6       0.898          4          0.976
  7         5       1.416          1          9.861
  8         1       3.846          6          0.661
  9         7       0.704          3          1.525
 10         8       0.683         10          0.234
 11         3       2.886          7          0.610
 12        11       0.373         11          0.156

P         = a particular park
Est. T(f) = estimated Cesario attractiveness factor for P
T(c)      = actual Cheung attractiveness factor for P
A(c)      = number of parks that are within 100 miles of P

* Rank correlation of T(f) and Est. T(f) = .23


relative attractivitye Work carried out after the first 
version of this paper was written (see TN 33) has shown that 
if the kind of effect that Cheung thought existed did in 
fact, exist, Cesario would estimate it as part of his 


attractiveness measuree Cheung's attractiveness measure 
obviously does not depend on what is around a parke So; the 
relationship between the Cheung and Cesario measure 


established here was to be expected on theoretical groundse 
In light of these findings, two tentative conclusions 
are clear3 (1) Ross? and Cesario's indices of 
attractiveness can reasonably be considered to measure much 
the same thing (though Ross in ordinal and Cesario in 
interval scale terms) and (2) Cheung measures something 
different from Ross and Cesario in that the does not measure 
the attractiveness of "general" destination areas, but 
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develops a site specific measuree 
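
The rank correlation quoted under Table 4 can be checked from the two rank columns. The sketch below assumes that "rank correlation" means Spearman's coefficient for untied ranks; the source does not name the statistic, so that choice is an assumption, as are the function and variable names.

```python
# Ranks as read from Table 4, parks 1 through 12.
rank_actual = [12, 2, 4, 10, 9, 6, 5, 1, 7, 8, 3, 11]
rank_estimate = [9, 12, 2, 5, 8, 4, 1, 6, 3, 10, 7, 11]


def spearman_rho(r1, r2):
    """Spearman coefficient for untied ranks: 1 - 6*sum(d^2) / (n*(n^2 - 1))."""
    n = len(r1)
    d2 = sum((a - b) ** 2 for a, b in zip(r1, r2))
    return 1.0 - 6.0 * d2 / (n * (n * n - 1))


rho = spearman_rho(rank_actual, rank_estimate)
print(round(rho, 2))  # 0.23
```

Under that assumption the two rank columns reproduce the 0.23 given in the table footnote.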

An overshadowing concern in this paper has been that
one must determine the attractiveness of a park in terms of
the types of activities it offers to the visitors who are
there for a given reason. Attractivities of "activity
packages" for different types of users must ultimately be a
focus of attention. A park may be very attractive to a
certain group of people having a certain time budget (and
depending on their interests), while it may be not at all
attractive to another group of people, either because of the
facilities offered or time constraints. In this analysis the
visitors' perceptions of attractiveness have been aggregated
to some degree in considering main-destination day-users as
a homogeneous group of users by considering that they use
parks for a single reason and that all such users have the
same "utility" scale.

If a great value of site attractiveness studies lies in
providing insights into the social psychology of
participation in outdoor recreation, much interesting work
remains to be done on measuring attractivity, particularly
as it relates to attractivity to whom, for what. It seems
clear that Ross and Cesario have given important, new and
highly related measures that can be used in pursuing
research and planning ends related to measuring the
attractiveness of recreation sites or areas.
A COMPARISON OF THE REGRESSION RESULTS OF APPLYING
TWO ATTRACTIVENESS FACTORS TO ESTIMATE PARK USE


H.K. Cheung


ABSTRACT


In this paper, the results of using two attractiveness
factors in regressions designed to estimate park use are
compared. The two attractiveness factors were derived by
two independent approaches. Cheung's measure was derived by
applying a formula that relates attractiveness to the
characteristics of a park and the popularity of the
activities that can be participated in within the park.
Cesario's measure, in contrast, is defined by visitor flows
from various origins to parks.

The explanatory power associated with each
attractiveness factor was found to depend on the particular
form of the estimating equation used. This finding led to
the conclusion that neither the Cheung nor the Cesario
attractiveness factor performs better than the other.


INTRODUCTION 
Cheung, in an attempt to explain the variance found in
a set of Saskatchewan main destination day-user data (see TN
1), developed the following model using multiple regression
analysis:


(1)  V(i,j) = 1.33 + (120.31P(i) - 36.60P(i)A(i)
              + 1.25T(j) - 104.56)/g(D(i,j))


WHERE


V(i,j) = the number of vehicles, in hundreds, travelling to
         park j from origin i per season,

P(i)   = population, in thousands, of origin i,

D(i,j) = road distance, in miles, from origin i to park j,

A(i)   = alternative factor for origin i
       = Σ 1/D(i,k)**(1/2), D(i,k) ≤ 100 miles (sum on k),
         0 otherwise,

i = 1, ..., 231,

j ≠ k = 1, ..., M, and j refers to the park under
consideration,

T(j)   = attractiveness of park j, and

g(D(i,j)) = D(i,j)**(1/2),     0 < D(i,j) < 20,
g(D(i,j)) = D(i,j),           20 < D(i,j) < 55,
g(D(i,j)) = D(i,j)**(3/2),    55 < D(i,j).
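
To make the pieces of Equation 1 concrete, the following sketch evaluates the distance function g, the alternative factor A(i) and the predicted flow for one made-up origin. The coefficients are those of Equation 1 as read from the source. The function names, the treatment of the boundary distances (20 and 55 miles, which the definition leaves open) and the example inputs are all assumptions for illustration.

```python
import math


def g(d):
    """Piecewise distance function of Equation 1 (d in miles, d > 0).

    The source gives strict inequalities, so d == 20 and d == 55 are
    undefined there; putting both in the middle branch is an assumption.
    """
    if d <= 0:
        raise ValueError("distance must be positive")
    if d < 20:
        return math.sqrt(d)
    if d <= 55:
        return d
    return d ** 1.5


def alternative_factor(distances_to_other_parks):
    """A(i): sum of 1/sqrt(D(i,k)) over other parks k within 100 miles."""
    return sum(1.0 / math.sqrt(d) for d in distances_to_other_parks if 0 < d <= 100)


def predicted_vehicles(p, a, t, d):
    """Equation 1: vehicles (hundreds per season) from origin i to park j."""
    return 1.33 + (120.31 * p - 36.60 * p * a + 1.25 * t - 104.56) / g(d)


# Hypothetical origin: 50 thousand people, 36 miles from park j with
# T(j) = 100, and other parks 25, 64 and 150 miles away (the last one
# lies beyond 100 miles and so contributes nothing to A(i)).
a_i = alternative_factor([25.0, 64.0, 150.0])
flow = predicted_vehicles(50.0, a_i, 100.0, 36.0)
print(a_i, flow)
```

The example shows how nearby alternatives enter only through A(i), which scales down the population term, while distance to the park divides the whole bracket.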
How Cheung defined and measured T(j), the
attractiveness of park j, is not of immediate concern here,
but it should be noted that T(j) explained less than one per
cent of the total variance in V(i,j), the dependent
variable. Doubts were thus raised about the soundness of the
way T(j) was derived. The question was asked: Given the
same set of Saskatchewan data and the same functional form,
could an independent approach to measuring park
attractiveness explain more variance than did Cheung's?

The purpose of this paper is to answer that question.
A comparison is made of the results obtained by using two
attractiveness factors derived from two approaches
designed to measure park attractiveness, one by Cheung
(1972), the other by Cesario (1973).


THE CHEUNG AND THE CESARIO ATTRACTIVENESS MEASURES 


Traditionally, there have been two approaches to
defining park attractiveness. The first is of an inductive
nature and has so far been used more frequently than the
second, which is a deductive approach. In the inductive
approach specific components (elements) of attractiveness
are designated, often subjectively. They may consist of the
number of picnic tables, the length of a swimming beach,
water quality and so on, depending on the type of users
under study. Each component is given a score, based on some
quantity or quality criteria and a numerical value. The
"attractiveness of a park" is calculated by algebraically
combining the scores. The Cheung attractiveness measure,
used in the Saskatchewan study referred to in the
introduction, is an example of an inductively defined
measure.

Briefly, Cheung took the attractiveness of a park with
regard to a particular use (e.g., main destination day-use)
to be measured by a weighted sum of scores for the day-use
facilities offered at that park. Weights were defined on
the basis of the popularity of activities, among other
considerations, but information on the day use of the park
was not used in defining its attractivity.
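
As a rough illustration of an inductively defined measure, the sketch below computes a weighted sum of facility scores. It is not Cheung's actual formula (see TN 1 for that); the facility names, scores and weights are invented for the example.

```python
def weighted_attractiveness(scores, weights):
    """Weighted sum of facility scores; every scored facility needs a weight."""
    return sum(weights[name] * score for name, score in scores.items())


# Hypothetical day-use facility scores for one park, and hypothetical
# weights standing in for the popularity of the related activities.
scores = {"picnic_tables": 3.0, "beach_length": 2.0, "water_quality": 4.0}
weights = {"picnic_tables": 0.5, "beach_length": 0.3, "water_quality": 0.2}

park_score = weighted_attractiveness(scores, weights)
print(park_score)
```

Note that nothing about observed visits enters the calculation, which is what distinguishes this approach from the deductive one.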

The deductive approach to defining a measure of park
attractiveness involves computing a numerical value
reflecting the attractiveness of a park based on analysis of
visitor flows (in this case day-use flows) that the park
receives. Cesario has defined a way of estimating
attractiveness measures using a deductive approach (see TN
4). So has Ross (see TN 2), although his measures are
ordinal.


TN 28 CHAPTER III page 196.


COMPARISON OF MEASURES 


The attractiveness measures as developed by Cheung and
Cesario represent two theoretical constructs designed to
determine the relative attractiveness of parks. Although the
measures were developed using different approaches, they
both were intended to produce a "formula" for calculating
park attractiveness which a park planner or model builder
could use to advantage.

It was recognized that with the Cesario attractiveness
measure available for Saskatchewan, it would be possible to
use the data to compare that measure's "performance" with
that of Cheung's. So Cesario's method was used to derive
attractiveness estimates for the twelve Saskatchewan parks
for which Cheung had made estimates (see TN 1). To do so, it
was necessary to apply Cesario's model (Equation 4 in its
logarithmic form) to the Saskatchewan main destination
day-use data used previously by Cheung. The Cheung and the
Cesario attractiveness values of the parks under study are
presented in Table 1.

It should be noted that the absolute values of the
Cheung or Cesario park attractiveness factors are not
important. Rather, it is the relative values that are
meaningful. For example, the relative attractiveness of Good
Spirit provincial park to that of Moose Mountain provincial
park is 0.68 (= 76.56/113.11) according to the Cheung
attractiveness scale and 0.62 (= 0.93/1.50) according to the
Cesario attractiveness scale. (The Cheung and the Cesario
attractiveness scales both have the properties of direction
and magnitude and are said to be interval scales.)
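
The two ratios in the example can be reproduced directly from the Table 1 values for the two parks. The dictionary and function names below are illustrative only.

```python
# Attractiveness values for the two parks, as given in Table 1.
cheung = {"Good Spirit": 76.56, "Moose Mountain": 113.11}
cesario = {"Good Spirit": 0.93, "Moose Mountain": 1.50}


def relative_attractiveness(scale, park, reference):
    """Ratio of one park's attractiveness to a reference park's."""
    return scale[park] / scale[reference]


r_cheung = relative_attractiveness(cheung, "Good Spirit", "Moose Mountain")
r_cesario = relative_attractiveness(cesario, "Good Spirit", "Moose Mountain")
print(round(r_cheung, 2), round(r_cesario, 2))  # 0.68 0.62
```

Because only ratios are compared, multiplying either scale by a constant would leave these figures unchanged.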

After the two sets of attractiveness values were
obtained, they were used in separate regression runs to
explain the number of visits from different origin areas to
the twelve Saskatchewan parks considered in this analysis.
Actually, three sets of regression runs using the
attractiveness values and other regressors were made. There
are, of course, an infinite number of regressions that could
be made containing the attractiveness factor. The three
following functional forms (Equations 2, 3, and 4) chosen in
this study are some of the commonly used functional forms.


(2)  E(V(i,j)) = C0 + (C1 P(i) + C2 P(i)A(i) + C3 T(k)
                 + C4)/g(D(i,j))

(3)  E(V(i,j)/P(i)) = C5 + (C6 A(i) + C7 T(k))/g(D(i,j))

(4)  E(log((V(i,j) + 1)/P(i))) = C8 + C9 log A(i)
                 + C10 log D(i,j) + C11 log T(k)


WHERE


T(k) = T(j) or T(f),
T(j) = the Cheung attractiveness factor,
T(f) = the Cesario attractiveness factor,
C0, ..., C11 = parameters to be estimated, and
E( ) = the expected value of the quantity in ( ).


TABLE 1


THE CHEUNG AND THE CESARIO ATTRACTIVENESS FACTORS OF
TWELVE SASKATCHEWAN PARKS*


Park               Cheung     Cesario

Buffalo Pound       96.12       0.55
Cypress Hill        45.26       2.02
Duck Mountain      126.40       1.52
Echo Valley        122.05       0.63
Good Spirit         76.56       0.93
Green Water         61.46       1.06
Prince Albert       88.75       1.46
Moose Mountain     113.11       1.50
Pike Lake           96.10       0.83
Rowan's Ravine      59.01       0.65
Battleford's       104.28       1.59
Besant              26.60       0.51


* The Pearson product moment correlation coefficient
between the two sets of factors is 0.28.


The equations derived were:


(5)  V(i,j) = 1.33 + (120.31P(i) - 36.60P(i)A(i)
              + 1.25T(j) - 104.56)/g(D(i,j))       (Cheung)

(5a) V(i,j) = 1.32 + (116.54P(i) - 34.06P(i)A(i)
              + 106.19T(f) - 118.08)/g(D(i,j))     (Cesario)

(6)  V(i,j)/P(i) = 0.07 + (0.97T(j)
              - 7.56A(i))/g(D(i,j))                (Cheung)

(6a) V(i,j)/P(i) = 0.11 + (70.38T(f)
              - 6.33A(i))/g(D(i,j))                (Cesario)

(7)  log((V(i,j) + 1)/P(i)) = 2.65 + 0.19 log T(j)
              - 1.83 log D(i,j) - 0.58 log A(i)    (Cheung)

(7a) log((V(i,j) + 1)/P(i)) = 3.10 + 1.02 log T(f)
              - 1.88 log D(i,j) - 0.33 log A(i)    (Cesario)


TABLE 2


STATISTICS ON THE REGRESSION COEFFICIENTS OF
EQUATION 5*


Regression                     Standard
Coefficient          Value     Error      F-Value     R²

constant              1.33
P(i)/g(D(i,j))      120.31      5.80      429.80    0.8416
1/g(D(i,j))        -104.56     27.30       14.63    0.9088
T(j)/g(D(i,j))        1.25      0.40        9.85    0.9048


* The F-value and the standard error of estimate of this
equation are 562.93 and 7.59 respectively, with 226 error
degrees of freedom. Also all regression coefficients are
significant at the one per cent probability level and all
have the expected signs.


In the first set of regressions (see Tables 2 and 3)
both the Cheung and Cesario attractiveness factors, when
combined with the explanatory variable defined by the
distance function g(D(i,j)), did almost equally poorly in
explaining the variance found in the Saskatchewan day-use
data. The reason may be that the regressors P(i)/g(D(i,j))
and P(i)A(i)/g(D(i,j)) explained so much variance (about 90
percent) in the day-use data that there was not much
variance left for the attractiveness regressor to explain.

In the second set of regressions (see Tables 4 and 5)
the Cheung attractiveness factor, when combined with the
distance function g(D(i,j)), explained 69 percent of the
total variance in the data. On the other hand, the Cesario
attractiveness factor, when combined with the same distance
function, explained 66 percent of the total variance. The
reason why the attractiveness regressors had so much
explanatory power when the dependent variable was defined as
participation rate is not clear. The author's conjecture is
that it may have something to do with the constant variance
assumption implicit in the ordinary least squares (OLS)
regression that was used in this study. Certainly, in the
Saskatchewan main destination day-use data, large flows were
associated with large population centers. Thus, when the
flows were "weighted" by the corresponding populations, the
homogeneous variance assumption of OLS was more nearly met
than when they were not weighted. (On this problem, see TN
19.)


TABLE 3


STATISTICS ON THE REGRESSION COEFFICIENTS OF
EQUATION 5a*


Regression                       Standard
Coefficient            Value     Error      F-Value     R²

constant                1.32
P(i)/g(D(i,j))        116.54      5.61      430.99    .8416
P(i)A(i)/g(D(i,j))    -34.06      3.02      127.62    .9031
1/g(D(i,j))          -118.08     31.42       14.13    .9049
T(f)/g(D(i,j))        106.19     34.13        9.68    .9089


* The F-value and the standard error of estimate of this
equation are 563.37 and 7.59 respectively, with 226 error
degrees of freedom. Also all regression coefficients are
significant at the one per cent probability level and all
have the expected signs.


It is seen from Tables 4 and 5 that the Cheung
attractiveness regressor, T(j)/g(D(i,j)), had a better
overall performance than the Cesario attractiveness
regressor, T(f)/g(D(i,j)), in terms of a larger ratio of the
regression coefficient to its standard error (.97/.06 =
16.17 versus 70.38/4.42 = 15.92), and a higher increase in
the R² (.6862 versus .6610). It is also seen that Equation 6
incorporating the Cheung attractiveness regressor has a
smaller standard error of estimate than Equation 6a
incorporating the Cesario attractiveness regressor.
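
The comparison above rests on two simple quantities, sketched below with the figures from Tables 4 and 5. The first function is the ratio of a coefficient to its standard error. The second uses the standard identity F = (R²/k) / ((1 - R²)/df) linking an equation's R² to its overall F-value. The source does not state this identity, so it is shown here only as a consistency check, and the function names are illustrative.

```python
def coefficient_ratio(coef, std_err):
    """Ratio of a regression coefficient to its standard error."""
    return coef / std_err


def overall_f(r_squared, n_regressors, error_df):
    """Overall F-value implied by R-squared, regressor count and error d.f."""
    return (r_squared / n_regressors) / ((1.0 - r_squared) / error_df)


cheung_ratio = coefficient_ratio(0.97, 0.06)    # Table 4, T(j)/g(D(i,j))
cesario_ratio = coefficient_ratio(70.38, 4.42)  # Table 5, T(f)/g(D(i,j))
print(round(cheung_ratio, 2), round(cesario_ratio, 2))  # 16.17 15.92

# Final R-squared values from Tables 4 and 5, two regressors, 228 error d.f.
f_cheung = overall_f(0.6947, 2, 228)
f_cesario = overall_f(0.6670, 2, 228)
print(f_cheung, f_cesario)
```

Both implied F-values fall within rounding distance of the 259.46 and 228.53 reported under the tables.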

In the third set of regressions the functional form of
the estimating equation was double-logarithmic. This time,
as seen from Tables 6 and 7, there was a decrease in
explanatory power of the regressor containing the Cheung
attractiveness factor and the Cheung attractiveness
regressor was not found to be significant. There was,
however, a marked improvement in the explanatory power of
the regressor incorporating the Cesario attractiveness
factor. This was to be expected, since the Cesario
attractiveness factor was derived using an equation that is
very similar to Equation 7a, the estimating equation.


TABLE 4


STATISTICS ON THE REGRESSION COEFFICIENTS
OF EQUATION 6*


Regression                   Standard
Coefficient        Value     Error      F-Value     R²

constant            0.07
T(j)/g(D(i,j))      0.97      0.06      297.22    .6862
A(i)/g(D(i,j))     -7.56      2.98        6.41    .6947


* The F-value and the standard error of estimate for this
equation are 259.46 and 1.32 respectively, with 228 error
degrees of freedom. Also all regression coefficients are
significant at the one per cent probability level and all
have the expected signs.


SUMMARY AND CONCLUSIONS 


In this paper two methodologies designed to measure the
attractiveness of a park have been evaluated. The Cheung
attractiveness measure was based on an ad hoc definition
procedure developed by an inductive approach so that an
overall rating of a site could be arrived at by considering
a set of site characteristics and services offered. The
Cesario attractiveness measure was a component of a
trip-making model for the number of trips made from an
origin to a destination, considering spatial separation. It
is estimated based on a function of the characteristics of
the origin, the characteristics of the destination, and the
spatial separation of the origin and the destination.
Cesario's measure of attractivity was thus described as
being defined deductively.

Using Equations 2, 3, and 4 to compare the two
attractiveness factors, and taking the increase in the R²
value as the main criterion for judging the performance of
the Cheung and Cesario attractiveness factors, it is
difficult to say whether one performs better than the other.
The regression results presented showed that the efficiency
of the attractiveness factors depended on the particular
form of the estimating equation used. One can see the
problem by noting that when the dependent variable was
defined as participation rate, the Cheung attractiveness
regressor, defined as T(j)/g(D(i,j)), slightly outperformed
the Cesario attractiveness regressor, defined as
T(f)/g(D(i,j)). However, when the dependent variable used
was log((V(i,j) + 1)/P(i)), the Cesario attractiveness
regressor, defined as log T(f), greatly outperformed the
Cheung attractiveness regressor, defined as log T(j), in the
sense that the former explained more than ten percent of the
total variance whereas the latter explained less than one
percent.


TABLE 5


STATISTICS ON THE REGRESSION COEFFICIENTS
OF EQUATION 6a*


Regression                   Standard
Coefficient        Value     Error      F-Value     R²

constant            0.11
T(f)/g(D(i,j))     70.38      4.42      253.71    .6610
A(i)/g(D(i,j))     -6.33      3.12        4.11    .6670


* The F-value and the standard error of estimate of this
equation are 228.53 and 1.38 respectively, with 228 error
degrees of freedom. Also all regression coefficients are
significant at the one per cent probability level and all
have the expected signs.


TABLE 6


STATISTICS ON THE REGRESSION COEFFICIENTS
OF EQUATION 7*


Regression                Standard
Coefficient     Value     Error      F-Value      R²

constant         2.65
log D(i,j)      -1.82      0.09      411.78    0.6385
log A(i)        -0.58      0.11       27.30    0.6791
log T(j)         0.19      0.14        1.84    0.6817


* The F-value and the standard error of estimate of this
equation are 162.06 and 0.39 respectively, with 227 error
degrees of freedom. Also all regression coefficients are
significant at the one per cent probability level except
that of log T(j), which is not significant, and all have
the expected signs.


TABLE 7


STATISTICS ON THE REGRESSION COEFFICIENTS
OF EQUATION 7a*


Regression                Standard
Coefficient     Value     Error      F-Value      R²

constant         3.10
log D(i,j)      -1.88      0.08      562.18     .6385
log A(i)        -0.33      0.10       10.05     .7389
log T(f)         1.02      0.13       64.30     .7500


* The F-value and the standard error of estimate of this
equation are 226.95 and 0.34 respectively, with 227 error
degrees of freedom. Also all regression coefficients are
significant at the one per cent probability level and all
have the expected signs.


CORD STUDY TECHNICAL NOTE 27
PERCEPTION OF QUALITY OF WILD RIVERS


P. Juurand, V. Guzelimian, and J. Beaman


ABSTRACT


A group in what was the Planning Division of Parks
Canada in 1972 carried out Wild Rivers Surveys to develop a
technique for designating rivers with National Wild River
potential. The surveys had the objective of developing an
input to the systematic selection of National Wild Rivers.
However, this project also resulted in the possibility of
learning something about how expert canoeists' ratings of
sites on wild rivers relate to the "resources" at those
sites.

Understanding how "resource" variables are related to
the perceived quality of a given segment of a river, a site,
was pursued by determining how well an average site score
for the 4 persons who rated each site was explained by each
of three models. The simplest model considered was a
multivariate regression model in which only interval
variables were used. The second model was the more general
ANOVA (analysis of variance) model, which allowed for
nominal variables and non-linear relations between each
resource variable and the dependent variable, site quality.
Finally, the most general model used was the Michigan AID
(Automatic Interaction Detection) model (the computer
program produces a "model"). The three analyses explained
38%, 64%, and 84% of the variance in average site quality
scores respectively.

Statistical tests are presented to show that the
improved results based on the different analyses of 212 Wild
River sites on which data were collected in 1972 did not
occur by chance. The conclusion is that the improvement from
model to model is unquestionably real.

Two types of implications of the analysis are pointed
out. The first type of conclusions are methodological. The
other has to do with the planning use of models. The
conclusions suggest very limited value in using "perceived"
site quality models for planning because of problems of site
use for what, by whom, under what conditions, etc. The paper
makes it clear that the comments do not refer to the merits
of developing and using "engineering" and "biological" land
capability models but to using models of human perception.


PURPOSE


This paper attempts to answer basic questions relating
to the use of models in order to show how resource variables
are related to the qualities assigned by a jury of expert
canoeists to specific sites on a number of Canada's Wild
Rivers.

From a social-psychological perspective the concern is
with, for a given type of site and user: (1) what variables
influence the decision that a site has a given quality, and
(2) how a person's decision on the quality of a site is
influenced by the ranked importance of physical, historical
and biological variables and in what manner the
combination of the judgments on each variable produces an
overall judgment of a site's quality.

From a mathematical point of view the question is
whether a simple linear model or a more elaborate model is
required to explain the Wild River site quality ratings.

From a parks planning viewpoint, the concern is: (1)
to demonstrate how to determine which resource variables are
important in the site quality rating, (2) to comment on
whether or not resource information (natural, historical and
biological) can be used in defining the quality of an area
that will be perceived by a given type of user, and (3) to
shed some light on whether or not site quality estimates can
be obtained with enough social-psychological and statistical
confidence that they can reasonably be used in planning.


THE PROBLEM 


Any discussion of the attractiveness of landscapes
raises the problem of how a person reacts to resource
variables when judging the quality of a site. To pursue but
one example, the Shafer et al. (1969) study (see Reference
19) indicates that people do not react to resource variables
in a linear way. Shafer's models, by their structure, imply
that they react independently to different resource
variables and that their responses to different variables
combine additively.

Do people in the real world react to situations by
mentally adding up the effect of each resource variable to
get a "composite" quality rating? Obviously, they do not do
it consciously, so it is necessary to consider the
possibility that when an individual reacts to a site, that
reaction is (firstly) not necessarily linear or curvilinear
and (secondly) not necessarily defined by independent
reactions to each of several resource variables.

From a mathematical perspective, the preceding ideas
can be dealt with by accepting that the perceived "quality"
of a site on a wild river (in relation to the use of the
river for a given purpose) is determined by an individual in
a rational, predictable and statistically reproducible way.
Accepting this statement acknowledges that the following
type of equation may be used to define the quality of a
site:


q(s) = F(X(1), X(2), X(3), ..., X(n)) + E(p)


WHERE

q(s) = quality of a particular site, s, as perceived
       by a given type of person, for a given purpose;
F( ) = some function;
X(i) = the score of site s on resource variable i; and
E(p) = a unique "error" related to variability in
       perception, with (p) being a subscript that
       refers to a particular rating by a given person
       on a given occasion.


Whether or not the rating of a site is made in a linear way
is then a matter of determining whether F( ) is a linear
function of X(1), X(2), etc. If decisions are made in
other ways, determining whether F( ) is a different kind of
function of the resource variables is another matter that
must be resolved.


THE DATA 


As shown in Figure 1, the rivers for which data were
collected in the 1972 Parks Canada Wild Rivers Survey were
those flowing through the Mackenzie Mountain section of the
Western Cordillera, the Barrenlands and Tundra Hills of the
Canadian Shield, the Boreal Uplands of Saskatchewan, the
Laurentian and east coast regions, and the rivers in the
Appalachian mountain system of Newfoundland. They were
studied by crews of two 2-man canoe teams. Four crews were
in the field in 1972.

The data were collected from sites on the rivers where
major or minor changes in the river and the river valley
environs were observed to take place. Major changes were
considered to be those where a lasting change took place
(e.g., from a V-shaped straight valley to a broad flood plain
with a meandering channel); minor changes were defined as
spot locations of scenic, historic or cultural interest.
(See the List of References for survey reports.)

A survey site was a 200- to 500-yard reach, or two to
three stream-widths of a river, that best illustrated one of
the following:


Upstream starting point
Change in water pattern: rapids to slow moving water
Change in water pattern: slow moving water to rapids
Change in valley: from flats to canyons
Change in valley: from canyons to flats
Change in river or valley caused by intersection
    with major river
Mouth of river
Points of historic interest
Major resource developments or townsites
Spectacular or scenic sites that do not fall into
    the above categories (e.g., waterfalls, a particularly
    attractive bend in a canyon, etc.)


When collecting data, crews were instructed not only to
note the reason for choosing an area as a sample site but
also to record the type of sample. Secondary and tertiary
reasons for choosing a site were also to be recorded. Crews
were also required to record the number of miles between
sites as an indication of the frequency of changes along the
course of the river.

The variables chosen to describe site characteristics
are shown in Figure 2. The codes for some of them are given
in the Appendix but other variable values are not shown
there because they can be read from the Inventory Coding
Form also given in Figure 2. Of all the variables listed on
the Inventory Coding Form, only stream order and sinuosity
were found to be interpreted inconsistently. They were not
coded and therefore do not appear in the Appendix.

One change between the 1971 Wild River Survey and the
1972 survey was the recording of site ratings. Each crew
member rated each site subjectively on a 10-point scale. The
average measure for a site is the dependent variable
describing site quality used in the analysis. Although
individual crew members had varying backgrounds and tastes,
all were expert canoeists with extensive experience in
wilderness and river environments. Thus their ratings, while
varying according to personal preference, can be expected to
reflect a homogeneity of judgment due to similar exposure
to, and interest in, extensive wilderness travel.


ANALYSIS 


In the following, three terms are used to describe data
analysis approaches. 'Simple regression' refers to a
"standard" regression analysis in which both the independent
and dependent variables are continuous (interval) and the
dependent variable is explained by a mean plus a sum of
regression coefficients times their respective variables.
'ANOVA' (analysis of variance) is used in a fairly well
accepted way to refer to an analysis where the independent
variables are nominal, having values such as married,
single, divorced, rather than being interval. (See
Reference 20.) In this analysis, "effects" for each value of
each variable are calculated (see Figure 2). A very simple
illustration of the kind of results obtained, using this
analysis method, is available in the CORD Study TNs No. 12
and 15. Finally, 'AID' (see Reference 21) is based on a
computer program that performs a search for a structure in
data in a more general way, using less restrictive
assumptions, than does ANOVA. (For an example of the use of
AID, see TN 4.) The authors are not concerned with the fact
that regression analysis can be employed to carry out an
analysis of variance by what is often called the dummy
variable approach. By 'simple regression' they mean that no
nominal variables or interaction effects are built into a
model. Similarly, by analysis of variance they mean
straightforward analysis of variance.

[Figure 2. Inventory Coding Form (NPC 512, 5-72). Fields
recoverable from the form: River Name; Site Name; Natural
Region; Stream Order; Sample Type; Mean Depth (feet);
Gradient (ft./mile); Water Temperature (°F); Width of Valley
Flat (feet); River Width at Low Flow and Bank Full (feet);
Drainage Area (sq. miles); Sinuosity; Flow Variability;
Channel Pattern; Stream Bed Material; Water Pattern; Flow
Level; Predominant Fluvial Process.]

Since use of linear regression analysis of variables
with more than two values requires that variables be
interval, only interval variables having a logical bearing
on landscape preference were selected for analysis. The
variables that could be used were correlated, and some of
those exhibiting high correlations with other variables (r ≥
.8) were eliminated by keeping only one variable of a set of
highly intercorrelated variables.
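
One way to carry out a screening of this kind is sketched below. The source does not say which member of a highly intercorrelated set was retained, so this sketch assumes the first variable listed is kept. The variable names and correlation values are hypothetical.

```python
def screen_correlated(names, corr, threshold=0.8):
    """Greedy screen: keep a variable only if its absolute correlation
    with every variable already kept is below the threshold."""
    kept = []
    for name in names:
        if all(abs(corr[frozenset((name, k))]) < threshold for k in kept):
            kept.append(name)
    return kept


# Hypothetical pairwise correlations among three interval variables.
corr = {
    frozenset(("gradient", "velocity")): 0.85,
    frozenset(("gradient", "mean_depth")): -0.30,
    frozenset(("velocity", "mean_depth")): -0.25,
}

selected = screen_correlated(["gradient", "velocity", "mean_depth"], corr)
print(selected)  # ['gradient', 'mean_depth']
```

The result depends on the order in which variables are listed, which is one reason such screening is usually guided by subject-matter judgment as well.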


After this screening a multiple regression analysis was
carried out. Using the average crew ratings as the dependent
variable and the "dimensions" of the environment listed
below, it was possible to determine relationships between
the quality measure and the selected environmental
variables. The regression model derived is given by the
following equation, for which standard errors in the
regression coefficients are shown in parentheses:


Y = 2.86 - 0.19 X10 - 0.18 X12 - 0.05 X22 - 0.08 X24
            (.17)      (.08)      (.15)      (.08)

    - 0.03 X25 - 0.51 X29 - 0.52 X33 - 0.02 X37 - 0.17 X39
      (.11)      (.14)      (.20)      (.01)      (.07)

    - 0.32 X54 - 0.05 X56 - 0.35 X58 - 0.46 X60 - 0.25 X61
      (.16)      (.10)      (.14)      (.11)      (.14)


WHERE: X10 is the mean depth of the river;
       X12 is the gradient of the river at the site;
       X22 is the velocity of the river;
       X24 is the coarseness of the stream bed material;
       X25 is the degree of turbulence on water's surface;
       X29 is the degree of artificial channel control
           (recognized as a questionable variable to be
           considered as interval);
       X33 is the angle between horizontal and highest
           visible point;
       X37 is the height above the river of the highest
           visible point;
       X39 is the angle between horizontal and highest
           visible point;
       X54 is the amount of algae (recognized as a
           questionable variable to be considered as
           interval);
       X56 is the number of mammals;
       X58 is the number of fish;
       X60 is the density of land flora; and
       X61 is the diversity of land flora.


The analysis of variance presented in Table 1 shows
that the regression resulted in a significant relationship,
with F = 10.22, which has a probability of less than .005
of occurring by chance. The R² value obtained was .38. The
F-test clearly indicates that the null hypothesis of no
relation must be rejected. But the R², while acceptable
mathematically, hardly suggests that the model is usable for
planning purposes. In descriptive terms, the low value of R²
means that a prediction of site quality has a high
probability of being very much in error (being high when it
should be low and vice versa).
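
The F-ratio and R² can be checked from the sums of squares and degrees of freedom reported in Table 1. The sketch below is illustrative only, and the helper name `anova_summary` is hypothetical. It assumes the usual definitions: mean square = sum of squares / degrees of freedom, F = regression mean square / residual mean square, and R² = regression sum of squares / total sum of squares.

```python
def anova_summary(ss_regression, df_regression, ss_residual, df_residual):
    """Mean squares, F-ratio and R-squared from an analysis of variance table."""
    ms_regression = ss_regression / df_regression
    ms_residual = ss_residual / df_residual
    f_ratio = ms_regression / ms_residual
    r_squared = ss_regression / (ss_regression + ss_residual)
    return ms_regression, ms_residual, f_ratio, r_squared

# Sums of squares and degrees of freedom as read from Table 1
ms_reg, ms_res, f_ratio, r_squared = anova_summary(415.667, 14, 653.46, 225)
print(f"MS regression = {ms_reg:.2f}, MS residual = {ms_res:.2f}")
print(f"F = {f_ratio:.2f}, R-squared = {r_squared:.2f}")
```

Under these assumptions the F-ratio reproduces the reported 10.22, and the R² computed from the table comes out close to the .38 quoted in the text.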


TABLE 1


ANALYSIS OF VARIANCE TABLE
FOR THE SIMPLE LINEAR REGRESSION


Source of      Degrees of     Sum of        Mean
Variation      Freedom        Squares       Square

Regression        14          415.667       29.69
Residual         225          653.46         2.90

F-Ratio = 10.22*


* significant at .005 level.


Turning to analysis of the data using analysis of
variance, it should be noted that the following equation was
assumed to be appropriate to explain site quality ratings:

(Quality of a River Site i) =


     General mean + gradient effect at site i + valley
     width effect appropriate to site i + bank width
     effect + velocity effect + stream bed material
     effect + water pattern effect + fluvial process
     effect + artificial control effect + access effect
     + land use effect + utilities effect + historic
     sites effect + height of highest point effect +
     vertical view confinement effect + horizontal view
     confinement effect + downstream view effect +
     upstream view effect + pollution effect + water
     colour effect + turbidity effect + floating
     material effect + algae effect + plants effect +
     flora effect + flora density effect + flora
     diversity effect + mean depth effect.


Expressed differently, Equation 2 shows that the effect
for each level of the variables that apply to a given site
is added to give a predicted quality as follows:


\[
Y(i) = U + B(1,i) + B(2,j) + B(3,k) + \cdots + B(L,m) + \cdots + B(61,n) \tag{2}
\]


WHERE Y(i)    = the quality rating of site i
      U       = the general level of site quality
      B(1,i)  = the effect of level i of variable 1
      B(2,j)  = the effect of level j of variable 2
      B(3,k)  = the effect of level k of variable 3
      B(L,m)  = the effect of level m of variable L
      B(61,n) = the effect of level n of variable 61.


A computer program was used to perform the generalized
analysis of variance. The R² value for the analysis was
.594. The B's for Equation 2 are listed in the Appendix,
which gives the names of all variables and the values that
they were allowed to take along with the constant U, the
general mean. The coefficients listed in the table are often
called the beta coefficients of a particular level or value
of a variable: in statistics one often uses "level", whereas
in the SPSS computer program one refers to variables and
their value tables.

Figures 3 through 5 were prepared by plotting the beta
values for each level of the variables used to explain
site quality. For example, looking at the turbidity
variable, one sees that the bar showing its value is close
to 0 for level 1, extends above 0 for level 2, and drops
below 0 for level 3. These results show that for a higher
turbidity level (2 rather than 1) the turbidity effect is
higher. However, when turbidity is 3, the turbidity effect
is lower than for a turbidity of 2. The respective effects
are .052, .320, and -.372 as indicated in the Appendix.

When the largest (positive) value for turbidity, .320,
is added to the general mean for quality, 6.167, the
perceived site quality score is 6.487. When the lowest beta
value for turbidity, -.372, is added, the score is 5.795.
Thus, recognizing that the mean quality score can vary from
1 to 10, it is apparent that little change in this score
results from considering the turbidity effect. So it is
reasonable to say that the variable has a small effect on
site quality scores, a point which becomes more meaningful
in a comparative sense. A relatively large change in the
general mean is associated with the variable "highest
point". A flat terrain should have a low beta value. Level 1
of highest point has a beta value of -.926 associated with
it; level 4 of highest point (the level indicates at least
one feature projecting high above the water level) has a
beta value of .412. When each of these scores is added to
the general mean, the two values obtained are 5.241 and
6.579 respectively. Thus it is seen that the highest point
variable has a much larger effect on site quality scores
than the turbidity variable.
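
The arithmetic in this comparison can be laid out as a short sketch. It is illustrative only and uses the general mean and beta values quoted in the text and the Appendix. The names `BETAS`, `shifted_score` and `effect_range` are hypothetical.

```python
GENERAL_MEAN = 6.167  # general mean of site quality, from the Appendix

# Beta values by level, as quoted in the text and the Appendix
BETAS = {
    "turbidity": {1: 0.052, 2: 0.320, 3: -0.372},
    "highest_point": {1: -0.926, 2: 0.253, 3: 0.261, 4: 0.412},
}

def shifted_score(variable, level):
    """General mean plus the effect of one level of one variable."""
    return round(GENERAL_MEAN + BETAS[variable][level], 3)

def effect_range(variable):
    """Spread between the largest and smallest effect of a variable."""
    values = BETAS[variable].values()
    return round(max(values) - min(values), 3)

for variable in BETAS:
    print(variable, effect_range(variable))
```

The spread for highest point is roughly twice that for turbidity. This is the comparative sense in which the text calls the turbidity effect small.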

In contrast to the foregoing analysis techniques (in
which regression coefficients or effects are calculated),
the AID technique is a multivariate method of analysis used
to classify data into homogeneous groups, called terminal
clusters, on the basis of the value of a dependent variable.
Given an interval-valued dependent variable and a specific
set of nominal (possibly ordinal) independent variables, an
AID analysis indicates (1) which independent variables may
be considered to explain most of the variance in the
dependent variable, and (2) which levels of independent
variables account for the variance explained.

Two AID analyses were performed on the Wild Rivers
data. The first run provided an analysis for the purpose of
illustration only. The second, with the same number of
degrees of freedom as ANOVA, was then performed to obtain
AID results comparable to the ANOVA results. (On
comparing AID and ANOVA, see TN 20.) The R²'s were
respectively .74 and .85. The former analysis produced
fifteen terminal groups, the latter produced forty-five.

Figure 6 presents the results of the AID run which
produced fifteen groups. The various steps of the analysis
resulted in the tree diagram shown. The first step was to
compute the mean site quality score of all 240 sites, in
this case M = 6.17. This parent group, Group 1, is indicated
by (1,240) at 6.17 on the site quality axis. The independent
variable that accounts for the most variance in the
dependent variable (site quality score) was then chosen.

Whenever AID is used, the independent variable that
accounts for the most variance is chosen on the basis of
variance explained in a dichotomous split. In this case the
original 240 observations in Group 1 were divided into
Groups 2 and 3 in such a way that there was the greatest
possible difference between their site quality scores in
terms of between-group variance. At the right-hand side of
Figure 6, at splitting level 1, the independent variable
"flora density" is shown because it accounts for the largest
amount of variance in site quality scores. When Group 1 was
divided on the basis of the flora density values, Groups 2
and 3 were formed. Group 2 contains observations with
levels 2, 3, 4 and 5 of flora density and Group 3 contains
level 1 (as indicated in the right-hand column of the
Figure). The tree diagram is read from top to bottom. The
bracket notation (X,Y) gives the variable on which a split
has taken place to form Group X, which contains Y
observations. By reading the X's, one may determine the
order of splitting and thus, in some sense, the independent
variables' relative power to explain variance. The position
of the (X,Y) for a group on the site quality scale indicates
the mean value of the dependent variable for Group X.
For example, as the Figure shows, Group 2 with a mean of 6.6
is made up of 187 of the original 240 observations.

When a further step of splitting is considered, the
largest amount of variance within groups that may be split
is taken as the criterion for selecting that group: the AID
program computes the amount of variance about the group mean
within each group that may be split. Referring again to
Figure 6, since Group 2 has more variance to be explained
than Group 3 (Groups 2 and 3 are the only candidates for
splitting at step two), it is divided to form Groups 4 and 5
on the variable "water pattern". This splitting is done the
same way as for Group 1: water pattern was the variable
that would explain the most variance in Group 2 based on
dividing the Group into two parts. Groups 3, 4 and 5 are now
candidates for further splitting. Group 5, having the most
variance to be explained, is split to form Groups 6 and 7
using the variable "water colour".

This process of splitting continues until a cut-off
point is reached. These points are predetermined criteria
that indicate that a particular Group available for
splitting should not be split. Termination of splitting
results in terminal groups. Group 4 underwent no further
splitting because of the cut-off rule, and thus it is a
terminal group. Other terminal groups are Groups 9, 10, 19,
22, etc., appearing at the ends of the branches of the AID
tree. (For details, see TN 4.)
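
A single splitting step of the kind described above can be sketched as follows. This is a simplified illustration and not the AID program itself. It assumes that, for each predictor, levels are ordered by their mean score and every cut in that ordering is tried, keeping the dichotomy with the largest between-group sum of squares. Cut-off rules are left out. The function names and the toy data are hypothetical.

```python
from collections import defaultdict

def best_binary_split(levels, scores):
    """Return (between_ss, left_levels, right_levels) for the best dichotomy of one predictor."""
    groups = defaultdict(list)
    for level, score in zip(levels, scores):
        groups[level].append(score)
    # Order the levels by their mean score, then try every cut in that order.
    ordered = sorted(groups, key=lambda lv: sum(groups[lv]) / len(groups[lv]))
    n_total, sum_total = len(scores), sum(scores)
    best = (0.0, None, None)
    n_left = sum_left = 0.0
    for cut in range(len(ordered) - 1):
        n_left += len(groups[ordered[cut]])
        sum_left += sum(groups[ordered[cut]])
        n_right, sum_right = n_total - n_left, sum_total - sum_left
        # Between-group sum of squares for this two-way split
        between = (sum_left ** 2 / n_left + sum_right ** 2 / n_right
                   - sum_total ** 2 / n_total)
        if between > best[0]:
            best = (between, ordered[:cut + 1], ordered[cut + 1:])
    return best

def choose_split(predictors, scores):
    """Pick the predictor whose best dichotomy explains the most variance."""
    candidates = ((name,) + best_binary_split(levels, scores)
                  for name, levels in predictors.items())
    return max(candidates, key=lambda item: item[1])

# Toy data (made up for illustration): six sites, two predictors
scores = [2, 3, 7, 8, 7, 9]
predictors = {
    "flora_density": [1, 1, 2, 3, 4, 5],
    "turbidity": [1, 2, 3, 1, 2, 3],
}
name, between, left, right = choose_split(predictors, scores)
print(name, between, left, right)
```

On the toy data the chosen split separates flora density level 1 from the remaining levels. Repeating the step on whichever resulting group has the most remaining variance would grow a tree in the manner described.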

An AID analysis of the Wild Rivers data produced Group
19, which has the lowest site quality score (1.4) in the AID
tree, and Group 4, which has the highest score (8.3). The
sites in these two groups had a similar water pattern of
torrent or waterfall (X25 = 5), but the variable "flora
density" made the difference between scores of 1.4 and 8.3
on the site quality scale. Group 19 has a thin flora density
while Group 4 has denser flora. Otherwise the sites in
clusters 4 and 19 are similar.

Figure 6 also indicates that fourteen of the
twenty-seven independent variables were significant in the
assessment of site quality in that they were used in forming
the AID tree. Furthermore, because of its position on the
tree, flora density may be considered a more prominent
variable than turbidity. To the extent that the canoeists'
aesthetic judgment is simulated by the AID trees, an
understanding of the elements of the decision-making process
is achieved and the AID tree lays out the logic behind the
canoeists' definition of site quality. For this reason, the
AID computer analysis may be said to replicate the tree of
logic that a canoeist follows in defining site quality on
wild rivers.


DISCUSSION 


Earlier, it was pointed out that only interval
variables having a logical bearing on landscape preference
were used in the simple linear regression analysis. It was
found that the interval variables of the linear regression
model could be used to describe a linear relationship
significant at the .005 level and explaining 38 percent of
the variance in the Wild River data. On the other hand, with
the ANOVA model, it was found that allowing for a
curvilinear relationship and for the inclusion of nominal
variables explained 59 percent of the variance in the site
ratings.

In both models, the fourteen and twenty-nine parameters
were estimated from 241 observations. Since the simple
linear model has an R² of .38 and ANOVA is a generalization
of the linear model, the ANOVA analysis has a larger R², as
expected. This R² of .59 indicates that by introducing
fifteen new parameters, 34 percent of the residual variance
was accounted for:


\[
\frac{R^2(\mathrm{ANOVA}) - R^2(\mathrm{LINEAR})}{1 - R^2(\mathrm{LINEAR})} \times 100\%
= \frac{.59 - .38}{1 - .38} \times 100\% = 34\%
\]


But is the increase in R² only due to increasing the number
of parameters estimated? The percent of the remaining
variance that can be expected to be explained by chance is
15/(241 - 14) = .066 (about 7%), which is:


\[
\text{\% of variance expected to be explained by chance}
= \frac{\text{No. of degrees of freedom introduced to explain remaining variance}}
       {\text{No. of degrees of freedom of variance that remains to be explained}}
\]
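
The two percentages being compared here can be reproduced with a few lines of arithmetic. The sketch below is illustrative only and uses the rounded R² values and parameter counts quoted in the text. The function names are hypothetical.

```python
def residual_variance_gain(r2_small, r2_large):
    """Share of the smaller model's unexplained variance that the larger model picks up."""
    return (r2_large - r2_small) / (1.0 - r2_small)

def chance_share(extra_params, n_obs, params_small):
    """Share of the remaining variance expected to be explained by chance alone."""
    return extra_params / (n_obs - params_small)

gain = residual_variance_gain(0.38, 0.59)   # linear versus ANOVA
chance = chance_share(15, 241, 14)          # fifteen added parameters

print(f"explained: {gain:.1%}, expected by chance: {chance:.1%}")
```

The first figure rounds to the 34 percent quoted and the second to about 7 percent.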


So the inclusion of an additional fifteen parameters would
be expected to explain approximately 7 percent of the
remaining variance by chance rather than the 34 percent
actually explained. But this could still occur by chance,
and so an F-test is useful to determine whether the increase
in variance explained is statistically significant. An
appropriate F-ratio to test the significance of the
increased variance explained is:


\[
F = \frac{\bigl(SSE(\mathrm{ANOVA}) - SSE(\mathrm{LINEAR})\bigr)/(B - A)}
         {RSS(\mathrm{ANOVA})/(241 - B)} = 7.21
\]


WHERE

A   = degrees of freedom in a linear regression model
B   = degrees of freedom in an ANOVA model
SSE = the sum of the squared deviations around the mean
      of the site quality score explained by a model
RSS = the sum of the squared deviations around the mean
      not explained by a model.


The F-value 7.21 is significant at the .01 level. Allowing
for nonlinearity by the inclusion of the fifteen nominal
variables in the analysis results in an explanatory power
significantly higher than chance would produce.
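
The F-ratio can also be written in terms of R², since SSE and RSS are both fractions of the same total sum of squares. The sketch below is illustrative only. It uses the rounded R² values from the text (.38 and .59), fourteen and twenty-nine parameters, and 241 observations. The function name `nested_f_ratio` is hypothetical.

```python
def nested_f_ratio(r2_small, r2_large, params_small, params_large, n_obs):
    """F-ratio for the gain in variance explained by the larger of two nested models."""
    extra_params = params_large - params_small
    gain_per_param = (r2_large - r2_small) / extra_params
    residual_per_df = (1.0 - r2_large) / (n_obs - params_large)
    return gain_per_param / residual_per_df

f_ratio = nested_f_ratio(0.38, 0.59, 14, 29, 241)
print(f"F = {f_ratio:.2f} with {29 - 14} and {241 - 29} degrees of freedom")
```

With these rounded inputs the ratio comes out a little above 7.2, in the neighbourhood of the value reported. Exact agreement would require the unrounded sums of squares.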

To summarize, the preceding discussion shows that one
model may appear to be better than another because (1) it
has more freedom to fit the same data, or (2) the better fit
has occurred only by chance. The first condition is caused
when a large number of parameters are estimated; the second
is the result of random error. But the 34 percent increase
in explanation when more parameters were introduced is
(statistically) significantly higher than the 7 percent
increase that would be expected by chance, and so both (1)
and (2) can be rejected as explanations and the ANOVA model
must be accepted as better (more structurally appropriate to
the data) than the simple regression model.

Similar considerations must be introduced in comparing
AID and ANOVA, but one interesting difference exists. The
AID model can be (and was) set up in such a way that it had
the same number of parameters as the ANOVA model. Using the
equation for increased variance explained, it is evident
that the AID model explained 63 percent of the variance that
was not explained using the ANOVA model:


\[
\frac{R^2(\mathrm{AID}) - R^2(\mathrm{ANOVA})}{1 - R^2(\mathrm{ANOVA})} \times 100\%
= \frac{.85 - .59}{1 - .59} \times 100\% = 63\%
\]


If, as before, one performs an F-test using the equation


\[
F = \frac{\bigl(SSE(\mathrm{AID}) - SSE(\mathrm{ANOVA})\bigr)/(B - A)}
         {RSS(\mathrm{ANOVA})/(241 - B)}
\]


a problem is encountered: the number of parameters in the
AID model is equal to the number of parameters in the ANOVA
model. Therefore 1/(B - A) involves division by zero. To avoid
this problem B - A can be arbitrarily set to 1 (or even 2 or
3 if there is a desire to make the test very conservative).
When this is done, the F-ratio is found to be 375.93, which
is significant at the .001 level with 1 and 212 degrees of
freedom. (For other comparisons of AID and ANOVA results,
refer to TN 20.) Thus the AID model fits the Wild River data
better than do either the ANOVA or linear models. And this
is true in a statistically highly significant way,
predicting a given site's attractiveness for a certain
purpose with more confidence than can be done with either of
the other two models.

The results not only show that AID is the best model,
ANOVA next and the simple linear poorest, but they confirm
the conclusions of previous investigations and add a number
of new dimensions. For example, the importance of vegetative
cover in the evaluation of landscape photographs is also
implied by the results obtained by Shafer (Reference 19),
Rabinowitz and Coughlin (Reference 17), and Calvin et al.
(Reference 2). Further comparisons involve too elaborate a
commentary to include here since the factors used in
describing landscapes differ from one study to another.

Probably more important than confirmation of past
findings is the fact that these studies allow researchers to
say something about the structure of the decision made when
a site is rated highly by a given individual. That the AID
model (which puts the reaction to a site in the context of a
collection of variables) is the most accurate model for
describing the perceiver's decision-making process is proof
that people do not react to individual resource variables
and then build some total score for a site. Acknowledging
the superiority of the AID model in explaining site
attractions adds a meaningful dimension to the study of site
attractivity. (Cesario used it to study park attractions in
TN 4.)

To elaborate on the methodological complications of the
preceding, there is often not enough care taken in
distinguishing different uses of general programs such as
the AID Program or regression programs. For example, it is
perfectly valid to use a linear regression program to obtain
the coefficients (parameters) that define a linear function,
regardless of the number of data points, as long as that
function is truly linear. Similarly, examination of the
clusters determined by the AID Program tells us something
about how the decisions that ranked various sites were made
by the canoeists, even though only 240 observations were
used. Group 11, for example, consisting of forty-five
observations, was split to form two groups with means 1.4
and 4.05. In this case it is claimed that the groups formed
are relatively homogeneous clusters having truly different
site quality scores. Some may argue that it is not
statistically sound to use AID analysis to split a group
containing forty-five out of 240 observations. However,
further statistical examination indicates that a real
structure has been found in the data: there is a difference
of almost three units between the means of the two newly
formed groups and there is a very low probability that this
occurred by chance.


Some of the final splits presented in the paper make it
questionable whether anything new was learned about the
rankings of sites. It is felt here that researchers need not
be bound by criteria that suggest that either information
must be available on 2,000 sites or AID must not be used.
The important point is that if a researcher decides not to
use AID simply because of a relatively small sample size,
and uses some linear technique such as ANOVA to look for
structure, he may find it with an R² of .64, which looks
good. But suppose that the reason for obtaining the model is
to use it in making predictions in a planning exercise? Even
an R² of .84 is not particularly good if one is to put much
faith in predictions. What is worse is that if the structure
of a model is not really appropriate to the data (as would
be the case with the ANOVA), predictions will be
systematically in error so that some types of good locations
may be regularly underrated and bad ones overrated. Even if
an analysis is used for strictly academic ends (except for
teaching purposes) it is futile to derive a simple
regression or an ANOVA model to explain data when these
models are not appropriate.

In sum, the superiority of the AID analysis means that
interactions between resources must be taken into account
when formulating models to explain the ratings of the Wild
River sites considered.


CONCLUSION 


Accepting the straightforward conclusion reached (that
the AID model is superior to the other models considered for
explaining perceived quality of sites), and if other research
supports these findings, then interaction must be considered
in developing meaningful models of how people react to their
environment in terms of assessing the attractiveness of a
given setting. But even the preceding statement fails to
emphasize the importance of quality or attractiveness "to
whom and for what purpose." It is unlikely that a wild river
canoeist will rate a site highly if the river has little
gradient: he will prefer the excitement of changing water
conditions. A fisherman who uses a motorboat, on the other
hand, may prefer a river without barriers. Thus there may be
homogeneity of response among wild river canoeists and large
variations across different river users. An individual's
reasons for being at a certain site determine to some extent
the rating of the site in terms of scenic value, and there is
no suggestion here that the ratings by the canoeists are
generalizable to a population of river users.

This has clear implications for park planners. Within a
parks planning framework, concern is with whether a site
quality measure defined by field survey crews can be
explained by resource variables for the sites studied (e.g.
natural, cultural, etc.). If site quality can be explained
with a high enough degree of accuracy by a function of a
number of resource variables, it is possible to predict the
quality of various areas by using only resource information.
The AID model can be used for predictions (see TN 4) and an
R² of .84 suggests that they would be quite good. Thus at
first sight it appears that there are planning applications
of the AID model worth investigating, particularly so in an
era when remote recording techniques can be used to produce
relatively inexpensive resource data that can be readily
processed by computers.

The "catch" is that the model derived gives quality
estimates for a particular type of user at a site for a
particular purpose. Quality values for a number of types of
users could, of course, be computed, but unless there is
almost total agreement between quality measures there is the
problem of how to get an overall intangible assessment. (See
TN 25.) Unfortunately a compromise quality may not satisfy
any of the users of a site. So it must seriously be asked
if it is only engineering-type assessments (for example,
impact on land due to use) that are worth generating using
computers.

Certainly some of the modelling techniques now in vogue
for capacity and impact analysis (ones which involve people
stating their model for the social capacity of an area) are
brought into serious question by the quantitative results
presented here. An ANOVA model has been shown to be
inappropriate to explain expert canoeists' perceptions of
site qualities on wild rivers. Even if an ANOVA model were
appropriate to assessing composite quality to a party that
is at the site for purpose X, why should it be appropriate
to explain the quality of day-use or camping areas? Why
should one suppose that Mr. or Mrs. Average Citizen or park
planners or managers are able to state the parameters of the
model they are using to assess quality?

It is the policy-maker's responsibility to decide
whether an agency's plans and policies for the
management of a specific resource should reflect only
current popular values and tastes or if (for example) they
should focus on conservation or be used to mould future
values by providing new environments, experiences, programs
and facilities. Perceptual studies can be useful for
marketing purposes, and can provide a basis for interpretive
programs, but they are not appropriate as the exclusive
vehicle for planning analysis. In an age of increasing
automation, it is important to realize that just because a
model can be developed that explains 84 percent of the
variance in site quality rating for one type of site for one
type of user, that is no reason to suggest that a planner
attuned to the objectives for a park is not a better
"vehicle" to use in planning the park. The planner who sees
merit in having the way he judges site qualities automated
should exercise great care that (1) different models are
developed in relation to sites having different purposes,
and (2) sophisticated models like the AID model are
used to reinforce planners' judgments in preference to ad
hoc model formulations in which he specifies the importance
of variables and their values and these are "plugged into" a
preconstructed equation that may be entirely inappropriate
to the way he makes decisions. It must always be remembered
that perceptions and preferences change. The planner must
have the flexibility to handle this change and, at times, to
influence it.


APPENDIX


ANALYSIS OF VARIANCE, VARIABLES, VALUES
AND ESTIMATED ANOVA PARAMETER VALUES FOR ATTRACTIVITY


General Mean = 6.167


VARIABLE                     LEVEL                            BETA

Gradient (Rate of drop)      1. 0 - 7                       -0.252
Feet per Mile                2. 8 - 132                      0.252

Width of Valley Flat         1. 1 - 1,295                    0.081
Feet                         2. 1,296 - 9,899                0.002
                             3. 10,000 - 31,999             -0.163

Bank, Full River             1. 35 - 322                     0.064
Width - feet                 2. 323 - 2,302                 -0.547
                             3. 2,303 - 8,834                0.048
                             4. 8,835 - 9,999                0.435

Velocity of Current          1. 0 - 2                        0.349
Feet per Second              2. 3 - 7                        0.173
                             3. 8 - 14                      -0.231

Stream Bed Material          1. Clay or silt                -0.372
at Edge of River             2. Organic sediment            -0.065
                             3. Gravel                       0.408
                             4. Cobbles                      0.020
                             5. Rocks or Boulders            0.009

Water Pattern                1. Smooth                      -0.459
of River at Site             2. Surges                      -0.098
                             3. Ripples                     -0.480
                             4. Chutes and rapids            0.248
                             5. Torrent or waterfall         0.788

Fluvial Process              1. Erosion                     -0.126
at Site                      2. Erosion & Deposition         0.029
                             3. Deposition                   0.697

Artificial Controls          1. Free and Natural             0.545
                             2. Present but unobtrusive
                                or dam                      -0.545

Accessibility                1. Trail, canoe, or plane       0.586
                             2. *                           -0.044
                             3. Logging Road
                                or shallow draught          -0.099
                             4. *                           -0.071
                             5. Highway or Steamer          -0.381
Land Use                     1. Wilderness
                             2. Pioneer Area
                             3. Resource or Urban
                             Betas legible in source: 0.119, 0.229

Utilities                    1. None
                             2. Infrequent, unobtrusive
                             3. Obstructed by utility
                             Betas legible in source: 1.200, -0.271

Historic Sites               1. None or Many                 0.225
                             2. Infrequent                   0.577
                             3. Unobtrusive or Many         -0.802

Height of Highest Point      1. 0 - 254                     -0.926
                             2. 255 - 898                    0.253
                             3. 899 - 2,498                  0.261
                             4. 2,499 - 9,950                0.412

Vertical View Confinement    1. 0 - [illegible]
                             2. [illegible] - 7
                             3. 8 - 23
                             4. 24 - 90
                             Betas legible in source: -0.225, 0.049, 0.410

Horizontal View Confinement  1. 0 - 14
                             2. 15 - 79
                             3. 80 - 254
                             4. 255 - 360
                             Betas legible in source: -0.262, [illegible], 0.410

Downstream View              1. 0 - 3,599                   -0.171
                             2. 3,600 - 29,583              -0.023
                             3. 29,854 - 31,999              0.193

Upstream View                1. 0 - 3,843                   -0.055
                             2. 3,844 - 31,999               0.055

Pollution                    1. None                         0.287
                             2. Very Evident                -0.287

Water Colour                 1. Colourless or White          0.425
                             2. Blue or Brown               -0.343
                             3. Green                       -0.082

Turbidity                    1. Clear                        0.053
                             2. Cloudy                       0.320
                             3. Turbid or Muddy             -0.372

Floating Material            1. None                        -0.014
                             2. Vegetation and/or Oil       -0.582
                             3. Foam                         0.596

Algae                        1. Absent                      -0.312
                             2. *                            0.081
                             3. Infested                     0.231
Plants at Site               1. Absent                       0.086
of River                     2. *                            0.282
                             3. Infested                    -0.368

Flora                        1. Tundra or Mix Woods         -0.310
Type of Woods                2. Spruce and Birch             0.228
                             3. Conifers and Hardwood        0.082

Flora Density                1. Thin                        -1.265
at Site                      2. *                           -0.160
                             3. *                            0.383
                             4. *                            0.398
                             5. Dense                        0.644

Flora Diversity              1. Small                       -0.819
at Site                      2. *                           -0.222
                             3. *                            0.154
                             4. *                            0.185
                             5. Great                        0.701

Mean Depth at River          1. [illegible]                 -0.084
Site - Feet                  2. 8 - 50                       0.084

* Note: Some of the levels appear to be missing - these
levels fall between the upper and lower level and are to
be read on an intuitive level.


REVIEW


Gordon Ewing 


The four papers in this chapter provide illustrations
of the three basic ways in which destination attractiveness
has been estimated in recreation literature and, for that
matter, in a wider literature in geography. In the Juurand
et al. paper on wild rivers, the estimate of a site's
attractiveness is the average of subjects' stated attraction
ratings. In Ross's paper on Attractivity Indices, a site's
attraction is considered to be revealed through a subject's
actual choices and rejections of that site when compared
with alternative sites. Thus these two papers exemplify the
distinction between a "stated degree of attractiveness" as
opposed to what the economist calls "revealed preference" as
the basis for estimating site attraction.

The third estimation method is illustrated by Cheung's
method, discussed in TN 9 and 28 (see also TN 1), where a
site's attraction is calculated from some concatenation of
the site's score on several variables, each weighted
according to its assumed importance in contributing to site
attraction. The variables used to calculate attraction, the
score assigned to a variable at a site and the weight
assigned to that variable in contributing to attraction are
all based to some degree on the researcher's or planner's
subjective judgement, usually bolstered to some extent by
background statistics, e.g. statistics on participation
rates in different activities.

The three categories of estimation can be defined as
based on subject behaviour (the Ross and Cesario methods),
subject opinion (the Juurand et al. method), and researcher
opinion (Cheung's method). And between the subject behaviour
methods of estimation there is a subtle distinction which
merits note: Ross's method explicitly includes information
on the circumstances under which a site is not visited by a
subject, as well as those under which it is visited, whereas
Cesario's model considers only data on the number of times a
site is visited.

As regards the three categories, it can be argued, in a
similar vein to an argument in the Supply Analysis review
(see Chapter IV), that each approach reflects a conscious
trade-off between validity and reliability of site
attraction estimates. The method based on researcher opinion
has the advantage of being standardizable. Referring for
example to Cheung's formula (Equation 1 in TN 9), S(e), the
relative popularity rating of activity e, could be based on
regional statistics indicating participation in that
activity, and in that sense a standardized set of S(e) values
could be used for a region. R(m), the relative importance
rating of facility m as it relates to a particular activity,
although subjective, could be a value agreed to either
nationally or regionally. Only Q(m), the quantity or quality
score of facility m, would be a measure dependent on the
local field worker. Consequently, the test-retest
reliability of the attractiveness measures so derived could
be relatively high if computations were standardized.
However, the validity of the measure must be in doubt if it
is argued that not all the variables relevant to people's
perception of site attractiveness have been included in the
calculation, or if coefficients or the way they are
concatenated do not accurately reflect the way users
estimate site attractiveness.

Juurand et al.'s paper moves a step closer to
satisfying the reservations raised above by estimating site
attractiveness from statements of a sample of users. But,
whilst their ratings are likely to contain more valid
information about the true attractiveness of a site (since
the users themselves are being polled), the test-retest
reliability of the results will depend on having a
sufficiently large sample, as well as depending on the
ability to find a similarly stratified sample for each site.
Inevitably, reliability is difficult (but not impossible) to
achieve using this method.

The validity of expecting a subject to be able to give
figures which accurately reflect the relative attractiveness
of various sites to him has often been questioned. Given
that attractiveness is an abstract notion and that people do
not necessarily "know" their subconscious evaluations of
things, any statement from a subject about his inner
feelings has an unknowable level of validity. The problem is
well summarized by Bertrand Russell (Analysis of Mind, 1921)
when he says that:


     The discovery of our own motives can only be made
     by the same process by which we discover other
     people's, namely the process of observing our
     actions and inferring the desire which would
     prompt them.


This argument provides the philosophical basis for the
paper by Ross as well as Cesario's model, described in TN @
and 28. Both researchers use information on actual recreation
trips to infer estimates of site attractiveness, obviating
the dependence on stated preference information. In
principle such data are likely to produce valid
attractiveness estimates since people's behaviour is
presumably an external reflection of what they actually
think about sites rather than what they say they think of
sites. In practice, as will be argued below, there are
certain problems in the methods used by Cesario and Ross
which raise questions about the validity of these particular
attractiveness estimation procedures. Moreover, the
reliability of the estimates obtained is statistically
undefined in the sense that it is not clear to what extent
sampling from a different distribution of origins would
affect the estimates of destination attractiveness.

Turning to the individual papers, Ross provided a good
example of an innovative attempt to extract site
attractiveness information from consumer behavioural data
without having to make assumptions about what variables
affect site attraction. This contrasts with trip
distribution modelling, which until recently had to
exogenously define surrogates of destination attraction. For
example, size of destination was often used as such a
surrogate in shopping trip and migration studies. Ross, by
contrast, directly estimates attractiveness without having
to assume it to be related to specific site variables.

The basis of the estimation procedure is that if a more
distant site j is visited rather than a closer site k, site
j is inferred to be more attractive than k; otherwise why
would the extra distance have been incurred to visit j
rather than k? The number of times j is inferred to be more
attractive than k (C(j,k)) relative to the number of times k
is inferred to be more attractive than j (C(k,j)) is the
basis of subsequent site attractiveness estimates. However,
spatial bias in the origin locations of respondents can bias
the above figures. Specifically, if there are fewer subjects
for whom k is the further of the two sites than there are
subjects for whom j is the further, then, ceteris paribus,
the odds are in favour of C(k,j) being less than C(j,k),
since there are fewer people who can choose k as a more
distant destination than j, compared to the number who can
choose j as a more distant destination than k. However, in
practice this biasing effect can be removed, and has been in
subsequent uses of Ross's procedure, by calculating:


(1) P(j,k) = (C(j,k)/N(j,k)) /
             ((C(j,k)/N(j,k)) + (C(k,j)/N(k,j)))

WHERE

P(j,k) = the proportion of times site j is inferred to be
         more attractive than site k;

C(j,k) = the number of times j is chosen by subjects whose
         origins are further from j than from k; and

N(j,k) = the number of subjects whose origins are further
         from j than from k.


RATHER THAN:


(2) P(j,k) = C(j,k)/(C(j,k) + C(k,j))
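
The difference between Equations 1 and 2 can be illustrated
with a short Python sketch. The counts below are invented
purely for illustration and the function names are
hypothetical; they are not taken from Ross's paper.

```python
def p_unweighted(c_jk, c_kj):
    """Equation 2: share of inferences favouring j, ignoring origin counts."""
    return c_jk / (c_jk + c_kj)


def p_weighted(c_jk, n_jk, c_kj, n_kj):
    """Equation 1: each count is first divided by the number of subjects
    who could have produced it (those living further from the chosen site)."""
    rate_jk = c_jk / n_jk
    rate_kj = c_kj / n_kj
    return rate_jk / (rate_jk + rate_kj)


# Hypothetical counts: 400 subjects live further from j than from k and
# 80 of them visit j; only 100 live further from k and 20 of them visit k.
c_jk, n_jk = 80, 400
c_kj, n_kj = 20, 100

print(p_unweighted(c_jk, c_kj))            # 0.8
print(p_weighted(c_jk, n_jk, c_kj, n_kj))  # 0.5
```

In this invented case Equation 2 favours j four to one,
whereas Equation 1 rates the two sites equally, because four
times as many subjects were in a position to reveal a
preference for j.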


A second, more subtle spatial bias in origin locations
can also affect the estimate of P(j,k) and hence the
attractiveness estimates of j and k. Consider only the set
of origins, i, of subjects who visited site j or k, and
divide the pairs of D(i,j) and D(i,k) measures into two
groups. Group 1 contains all D(i,j) and their associated
D(i,k) where the subject visited j when D(i,j) > D(i,k), and
group 2 contains all D(i,j) and D(i,k) where the subject
visited k when D(i,k) > D(i,j). If, to take an extreme
example, the D(i,k)'s in the first group were only
marginally smaller than their associated D(i,j)'s, we cannot
tell how much less j would have been chosen, and therefore
how much C(j,k) and C(j,k)/N(j,k) would decline, if the
D(i,k)'s in that group had been much less than the D(i,j)'s
rather than just marginally less. But in general it is true
that as D(i,k) diminishes relative to D(i,j), so C(j,k) and
C(j,k)/N(j,k) also decline, unless j is infinitely more
attractive than k. Therefore, one can conclude that the
values of C(j,k) and C(k,j) are dependent on the distances
to the further, but chosen, site compared to the nearer
rejected sites. Only if these pairs of distances were
similarly distributed for the group of subjects choosing j
when it was the more distant site and for the group choosing
k when it was the more distant site, would there be no
spatial bias in the estimation of P(j,k). Whilst this
condition can be tested for, it is not clear that any
practical remedial action can be taken to eliminate the
effect. And if the condition is common, it may well have a
serious effect on estimates of site attractiveness.
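
One way the condition might be checked is sketched below in
Python. It is only an illustration under stated assumptions:
each pair of distances is reduced to the ratio of the nearer
(rejected) to the further (chosen) distance, and the two
groups are compared with a two-sample Kolmogorov-Smirnov
statistic. Neither the reduction to a ratio nor the choice
of statistic comes from the papers reviewed, and the
distances are invented.

```python
def ecdf(sample, x):
    """Empirical cumulative distribution of `sample` evaluated at x."""
    return sum(1 for v in sample if v <= x) / len(sample)


def ks_statistic(a, b):
    """Largest gap between the two empirical distribution functions."""
    return max(abs(ecdf(a, x) - ecdf(b, x)) for x in sorted(set(a) | set(b)))


def nearer_to_further_ratios(pairs):
    """pairs: (distance to chosen further site, distance to rejected nearer site)."""
    return [nearer / further for further, nearer in pairs]


# Hypothetical distances in miles, invented for illustration.
group1 = [(50, 48), (60, 57), (40, 39), (80, 76)]  # chose j, the further site
group2 = [(50, 20), (60, 30), (40, 10), (80, 35)]  # chose k, the further site

d = ks_statistic(nearer_to_further_ratios(group1),
                 nearer_to_further_ratios(group2))
print(d)  # 1.0: the two sets of ratios do not overlap at all
```

A statistic near zero would suggest the two groups faced
similar distance trade-offs; a value near one, as in this
contrived case, would indicate the spatial bias described
above.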

Furthermore, the discussion above is based on the
assumption that there are no inter-personal differences in
users' perceptions of a park's attractiveness, other than
random differences. If the park has different attractiveness
to different users (close or far, young or old, rich or
poor, weekend or weekday), then the aggregate measure
derived, say, for day-users is a reflection of the
composition of the stream of visitors that come to the site.
Actually, a site's weekday attractiveness for nearby
day-users may be high while its weekend attractiveness may
be nil unless users are under a time constraint.
Attractiveness to whom, from where, and visiting for what
purpose must be a matter of major concern in future work.

From another perspective, given that the site
attractiveness measures obtained by Ross are ordinal,
problems arise in either explaining ordinal scores in terms
of site variable scores or using ordinal attractiveness
scores as part of a trip distribution modelling effort. In
the former case, if the ordinal scores are assumed to have
interval properties, a procedure with weak assumptions about
the distributional characteristics of variables, such as the
A.I.D. technique used in the paper on wild rivers, may be
used to explain attractiveness scores in terms of site
variables. In the case of trip distribution modelling,
however, it would be difficult to justify the assumption of
interval properties, in light of the strongly metric
assumptions of most trip distribution models. As a
consequence, it is probable that these attractiveness scores
cannot be used for that purpose.

The papers by Beaman and Cheung are concerned with the
variability of site attractiveness measures that results
from using different estimation techniques. Beaman compares
the results obtained by Cesario, Cheung and Ross for the
same set of twelve Saskatchewan parks. In observing higher
correlations between Cesario's and Ross's results, the
suggestion is made that attractiveness measures based on
behavioural data cannot help but include the influence of
the surrounding area on the estimate of attractiveness of a
particular park. By contrast, Cheung's measure clearly
relates only to characteristics of the site in question. To
test this hypothesis, a regression equation of the following
form is solved:


(3) T(c)/T(f) = C0 - C1 A(c)

WHERE

T(c) = a park's attractiveness as estimated by Cheung;

T(f) = the same park's attractiveness as estimated by
       Cesario; and

A(c) = the number of alternative sites within 100 miles
       of the park under consideration.

A statistically significant R² = .30 is considered to
support the above hypothesis, and new estimates of T(f),
based on the above regression solution, are obtained as
follows:


(4) T(f) = T(c)/(.27 A(c) - 1.25)


The fact that the correlation between T(f) and the new
estimates turns out to be less than that between T(c) and
T(f) occasions surprise and is left unexplained. However, it
should be remarked that in Equation 4, for any particular
value of T(c), the estimate of T(f) is a discontinuous
function of A(c). Specifically, the estimate is a decreasing
function of A(c) and takes on negative values for values of
A(c) between 1 and 4, but takes on only positive values for
A(c) of 5 or more, although still a decreasing function of
A(c). The size of the discontinuity between A(c) = 4 and
A(c) = 5 depends on the value of T(c). This discontinuity
may explain why the correlation between T(f) and the new
estimates is poorer than that between T(f) and T(c).
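
The behaviour described can be checked numerically. The
Python sketch below reads the coefficients of Equation 4 as
.27 and 1.25 and uses a hypothetical T(c) of 10; the
function name is illustrative only.

```python
def t_f_estimate(t_c, a_c):
    """Equation 4: estimate of T(f) from T(c) and the number of
    alternative sites A(c) within 100 miles."""
    return t_c / (0.27 * a_c - 1.25)


t_c = 10.0  # hypothetical Cheung attractiveness score
for a_c in range(1, 9):
    print(a_c, round(t_f_estimate(t_c, a_c), 2))
```

The denominator changes sign between A(c) = 4 and A(c) = 5,
so the printed estimates fall through increasingly large
negative values, jump to a large positive value at A(c) = 5,
and then decline again.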

In Cheung's paper comparing his own and Cesario's park
attractiveness estimates, the concern is to determine which
set of estimates better predicts trip flow (V(i,j)). In
Cesario's original paper, attractiveness estimates A(j) were
obtained by calibrating a model of the form:


(5) V(i,j) = K E(i) A(j) F(C(i,j)) e(i,j)


where the terms are as defined in Equation 4 of Cheung's
paper. Cheung then takes the A(j) estimates obtained in one
particularly specified model and uses them as an independent
variable to predict V(i,j) in three models specified quite
differently from Equation 5 (see Equations 8a, 9a and 10a in
Cheung's paper). He offers no explanation for having done
so, and the transplantation should be examined closely. For
example, if Cesario's original equation was properly
specified and the others improperly, then it would be
clearly invalid to say the A(j) estimates performed poorly
in terms of predicting V(i,j) by testing their performance
in an improperly specified equation. If, on the other hand,
Cesario's equation was not properly specified, it is
difficult to see what can be proven by taking his estimates
from that equation and testing their predictive ability in
what may just as well be another improperly specified model.
In general, it is invalid to take estimates of A(j) obtained
in one equation predicting a given variable, in this case
V(i,j), and to test the predictive ability of the same A(j)
estimates in a differently structured equation with the same
dependent variable, V(i,j). Therefore, any inferences made
from comparisons of the predictive abilities of Cheung's and
Cesario's attractiveness estimates are of doubtful validity.

Scrutiny of the least-squares estimates in regression
Equations 8 through 10a in Cheung's paper also raises the
question of whether the standardized regression coefficients
in each pair of equations are statistically significantly
different. Certainly the unstandardized coefficients
provided in Tables 2 through 7 are remarkably similar,
except where the scale of magnitude of T(c) values relative
to T(f) values affects their regression coefficient values
(Cheung identifies Cesario's A(j) estimates as T(f) and his
own as T(c)). In addition, it is really only in Equations 10
and 10a that the independent effects of T(c) and T(f) can be
judged. In Equations 8 and 8a the explanation of variance in
V(i,j) is dominated by the effect of origin population
(P(i)), and T(f)/g(D(i,j)) and T(c)/g(D(i,j)) contribute only
.4% and .2% respectively to the total variance explained. In
Equations 9 and 9a the independent variables both have
distance components embedded within them, so that it is
impossible to tell how much of the 686% of variance in
V(i,j)/Pop(i) explained by T(f)/g(D(i,j)) is attributable to
g(D(i,j)) and how much to T(f). Only in Equations 10 and 10a
are the independent effects of log T(c) and log T(f) on log
(V(i,j)/Pop(i)) measurable, and it is notable that in both
equations log D(i,j) dominates in explaining 64% of
variance, and log T(c) and log T(f) explain only .2% and 121%
of variance respectively. This suggests that in Equations 9
and 9a perhaps g(D(i,j)) is the main explanatory variable
and that the T(c) and T(f) terms have only a small random
effect on the level of prediction of V(i,j)/Pop(i).

It is difficult to conclude from the above either that
T(f) and T(c) have any appreciable effect on the variance of
the three dependent variables or that the small influence
they may have differs markedly between T(f) and T(c). Given
this and the question of whether the procedure used to test
the predictive ability of the T(f) values is valid, one is
led to conclude that nothing meaningful has been said about
the relative predictive capacities of the Cheung and Cesario
attractiveness measures.

Turning to the question of whether there is any
effective way to compare the predictive abilities of two
differently derived sets of attractiveness scores, it is
difficult to see how an effective comparison could be made
in this case. Cesario's estimates are parameters estimated
by fitting an equation to the very interaction data to be
predicted, whilst Cheung's estimates have much less
flexibility, in the sense that they are based on a
predetermined formula with no free parameters. It would
therefore be invalid to take Cheung's T(c) values and use
them in Cesario's model (Equation 5), since there would be
fewer free parameters in this case than in Cesario's case,
where the T(f) values are free parameters to be estimated.
The conclusion is that it is perhaps a vain
oversimplification to hope to make direct comparisons of
this kind, where the estimating procedures are so radically
different.

The final paper in Chapter III, on the perception of the
quality of wild rivers, unlike the others, is not so much
concerned with methods of estimating site attractiveness as
with the explanation of these estimates, however obtained,
in terms of site characteristics. In particular, it
illustrates three different mathematical models which
explain perceived site attractiveness scores in terms of
perceived site characteristics. The less rigid the
assumptions of the model, the more variance in the dependent
variable is explained. The multiple regression model assumes
the dependent variable to be a linear and additive function
of the independent site characteristic variables and
achieves an R² of .38. By contrast, a generalized analysis
of variance model, though additive, allows independent
variables to be defined categorically rather than
continuously and obtains an R² of .59. Finally, A.I.D., with
the added facility of allowing interaction effects between
independent variables, gives an R² of 0.84. The latter would
seem particularly useful for planning in cases where as weak
assumptions as possible must be made about data, and where
subtle relationships between variables are thought to exist.
Inevitably the price of this greater flexibility is a
reduction in the ease of interpreting results in the form of
an A.I.D. tree diagram.


CHAPTER IV


ORIGIN MODELS 


INTRODUCTION 


Knetsch, in line with his general plan for the CORD
Study (see Chapter I), suggested that work done by
Cicchetti, Seneca and Davidson in the United States should
form the basis of a procedure for analyzing CORD Study
National Survey data collected on the level of people's
participation in outdoor activities. However, after some
preliminary work, this analysis approach was not used
because examination of the data indicated that this kind of
analysis simply could not be carried out in 1971, owing to
the need for certain accurate supply information. Instead, a
proposal by Hendry (see Reference 8) for a "dummy variable
analysis" of the National Survey data was pursued, so as to
allow the socio-economic variable effects on participation
to be calculated, even though some of those variables were
nominal. TN 12, "Analysis of Variance as a Tool for
Estimating Participation in Outdoor Recreation Activities",
presents the version of the model proposed by Hendry that
was eventually used. It goes beyond calculating "effects"
of variables by describing procedures for estimating the
number of people that a city will generate as participants
in a given activity, and indicates how parameters calculated
on the basis of behaviour reported in a National Survey can
be used to estimate the amount of participation in, for
example, hunting in Quebec.

However, when estimates are made using the model
proposed, questions arise about the validity of procedures
proposed for the use of Analysis of Variance in estimating
participation in outdoor activities. Some of these questions
relate to three considerations: (1) the effect of supply on
participation, (2) the accuracy of estimates made, and (3)
the structural deficiencies of models developed. Issues
related to supply influences are taken up in the chapter on
Supply Analysis. In this chapter there are papers on the
accuracy of estimates of participation and structural
deficiencies of models. Matters not raised above are
numerous. For example, there has been concern about the
accuracy of origin models related to the low magnitude of R²
found when models are parameterized. This topic and some
related concerns are taken up in TN 36 in Chapter VII.




Other problems relate to the stability of model
coefficients over time. TN 13, in Chapter IX, deals
specifically with projections. In it the author sets up a
"context of validity" for the projection procedure described
in TN 12 and thus further clarifies methodological issues
related to using origin models.


CORD STUDY TECHNICAL NOTE 12


ANALYSIS OF VARIANCE AS A TOOL FOR ESTIMATING 
PARTICIPATION IN OUTDOOR RECREATION ACTIVITIES 


S. Rousseau, J. Beaman, M. Renoux, J. Hendry


ABSTRACT 


The use of analysis of variance in attempting to arrive
at an understanding of participation in outdoor recreation
is probably best known from its use in the Mueller and
Gurin volume of the ORRRC report. There the authors
discussed how the estimated effects of socio-economic
characteristics on people's participation provided insight
into recreational behaviour that could not have been
obtained by merely tabulating data.

This paper goes beyond that and deals with applying
analysis of variance to data pertaining to a certain
recreation activity to estimate participation in that
activity by people in a given geographic area, conditional
on the socio-economic characteristics of these people.

Specific formulae for making estimates are given in the
paper and their use is illustrated by making predictions of
the number of hunters and total hunting trips by Quebec
residents.

The paper concludes with a discussion pointing out some
difficulties encountered in using the methods of estimation
described. The reasons for having alternative methods for
estimating total volume of activity (total hunting trips)
are an important topic taken up in this section of the
paper.

There are references in the paper to a number of papers
in which further results have been presented, such as (1)
when such a model should be used, (2) accuracy of results,
(3) structural problems with the models derived, and (4)
the value R² should have.


INTRODUCTION 


As early as 1961 there was a paper produced which
presented the results of analysis of variance on how having
different levels of income or belonging to a particular
socio-economic category influenced a person's participation
in outdoor activities. In that study by Mueller and Gurin
(see Reference 9), the authors went so far as to recognize
that the same model of how participation related to
socio-economic variables was not appropriate for both males
and females because of what are known as interaction
effects. Other work has influenced the production of the
models presented here. From within the CORD Study, one
influence was the proposal by Hendry (see Reference 8) that
CORD Study National Survey data should be processed by a
dummy variable analysis; this is the economist's way of
saying that a variant of the kind of analysis described here
should be undertaken. Knetsch also made a proposal that CORD
Study National Survey analysis should follow a strategy that
was laid out by Cicchetti, Seneca and Davidson. (See Chapter
I.) So, in the history of recreation research and in the
history of the CORD Study there are suggestions that a model
that may be expressed in words as follows should be used in
analyzing people's outdoor recreation participation:


Equation 1 (Form 1):


Probability of participation or frequency of participation
for a person with socio-economic characteristics

    = a general participation level
    + the effect of being in a city, in a town or in the
      country
    + the effect of being a member of a certain size
      household
    + etc.

In mathematical terms the equation is:

Y(i,J,K,L, etc.) = U + B(1,J) + B(2,K) + B(3,L) + ... + C(i)


WHERE Y(.) is 0 or 1 for nonparticipation or participation,
      or for a frequency model is the actual number of times
      that a person participated; i, J, K, L, ..., the
      subscripts of Y(.), give information about the person i
      who has level J of a first socio-economic variable
      (e.g. in Figure 1 comes from a household of some
      size), who has level L of a third socio-economic
      variable (e.g. education level), etc.;


U is a general level that applies to all persons (from Table
      1 for participation by male hunters for 1972(2) it is
      .234);


B(1,J) is the effect on Y(.) of having level J of
      socio-economic variable 1 (under 1972(2) of Table 1,
      B(1,2) = .006);


B(2,K) is the effect on Y of having level K of
      socio-economic variable 2 (e.g. in Table 1 for
      household size for 1972(2) the effect is -.002);


B(3,L) is the effect on Y of having level L of
      socio-economic variable 3;


B(.)'s with first subscripts up to 9 would be necessary to
      define all the effects shown in Figure 1 and given in
      Table 1;


C(i) is an error term that has a value equal to the
      difference between the observed Y for person i and his
      predicted Y (this is illustrated subsequently).
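
To make the roles of U, B(.) and C(i) concrete, the Python
sketch below fits the additive form of Equation 1 to a tiny
invented sample with two socio-economic variables. It is an
illustration only: the data, level names and function names
are hypothetical, and the shortcut used (level mean minus
general level) matches a least-squares fit only because the
invented sample is balanced, which survey data generally are
not.

```python
# Hypothetical balanced sample: every combination of city-size level and
# age level is observed twice; y is 1 for a hunter, 0 otherwise.
observations = [
    # (city level, age level, y)
    ("city", "young", 0), ("city", "young", 1),
    ("city", "old", 0), ("city", "old", 0),
    ("rural", "young", 1), ("rural", "young", 1),
    ("rural", "old", 1), ("rural", "old", 0),
]


def mean(values):
    values = list(values)
    return sum(values) / len(values)


# U: the general level that applies to all persons.
u = mean(y for _, _, y in observations)

# B(v, level): mean of Y at that level minus the general level.
# This shortcut equals the least-squares effect only for balanced data.
b_city = {lvl: mean(y for c, _, y in observations if c == lvl) - u
          for lvl in ("city", "rural")}
b_age = {lvl: mean(y for _, a, y in observations if a == lvl) - u
         for lvl in ("young", "old")}


def predict(city, age):
    """Right-hand side of Equation 1 without the error term."""
    return u + b_city[city] + b_age[age]


# C(i): observed Y minus predicted Y for each person.
residuals = [y - predict(c, a) for c, a, y in observations]
print(u, b_city, b_age)
```

Each person's observed Y is then the prediction for his or
her combination of levels plus the residual C(i).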


Having referred to Table 1 and Figure 1, it seems
appropriate to give some general explanation about these and
the related Tables 2 through 4 and Figure 2. These are from
a larger document that was originally to be an appendix to
TN 12, Rousseau's "appendix". The Table of Contents of this
larger document, which was prepared by Rousseau, one of the
authors of this paper, is now all that is included as an
appendix.

In Rousseau's original "appendix", all results of the
many possible first order effect analyses of people's
participation and frequency of participation that could be
carried out on National Survey information available in
1974 were presented in figures and tables. In 1974 it was
only possible to process 1969 and 1972 CORD Study National
Survey data. The 1969 data are for persons 18 years of age
and over and the 1972 data are for people 10 years of age
and over. Thus the three options arose of processing 1969
data, 1972 data for persons 18+, and 1972 data for 10+ to
obtain the three somewhat different sets of coefficients
shown in the figures and recorded in the tables as 1969,
1972(1) and 1972(2). Subsequently it has become possible to
process CORD Study National Survey 1967 data on people's
participation in outdoor activities. But if one examines the
results presented here on hunting participation and hunting
frequency, one will note that there is generally good
agreement between the 1969 and 1972 differentials for both
models. So 1967 analyses have not been carried out, because
it was considered that the work involved in incorporating
these data into the existing summary information would not
be justified by any benefits that might be derived by
showing the similarity of 1967 results to those for 1969 and
1972.

As for the possibility of discerning a trend in the 
coefficients, one need only look at the standard deviations 
of the coefficients to see that one need have no great 
expectations that any kind of reliable trends in 
coefficients can be discerned based on the survey results 
available.

The preceding comments may have brought a particular
matter of concern to the attention of some readers. What is
the meaning, in some practical sense, of the table? Before
there is any merit in being concerned about variation in
these coefficients, obviously there must be a realization of
what information the coefficients themselves put across. To
see specifically what the coefficients indicate, an example
is useful. Assume one wants to predict the probability of
being a hunter in 1969 for a person who in 1969 was (1)
male, (2) from a city of over 100,000, (3) married, (4) from
a family of size three or four, (5) with some high school
education, (6) in the age group 30 to 39, (7) with an income


TABLE 1


ESTIMATED PARAMETER VALUES FOR HUNTING PARTICIPATION


General means for Males

1969       .219
1972(1)    ...
1972(2)    .234


General means for Females       ...


CITY SIZE       EFFECTS FOR MALES       EFFECTS FOR FEMALES

         Labels                       1969  1972(1)  1972(2)

B(1,1)   Over 100,000                 -.091  -.144
B(1,2)   30,000-100,000               -.022  ...  .006
B(1,3)   10,000-30,000                ...  .090
B(1,4)   1,000-10,000                 ...
B(1,5)   Rural                        .106

PERSONS IN HOUSEHOLD

B(2,1)   One
B(2,2)   Two
B(2,3)   Three or Four
B(2,4)   Five
B(2,5)   Six or More

EDUCATION

B(3,1)   Public School, Refuse        ...
B(3,2)   Some High School             ...  .043  .064
B(3,3)   High School Grad. or More    ...

AGE

B(4,1)   10 to 17
B(4,2)   18 to 29
B(4,3)   30 to 39
B(4,4)   40 and Over

INCOME

B(5,1)   Refuse, Don't Know
B(5,2)   Less than $2,999
B(5,3)   $3,000 to $5,999
B(5,4)   $6,000 to $10,499
B(5,5)   $10,500 or More

MARITAL STATUS

B(6,1)   Single
B(6,2)   Married
B(6,3)   Other

POSITION IN HOUSEHOLD

B(7,1)   Head (Male or Female)
B(7,2)   Son or Daughter
B(7,3)   Other

CHILDREN UNDER 5

B(8,1)   None                         -.004  -.002
B(8,2)   Some                         .016  .002

HOUSING

B(9,1)   Single
B(9,2)   Row, Duplex, Multi
B(9,3)   Apartments


(1) 18 years of age and over
(2) 10 years of age and older

* The 1969 and 1972 frequency figures are not comparable
  because frequency categories are not identical for both
  years.


FIGURE 1

BETA VALUES FOR HUNTING PARTICIPATION
IN 1969, 1972(1) AND 1972(2)

of $6,000 to $10,499 in 1969, (8) the head of the household,
and (9) with no children under 5 at his home, (10) which was
a single family dwelling. To determine this probability
based on the 1969 survey, one simply takes the general mean
for males of .219 given in Table 1 and adds to it the
relevant increments or beta values from the first column,
the column under 1969 of Table 1, to obtain the equation
below, which indicates that the probability of participating
is

.219 - .091 + .011 + .016 + .011 + .072 + .031 - .01 - .016
+ .012 = .255

The coefficients in Table 3 can be used in the same way
to get a prediction of the number of times that a similar
male can be expected to go hunting.
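
The arithmetic of the worked example can be reproduced in a
few lines of Python. The increments are read from the sum
quoted above, in the order given there; the variable names
are illustrative only.

```python
general_mean_1969 = 0.219  # general mean for males, 1969

# Increments quoted in the worked example, in the order given there.
increments = [-0.091, 0.011, 0.016, 0.011, 0.072, 0.031, -0.010, -0.016, 0.012]

probability = general_mean_1969 + sum(increments)
print(round(probability, 3))  # 0.255
```

The same pattern, a general mean plus one increment per
socio-economic variable, applies to the frequency model,
with the coefficients of Table 3 in place of those of
Table 1.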

The reason for presenting the results of the analysis
graphically is that the large array of numbers in Table 1
does not give a quick impression of people's behaviour. The
graphs of the regression coefficients show that (for hunting
participation) there are fairly distinct trends both for
males and females. Negative differentials show the low
probability of hunting for people in large cities. One also
sees the shift to positive differentials for people from
small communities and rural areas. With age, once one passes
the age of 16 at which hunting is legal, one notes that
there is a decrease in hunting participation with increasing
age. This decrease in participation in an activity with
increasing age is something that almost all activities have
in common and which to some extent reflects a general
tendency of most people to become less active when they
become older. By looking at the coefficients on the
relationship between participation and numbers of persons in
the household, one sees that there are not strong effects
compared to the age effects or city size effects.

Tables 2 and 4 present standard deviation values for
the regression coefficients that are plotted in the graphs
and which are reported in Tables 1 and 3. These standard
deviations give one an idea of the size of deviation between
the "true" coefficient and its estimate which has a very
high probability of occurring. If one can assume that the
distribution of the coefficients is somewhat near normal,
then the characteristics of the normal distribution suggest
that deviations of about 1.56 times the standard deviations
reported have a relatively low probability of occurring
(less than 1 chance in 10). So, for example, one may note
that the hunting coefficient showing the decrease in
probability of -.091 that was observed for being in a city
of over 100,000 in 1969 has a standard deviation of .02.
This means that there is a high probability that the size of
this decrease could be as large as .11 or as small as .071
but, in line with the point just made, the probability that
it will be greater than .12 or less than .06 is quite
remote. In the case of the coefficients that apply to the
number of persons in the household, one may observe that all
of these have an absolute value of under .02 whereas all of
the standard deviations for these coefficients are over .02;
this is a clear indication that one can accept the
hypothesis that all of these coefficients equal zero: in
other words, that persons in the household need not be a
variable considered in predicting hunting participation.
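
The ranges quoted for the city-size coefficient follow from
simple arithmetic, sketched below in Python under the
near-normal assumption stated in the text. The function
names and the household-size values are hypothetical. For
reference, a deviation beyond 1.56 standard deviations has a
normal probability of about 0.06 in either direction, or
about 0.12 in both directions combined.

```python
import math


def normal_two_sided_tail(z):
    """P(|Z| > z) for a standard normal variable Z."""
    return math.erfc(z / math.sqrt(2))


def band(coefficient, std_dev, multiple=1.0):
    """Range of coefficient values within `multiple` standard deviations."""
    half_width = multiple * std_dev
    return coefficient - half_width, coefficient + half_width


coef, sd = -0.091, 0.02  # city-size effect for 1969 and its standard deviation

print(band(coef, sd))        # roughly (-0.111, -0.071)
print(band(coef, sd, 1.56))  # roughly (-0.122, -0.060)
print(normal_two_sided_tail(1.56))

# A coefficient smaller in absolute value than its standard deviation,
# as for the household-size effects, is not distinguishable from zero.
household_effect, household_sd = 0.015, 0.025  # hypothetical values
print(abs(household_effect) < household_sd)    # True
```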

Given the previous statements, it seems rather awkward
to go back and indicate that the coefficients were not
computed in the most efficient way possible and that the
variance estimates may be larger than they should be. The
point is that there are a number of rather tricky
statistical issues related to the need to carry out special
weighted regressions. As indicated in the Review of Chapter
VII, carrying out these special regressions would have
resulted in additional expense that was unnecessary because,
with the number of observations available, weighted
regression apparently does not produce parameters that are
perceptibly more accurate than those of an unweighted
analysis. Also, other work showed that the models developed
had other problems. Results presented in TN 20 show that the
kinds of simple models presented here very often explain
only about half the variance that should be explained by
socio-economic characteristics. This is because the simple
autonomous effects of socio-economic variables assumed to
apply here are not complicated enough to mirror reality.
However, the method is still appropriate to making
predictions with a more complicated model, so there is merit
in presenting the method and noting its problems.

The reader may note that a hunting model is tested in
Technical Note 6 and is accepted to be structurally
appropriate to the data on male residents of Canada's
participation in hunting obtained in the 1972 National
Survey of Canadians' participation in outdoor activities. In
that same note it is explained why results derived in TN 29
show that the hunting model derived here is not good because
the effect of supply on participation is not considered. In
the next section of this paper the rationale behind using an
equation such as the one introduced in making predictions is
presented. But the accuracy of the predictions that can be
made using such a model is taken up in a separate paper (TN
6). As already implied, structural problems with the
equation (in terms of using it in modelling people's
behaviour) are taken up in several other papers. The matter
of whether or not interaction effects exist and their
magnitude is the topic of TN 20. TN 29 deals with the matter
of whether the effect of supply can and/or should be
incorporated into Equation 1. TN 35 presents the results of
research on what value R-squared should have when regression
is carried out on survey data to estimate the parameters of
Equation 1 (Form 2) and how one can test for model validity.
(See also the Review of Chapter VII of this volume.)

In the way of further introduction, the reader may find
it interesting to note that the present version of TN 12 is
not the original version which was released. A great deal of
work on applying the analysis of variance model for making
predictions has been carried out by Renoux, 1973, 1975 (see
References 11, 12). The CORD Study research cited above has


TABLE 2

STANDARD DEVIATIONS OF ESTIMATED PARAMETER VALUES FOR HUNTING PARTICIPATION

General means for Males:    1969  .038     1972(1)  .030     1972(2)  .028
General means for Females:  [illegible]

Columns: EFFECTS FOR MALES and EFFECTS FOR FEMALES, each for 1969, 1972(1) and 1972(2)
[numeric entries illegible in the source; row labels follow]

CITY SIZE
B(1,1)  Over 100,000
B(1,2)  30,000-100,000
B(1,3)  10,000-30,000
B(1,4)  1,000-10,000
B(1,5)  Rural

PERSONS IN HOUSEHOLD
B(2,1)  One
B(2,2)  Two
B(2,3)  Three or Four
B(2,4)  Five
B(2,5)  Six or More

EDUCATION
B(3,1)  Public School, Refuse
B(3,2)  Some High School
B(3,3)  High School Grad. or More

AGE
B(4,1)  10 to 17
B(4,2)  18 to 29
B(4,3)  30 to 39
B(4,4)  40 and Over

INCOME
B(5,1)  Refuse, Don't Know
B(5,2)  Less than $2,999


TABLE 3


ESTIMATED PARAMETER VALUES FOR HUNTING FREQUENCY*

Columns: General means for Males and for Females; EFFECTS FOR MALES and
EFFECTS FOR FEMALES, each for 1969, 1972(1) and 1972(2)
[numeric entries illegible in the source; row labels follow]

CITY SIZE
B(1,1)  Over 100,000
B(1,2)  30,000-100,000
B(1,3)  10,000-30,000
B(1,4)  1,000-10,000
B(1,5)  Rural

PERSONS IN HOUSEHOLD
B(2,1)  One
B(2,2)  Two
B(2,3)  Three or Four
B(2,4)  Five
B(2,5)  Six or More

EDUCATION
B(3,1)  Public School, Refuse
B(3,2)  Some High School
B(3,3)  High School Grad. or More

AGE
B(4,1)  10 to 17
B(4,2)  18 to 29
B(4,3)  30 to 39
B(4,4)  40 and Over

INCOME
B(5,1)  Refuse, Don't Know
B(5,2)  Less than $2,999
B(5,3)  $3,000 to $5,999
B(5,4)  $6,000 to $10,499
B(5,5)  $10,500 or More

MARITAL STATUS
B(6,1)  Single
B(6,2)  Married
B(6,3)  Other

POSITION IN HOUSEHOLD
B(7,1)  Head (Male or Female)
B(7,2)  Son or Daughter
B(7,3)  Other

CHILDREN UNDER 5
B(8,1)  None
B(8,2)  Some

HOUSING
B(9,1)  Single
B(9,2)  Row, Duplex, Multi
B(9,3)  Apartments

(1) 18 years of age and over
(2) 10 years of age and older

* The 1969 and 1972 frequency figures are not comparable because frequency categories
are not identical for both years.


occurred since the original TN 12 was prepared. This
research has resulted in the recognition of a number of
practical problems that are encountered in making
computations which yield predictions and in the recognition
of a number of theoretical problems. So a re-examination of
the material resulted in the preparation of this present
note and the revision of some other notes.


DEFINITION OF THE PROJECTION MODEL 
THEORY AND APPLICATION 
Given the equations specified earlier as Equation 1, it
is easy to see that for all of the individuals in a certain
geographic area one could compute their probability of
participating in a given activity, or compute their frequency
of participation in that given activity, as long as one knew
the socio-economic characteristics of each individual. In
fact, it is only necessary to have certain census information.
Visualize for the moment that one has an equation for each
individual. Then, for example, for males one has the
situation indicated in Figure 3. There is a collection of
equations with regression coefficients multiplied by zero or
1, and adding up these zeros and ones produces the numbers
shown at the bottom of the various columns.

For example, in column 2 of the figure, one gets T(I) x
U as a total because if the U's are added over all of the
population, T(I) people, one must get T(I) U's, or T(I) x U
as a total. The only difference in the other columns is that
there are both 0's and 1's, so certain B's are multiplied by
1's whereas the others are multiplied by 0's. Now some of
the B's are multiplied by 1 because the people have that
particular characteristic (e.g. they live in a city of over
100,000 population). So, when one adds up the 1's in a
column without considering the B's, one finds out how many
of the T(I) people have the particular characteristic which
is being considered. Obviously, for each socio-economic
variable the number of people in each level of the variable
adds up to the total number of people, T(I). So what is
depicted in Figure 3 is the fact that individual equations
for people can be added up and there is no need to consider
equations for each individual but only to take the
regression coefficients and multiply them by certain data,
for example, from the census (e.g. data on how many females
in Quebec come from cities of over 100,000 or come from
certain household size categories, etc.). One can make
estimates without knowing individual characteristics but by
knowing the information indicated in Equation 2 following:


(2)  $N(i) = U(i)\,T(i) + \left(\sum_{j} B(i,1,j)\,n(i,1,j)\right) + \left(\sum_{j} B(i,2,j)\,n(i,2,j)\right) + \cdots$


WHERE N(1) is the number of males participating in some
activity and N(2) is the number of females, and n(i,k,j) is
the number of people of sex i in level j of socio-economic
variable k. The U(i) and B(i,k,j)'s are the parameters as
defined earlier. They are males' parameters if i = 1 and
females' parameters if i = 2.
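
The argument depicted in Figure 3 can be checked with a small sketch. The Python fragment below is only an illustration and is not part of the CORD Study computations: the general mean, the effects and the five individuals are all hypothetical, and only two socio-economic variables are used. It shows that adding up the individual equations gives the same figure as multiplying the coefficients by the counts of people in each level, which is all that Equation 2 requires.

```python
# Hypothetical parameters for one sex: a general mean U and effects
# B[k][j] for level j of socio-economic variable k. Values are invented.
U = 0.25
B = {
    "city_size": {"over_100000": -0.10, "rural": 0.08},
    "age": {"18_to_29": 0.05, "30_and_over": -0.03},
}

# Hypothetical individuals; each holds exactly one level of each variable.
PEOPLE = [
    {"city_size": "over_100000", "age": "18_to_29"},
    {"city_size": "over_100000", "age": "30_and_over"},
    {"city_size": "rural", "age": "18_to_29"},
    {"city_size": "rural", "age": "30_and_over"},
    {"city_size": "rural", "age": "30_and_over"},
]


def individual_value(person):
    """One person's equation: U plus the effect of each level the person holds."""
    return U + sum(B[k][level] for k, level in person.items())


def level_counts(people):
    """n[k][j]: how many people fall in level j of variable k."""
    n = {k: {j: 0 for j in levels} for k, levels in B.items()}
    for person in people:
        for k, level in person.items():
            n[k][level] += 1
    return n


def aggregate_value(total_people, n):
    """Equation 2 for one sex: U*T plus the sum of B(k,j)*n(k,j) over k and j."""
    return U * total_people + sum(B[k][j] * n[k][j] for k in B for j in B[k])


summed_over_people = sum(individual_value(p) for p in PEOPLE)
from_counts_only = aggregate_value(len(PEOPLE), level_counts(PEOPLE))
print(round(summed_over_people, 6), round(from_counts_only, 6))
```

The two printed figures agree, so the counts n(k,j), which a census can supply, are sufficient and the individual records are not needed.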


Incidentally, the subscript for sex has been introduced
to stress the importance of dealing with the two sexes
separately for most activities. The importance of carrying
out analyses where the N(i)'s for the two sexes are predicted
independently can be very easily understood. By looking at
Figure 1 one sees that female participation in hunting has
coefficients which look somewhat similar to the male
coefficients, but the male and female coefficients are not
related in such a way that one simply adds a constant value
to the female effects to get the male effects. The female
effects are scaled-down versions of the male effects and
usually run from a quarter to a third as large. They tend to
be in the same positive or negative direction. This
multiplicative relation between coefficients is a true
interaction effect such as described in many texts. Renoux
(see References 11, 12) has presented a similar example in
his analysis of the CORD Study data. Interactions are
discussed from a different perspective in TN 20.

If one has an N(1) and an N(2) based on Equation 2, to
get results for the total population one obviously need only
add together male results and female results. However,
Equation 2 was simply stated as giving an equation for
predicting participation. This is because there are two
approaches that may be taken to predicting frequency of
participation. One is the approach which is implicit in the
way that Equation 1 was defined and suggests that frequency
be predicted as indicated in Equation 3:


(3)  $TP(i) = U(i)\,T(i) + \sum_{k}\sum_{j} B(i,k,j)\,n(i,k,j)$


WHERE the sums are on k and j and where TP(i) is the total
amount of participation in a given activity for males (i=1)
or females (i=2). The other terms are as defined before,
except they are the coefficients for a frequency model (see
Figure 2 and Table 3) rather than the coefficients for a
participation model.

Rather than using the equation for total participation
just indicated, data can be used to obtain an average
frequency of participation in hunting. For example, this may
be computed by adding up the number of trips that each
participant makes and dividing by the total number of
participants. Actually, the average number figure should
probably be disaggregated according to rural/urban or in
some other way that is consistent with the socio-economic
variables considered in the analysis. This is so that as the
population changes an appropriate change can be made in the
amount of participation predicted. But that was not done
here. Equation 4 is not written in such a way as to allow
for this:


(4)  $TP = F(1)\,N(1) + F(2)\,N(2)$

WHERE TP is the total amount of participation in a given
activity for males and females, the N(i) are as defined by
Equation 2 and the F(i) are the average frequencies of
participation for each sex. Really, the equation shows that
one multiplies each of two types of participants by their
average frequencies of participation. Renoux (see Reference
11, pages 66-67) has presented average frequency figures of
F(1) = 4.4 and F(2) = 3.0 for hunting in Quebec.
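
As a minimal sketch of Equation 4, the fragment below multiplies the number of participants of each sex by the average frequency for that sex. The average frequencies are the Quebec hunting figures quoted from Renoux; the participant counts are hypothetical round numbers chosen only to make the arithmetic easy to follow (4.4 x 1,000 + 3.0 x 200 = 5,000), and the function name is an assumption of this illustration.

```python
# Average frequencies of participation per participant, as quoted in the text
# from Renoux for hunting in Quebec: 4.4 for males and 3.0 for females.
AVERAGE_FREQUENCY = {"male": 4.4, "female": 3.0}


def total_participation(participants):
    """Equation 4: sum over the sexes of average frequency times participants."""
    return sum(
        AVERAGE_FREQUENCY[sex] * participants[sex] for sex in AVERAGE_FREQUENCY
    )


# Hypothetical numbers of participants, standing in for N(1) and N(2).
example_participants = {"male": 1_000, "female": 200}
example_total = total_participation(example_participants)
print(f"{example_total:,.0f}")
```

In practice the N(i) would come from Equation 2, so any error in the predicted number of participants carries through proportionally to the predicted amount of participation.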

Given the kind of equations just indicated, there is no
need to use them with population figures for the year in
which the data were collected on which the B( )'s were
determined. Obviously one can make predictions by using the
equations. One can also see how the "prediction" equations
introduced earlier are used by seeing that in each of Tables
5 and 6 the numbers of people in various socio-economic
categories are multiplied by the effects associated with
given socio-economic categories. These products have been
added together in the ways specified by the equations, with
the actual estimates being shown at the bottom of the tables
as totals. The results of carrying out these computations
are summarized in Table 7. In terms of computations, it is
actually a very simple procedure to make projections using
this kind of equation.
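
The arithmetic behind a table of this kind can be sketched as follows. The population, general mean, persons and effects are the figures as read from the male general-mean and urbanization rows of Table 5; the age and income rows are left out for brevity, so the final figure is a partial sum and is not the Table 5 estimate of male hunters. The variable and function names are assumptions of this illustration.

```python
# Male general-mean row of Table 5: projected 1980 population and general mean.
MALE_POPULATION_1980 = 2_150_782
MALE_GENERAL_MEAN = 0.274

# Male urbanization rows of Table 5: (category, persons, effect).
URBANIZATION_ROWS = [
    ("1000-9999", 244_701, -0.114),
    ("10000-29999", 191_021, 0.162),
    ("30000-99999", 198_309, -0.036),
    ("100000+", 1_198_204, -0.156),
    ("Rural", 318_547, 0.072),
]


def total_effects(rows):
    """Multiply the persons in each category by the effect for that category."""
    return [(label, persons * effect) for label, persons, effect in rows]


general_term = MALE_POPULATION_1980 * MALE_GENERAL_MEAN
effects = total_effects(URBANIZATION_ROWS)
# Partial estimate only: general mean term plus the urbanization effects.
partial_estimate = general_term + sum(value for _, value in effects)

print(f"{'general mean':<14}{general_term:>12,.0f}")
for label, value in effects:
    print(f"{label:<14}{value:>12,.0f}")
print(f"{'partial sum':<14}{partial_estimate:>12,.0f}")
```

Each printed product corresponds to an entry in the "total effects" column of the table; adding the corresponding products for age and income would complete the male estimate.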

The real problem relates to determining the n( )'s, the
numbers of people that are predicted to be in certain
socio-economic groups at some time in the future. One should
note that for the example in Tables 5 and 6 estimates were
made of what the total Quebec population would be for males
and for females in 1980. Estimates were also made of how many
people would be in certain socio-economic categories. Care
was then taken to see that the predictions as to the number
of people in socio-economic categories agreed in total with
the total number of people available to be in these
categories:


(5)  $T(i) = \sum_{j} n(i,k,j)$


WHERE the sum is over all values of j and the condition must
hold for all k, that is, for all socio-economic variables
considered.
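
A check of the condition in Equation 5 is easily automated. The sketch below is an illustration under stated assumptions: the projected total and the category counts are hypothetical, the function name is invented, and the education figures are deliberately made to disagree with the total so that the check has something to report.

```python
def variables_not_summing_to_total(total, counts, tolerance=0):
    """Return the variables whose level counts do not add up to the total.

    `counts` maps each socio-economic variable k to a mapping of
    level j -> projected number of people n(k, j) for one sex.
    """
    return sorted(
        variable
        for variable, levels in counts.items()
        if abs(sum(levels.values()) - total) > tolerance
    )


# Hypothetical projections for one sex at some future year.
projected_total = 1_000_000
projected_counts = {
    # Adds to 1,000,000, so Equation 5 holds for this variable.
    "city_size": {"urban": 640_000, "rural": 360_000},
    # Adds to 1,020,000, so Equation 5 is violated for this variable.
    "education": {
        "public_school": 420_000,
        "some_high_school": 330_000,
        "high_school_grad_or_more": 270_000,
    },
}

inconsistent = variables_not_summing_to_total(projected_total, projected_counts)
print(inconsistent)
```

Any variable reported by such a check has to be reconciled with the population total before the projection equations are applied.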

It may seem odd that a point has been made of the fact
that the number of people in various levels for
socio-economic categories should add to the total number of
people in a population. However, this has actually been found
to be a problem for several reasons. For one thing, some
researchers have projected the number of people in various
socio-economic categories by drawing trend lines through the
numbers of people in these categories at a number of census
years. From this procedure they get estimated numbers of
people in cities over 100,000 at future years. From another
curve, estimated numbers of people in various other city size
categories are obtained in a similar way. A trend line is
drawn to determine the total number of people in the
population being considered at some future point in time.
There is no constraint that makes the prediction of "total
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TABLE 5 


THE PROJECTED HUNTERS IN QUEBEC IN 1980 CONSIDERING SEX, AGE, INCOME AND
URBANIZATION IN THAT YEAR

(... marks an entry that is illegible in the source. In the GENERAL MEAN
row the columns are population, general participation rate and their
product.)

                        MALE                             FEMALE
VARIABLES       PERSONS   EFFECT      TOTAL      PERSONS   EFFECT      TOTAL
                                    EFFECTS                          EFFECTS

GENERAL MEAN  2,150,782    0.274    589,314    2,274,707      ...     41,502

AGE
18-19            94,626   -0.033        ...       89,788    0.126     11,313
20-24           346,550    0.135     46,784      341,821    0.078     26,662
25-29           293,233   -0.045    -13,195      269,524    0.055     14,824
30-34           189,092    0.181     34,226          ...      ...        ...
35-39           219,706    0.012      2,636      214,098   -0.038     -8,136
40-44           198,264   -0.025     -4,957      195,646   -0.037     -7,239
45-49               ...      ...        ...          ...      ...        ...
50-64           414,605   -0.175    -72,556      441,189   -0.048    -21,177
65+                 ...      ...        ...          ...      ...        ...

INCOME
0-2999              ...      ...        ...          ...      ...        ...
3000-4499           ...      ...        ...          ...      ...        ...
4500-5999           ...      ...        ...          ...      ...        ...
6000-7499       321,276    0.043     13,815      339,787      ...        ...
7500-8999       387,713    0.061     23,650      410,053   -0.006     -2,460
9000-10499      331,777    0.138     45,785      350,894    0.015      5,263
10500+          707,538    0.011      7,783      748,305    0.016     11,973

URBANIZATION
1000-9999       244,701   -0.114    -27,896      253,379   -0.021     -5,321
10000-29999     191,021    0.162     30,945      198,139   -0.016     -3,170
30000-99999     198,309   -0.036     -7,139      214,330   -0.017     -3,644
100000+       1,198,204   -0.156   -186,920    1,309,365   -0.012    -15,712
Rural           318,547    0.072     22,935      299,494    0.034     10,183

TOTAL HUNTERS (TOTAL MALES & FEMALES PARTICIPATING)   489,009
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TABLE 6 


THE PROJECTED HUNTING FREQUENCY IN QUEBEC IN 1980 BY SEX, AGE, INCOME
AND URBANIZATION

(... marks an entry that is illegible in the source. In the GENERAL MEAN
row the columns are population, general average per capita frequency and
their product.)

                        MALE                             FEMALE
VARIABLES       PERSONS   EFFECT      TOTAL      PERSONS   EFFECT      TOTAL
                                    EFFECTS                          EFFECTS

GENERAL MEAN  2,150,782    1.053  2,264,773    2,274,707    0.042     95,537

AGE
18-19            94,626    0.039      3,690       89,788    0.353     31,695
20-24           346,550    0.742    257,140      341,821    0.315    107,674
25-29           293,233   -0.318    -93,248      269,524    0.186        ...
30-34           189,092    0.332     62,779          ...      ...        ...
35-39           219,706    0.280     61,518      214,098      ...        ...
40-44           198,264    0.128     25,378      195,646   -0.137    -26,804
45-49               ...      ...        ...          ...      ...        ...
50-64           414,605   -0.850   -352,414      441,189      ...        ...
65+                 ...      ...        ...          ...      ...        ...

INCOME
0-2999              ...      ...        ...          ...      ...        ...
3000-4499           ...      ...        ...          ...      ...        ...
4500-5999           ...      ...        ...          ...      ...        ...
6000-7499       321,276    0.231     74,215      339,787    0.070     23,785
7500-8999       387,713    0.077     29,854      410,053    0.071     29,114
9000-10499      331,777    0.366    121,430      350,894    0.025      8,772
10500+          707,538   -0.097        ...      748,305      ...        ...

URBANIZATION
1000-9999       244,701      ...        ...      253,379    0.002        507
10000-29999     191,021      ...        ...      198,139    0.026      5,152
30000-99999     198,309    0.206     40,852      214,330   -0.066    -14,146
100000+       1,198,204   -0.470   -563,156    1,309,365   -0.061    -79,871
Rural           318,547      ...        ...      299,494      ...        ...

TOTAL NUMBER OF TRIPS ESTIMATED (MALES PLUS FEMALES)   1,662,877


number of people" obtained by adding together the numbers in
(say) each city size or each level of education agree with
the total number of people predicted to be in a population
based on a trend line!
If the researcher uses "trend line" projection he must
make some choice as to how to deal with the problem of
differing totals. One simple approach is to make the kind
of "trend line" projections for the number of people in
various socio-economic categories. These projections can be
used to determine what part (proportion) of a total
population defined independently would be, say, in each
level of education. The following equation then applies:


$\text{(Estimated number in level 1 for variable X)} = \dfrac{\text{(Projected number in level 1 for variable X)}}{\text{(Sum of number in each level for variable X)}} \times \text{``Total''}$
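
The proportional allocation just described can be sketched in a few lines. This is an illustration only: the trend-line figures for the three education levels and the independently defined total are hypothetical, and the function name is an assumption. The sketch shows that the allocated numbers keep the proportions given by the trend lines while adding to the independent total.

```python
def allocate_to_total(trend_projections, independent_total):
    """Scale trend-line projections for each level so they add to the total.

    Each level receives (its trend-line number / sum of trend-line numbers
    over all levels) times the independently defined population total.
    """
    trend_sum = sum(trend_projections.values())
    if trend_sum <= 0:
        raise ValueError("trend-line projections must add to a positive number")
    return {
        level: independent_total * number / trend_sum
        for level, number in trend_projections.items()
    }


# Hypothetical trend-line projections; they add to 2,100,000, not 2,000,000.
trend_by_education = {
    "public_school": 900_000,
    "some_high_school": 700_000,
    "high_school_grad_or_more": 500_000,
}
independent_total = 2_000_000

estimated = allocate_to_total(trend_by_education, independent_total)
for level, number in estimated.items():
    print(f"{level:<26}{number:>12,.0f}")
```

Because only the proportions from the trend lines are used, the same routine can be applied separately to the male and female totals.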


One other example of where problems have occurred (where
census information is not available on certain
characteristics) is the following, in which estimates were
needed on income of household head. The problem arises in
trying to procure information for each member of a household
by income of head of household. This is obviously needed in
the male and female models for which results were generated.
Given that it cannot be obtained directly, an approximation
approach was adopted in which the head of household income
information for the total population was used, accepting
the fact that there will be some error if one simply says
that males and females will have the same distribution of
income of head of household. When the assumption noted is
made, it is not sufficient to simply take the income of head
of household distribution, divide by two and call half the
population males and half females. As was indicated above,
it is necessary to use the distribution information for the
total population to derive a distribution such that the
numbers in each level of the variables being considered do
add up to the T(I) for that population. So, the distribution
was expressed in percentage terms and T(I) was allocated
according to such percentages.


DISCUSSION 


Although it is a simple matter to make projections in
the way that has been described, this does not mean that the
procedure is correct. There are a number of different
concerns which may be raised here. By using a particular
model one may feel that it is implied that the structural
adequacy of the model is accepted and the accuracy of the
projections is not taken to be a point in question.
However, the reader will realize that early in the paper it
was indicated that only certain matters of concern about the
model were to be pursued in this paper. One of those was
that the coefficients of the model were computed for nine
socio-economic variables whereas in the actual analysis only
three of these socio-economic variables were used. In
statistical analysis, it is well known that the
interrelationship between two variables - for example age
and income - may be such that a coefficient which is used to
take into account the effect of age may also reflect the
effect of income. In the technical literature this is known
as the multi-collinearity problem, and an examination of the
correlations between the variables that were left out of the
model and the variables that were included shows that some
of these variables are correlated. So, there is a
possibility that there may have been an over or under
compensation for some factors that reflect the behaviour of
the people in the population. Renoux has commented on this
problem and there are numerous discussions of the problem of
multi-collinearity in the statistics and the econometrics
literature.
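
The kind of examination referred to can be illustrated with a simple correlation between two zero-one indicators, one for a variable kept in the analysis and one for a variable left out. The sketch below is hypothetical throughout: the eight respondents are invented, the pairing of an age level with a marital status level is chosen only as an example, and the ordinary product-moment correlation is used as one convenient measure, not necessarily the one used in the CORD Study work.

```python
from math import sqrt


def pearson_correlation(x, y):
    """Product-moment correlation between two equally long lists of numbers."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of the same length with at least 2 items")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    covariance = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sum((a - mean_x) ** 2 for a in x)
    spread_y = sum((b - mean_y) ** 2 for b in y)
    if spread_x == 0 or spread_y == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return covariance / sqrt(spread_x * spread_y)


# Hypothetical respondents: 1 if the respondent is in the level, else 0.
age_18_to_29 = [1, 1, 1, 1, 0, 0, 0, 0]  # a level of a variable kept in the model
single = [1, 1, 1, 0, 0, 0, 0, 1]        # a level of a variable left out

correlation = pearson_correlation(age_18_to_29, single)
print(round(correlation, 3))
```

A correlation well away from zero, as in this invented case, indicates that the coefficient for the included level is likely to be picking up part of the effect of the omitted one.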

Another type of problem is that the frequency models
defined here seem to be poor models. Such a comment may seem
odd. However, when Renoux (1973) began to seriously
investigate how to predict the total amount of hunting in
Quebec, he found that results obtained using the first form
of the frequency model often did not make much sense. That
is why the second type of frequency model was developed. It
was concluded by Renoux and Beaman that problems with the
"type 1 frequency model" were caused by the drastic skewness
of the distribution of stated frequencies of participation.
At first it was considered necessary to "correct" for this
skewness by using weights so that the larger uncertainty
that one may have in a person's statement that he
participated 100 times rather than zero times would be
reflected in estimates. However, soon the kind of problems
with models discussed in a number of technical notes were
recognized and it was decided that further work on a
frequency model should be deferred until the research for
these other notes was completed (for TN 6, 20, 29 and 35).

It may not be clear that there must be data on a
specific age group before it is valid to make the
predictions for this age group. What is the importance of
this? The data from which the regression coefficients
presented here were obtained were either for people 10 years
of age and over (1972) or for people 18 years and over (1969).
So for hunting it is quite possible to make predictions for,
say, people 16 years of age and over by assuming that they
have the same regression coefficients as people 18 years of
age and over, if 1969 data are to be used. If 1972 data are
used, coefficients must be defined in some special way using
the age specific data which do not give month of birth.
However, explicit age information need not be available. So
it is possible to make predictions about the total number of
people of all ages participating in some activities.

To further illustrate this, if one knows on the average
how many heads of households are with each picnicking party
or has some other measure that can be used to divide or
multiply a predicted figure, one can translate it into using
parties. In other cases it is possible to use socio-economic
characteristics to infer that if a respondent takes part in
an activity, probably a certain number of other people do.
The individual's weight can thus be increased to make
estimates of all participation. In doing this care must be
taken that children are not counted twice when weights for
adults are increased to reflect that children will go with
them on trips (e.g. each parent gets half the children).
Still there is a large problem area of application for the
technique that needs to be explored in terms of particular
implications to make certain kinds of total use estimates
that may be of interest to planners or managers.
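
The rule that each parent gets half the children can be generalized, as a sketch, to giving each adult an equal share of the children in the household. The fragment below is an illustration under strong assumptions that are not in the original analysis: the households are hypothetical, every adult in a household is taken to participate, and all children are taken to accompany the adults.

```python
def adult_weight(children_in_household, adults_sharing_children):
    """Weight for one adult: the adult plus an equal share of the children."""
    if adults_sharing_children < 1:
        raise ValueError("at least one adult must share the children")
    return 1 + children_in_household / adults_sharing_children


# Hypothetical households in which all adults take part in the activity.
HOUSEHOLDS = [
    {"adults": 2, "children": 3},  # each parent is weighted 1 + 3/2
    {"adults": 1, "children": 2},  # a single parent is weighted 1 + 2
    {"adults": 2, "children": 0},  # no children, so the weights stay at 1
]

weighted_total = sum(
    h["adults"] * adult_weight(h["children"], h["adults"]) for h in HOUSEHOLDS
)
head_count = sum(h["adults"] + h["children"] for h in HOUSEHOLDS)
print(weighted_total, head_count)
```

The weighted total equals the simple head count, which is the sense in which sharing the children among the adults avoids counting any child twice.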

In one case, an estimation problem may be overcome by
considering that certain activities are carried out by
family groups. Then family information and socio-economic
information can be used to get regression equations. In
other cases, average results from an adult analysis can be
multiplied by inflation factors. But, as already implied,
what to do in particular cases involves research problems
that remain to be solved in the future and must be solved in
the context of each need for a special kind of information.
They cannot be solved in any general way unless another kind
of data base is created on people of all ages (or in some
cases on families). In other words, no simple or universal
answers can be given regarding what are the best data or
what is the easiest way to arrive at acceptable estimates of
total participation, number of people who participate in an
activity, etc.

On a very different matter, it may have been noticed
that this paper suggests the use of a cross-sectional
equation for making predictions. TN 13 points out some of
the concerns that arise because the coefficients of the kind
of model proposed here are derived for a certain point in
time and may be changing over time. Also there are comments
made in this volume about the importance of recognizing that
the relationship (previously referred to as collinearity)
between, for example, age and some other variables actually
reflects what may be treated as a causal relation. It is
truly unfortunate that no CORD Study research has pursued
the matter of developing causal models, for example the kind
of path-analysis models that have been described by Blalock
(see Reference 3). However, it is the case that in the CORD
Study, socio-economic variables that do influence each other
in a causal way and do not influence participation
"independently" have been used. This approach, at best, is
an approximation that creates difficulties in using the
model for predicting the future.

Another point worthy of discussion is that the model
introduced treats individuals as participating in one
activity or another independently of their participation in
other activities. But it has been indicated in TN 10, 32 and
37 (and as is well accepted among researchers) that people's
participation in one activity is related to their
participation in other activities. Any model like the
present one which treats behaviour on an activity-by-activity
basis may have structural problems related to the
fact that activities are treated autonomously. Certainly
when one considers making predictions of what will happen in
the future, there is a danger that by using the model
proposed here one is suggesting that what is important in
determining people's future behaviour is their socio-economic
characteristics rather than their orientation to a variety
of recreational activities. In fact, one important factor is
definitely what happens in the development of the supply of
facilities for different activities (on supply and
participation see TN 29). In part, developments that take
place with respect to supply affect participation, not in
terms of people's participation in individual activities but
in terms of how in the future they allocate their finite
time according to some kind of time budget. This depends on
the supply of opportunities for a variety of activities that
they find in the future as opposed to the supply with which
they are confronted in the present.

The development of the Blackstrap Ski Development in
Saskatchewan has had a drastic effect not only on the amount
of skiing in Saskatchewan but on activities that were taking
place in the time now filled by skiing and obviously not now
taking place, or at least not as much. Both change in supply
and substitution of one activity for another (as has
happened with snowmobiling, biking and cross country skiing)
have played important roles in altering the behaviour of a
large number of Saskatchewan residents. The supply component
of this change may quite possibly be taken into account to
some extent on a single activity basis for some activities
by incorporating a supply factor into the kind of analysis
of variance model considered here (see TN 29). But as
pointed out in CORD Study TN 10, the problem of how to
compute the impact of the movement into skiing on other
activities which were previously participated in by a person
still remains as a matter for research. There are no
practical answers at this point in time.


TYPE OF DATA NEEDED FOR MAKING PROJECTIONS
USING THE ANALYSIS OF VARIANCE MODEL


In previous sections of this article it has been
pointed out that census data were used to make certain
projections. In practice there has been a continuing problem
in getting good agreement between census definitions and the
definitions used in the CORD Study National Surveys on which
the B( )'s are based. Also, published census results often do
not give information such as education by age for males, or
even income of head of household by sex of household member,
which is necessary if the present model is to be used with
income as a variable. Education by sex by age data are
needed if the fact that education means something different
for young people than old people is taken into account in a
slightly more general model (see TN 20). The point is that
for a model such as the one introduced to be used there must
be a great deal of care taken to see that survey definitions
do correspond with census definitions. What is more, if
special tabulations from census data are required, plans
must be made well in advance of carrying out an analysis so
that special tables can be produced and ready when
projections are actually to be made.

The biggest complication in using the kind of model
described, insofar as data are concerned, is that it is not
the straightforward matter that one might suspect to make
projections of what the age distribution of the population
will be at some future time, or what the education
distribution will be. If one gets into the matter of trying
to predict the breakdown of education by age at some future
time, the problems are truly very complicated. Now, it has
been the case that statistical agencies responsible for
making projections have refused to make the projections
necessary to use the model described because they say they
cannot make the required estimates accurately. But here one
enters into a rather delicate problem of whether "accurate"
projections are really needed for planning purposes. It is
better to plan on the basis of the best projections that can
be made rather than not plan at all.

It is not a straightforward matter to decide how good
projections should be before a model should be used. But one
can say that for using the model described here, there is no
need to spend fantastic amounts of money on getting
"extremely reliable" estimates of what the age distribution
or education distribution is going to be in the future. The
results predicted are not going to be accurate to the same
degree as the predictions on the distribution of education
or age, or none of the distributions will be accurate. As
long as demographic variables are predicted with an accuracy
somewhat greater than one feels is implicit in the analysis
of variance model, little more is necessary. Certainly
nothing is gained by having accuracy in population figures
which is lost as soon as regression coefficients are applied
to make predictions. (This and other matters are pursued in
TN 6 and TN 20.)


CONCLUSION 


This paper has presented a model for predicting
participation in outdoor recreation. However, there has not
been a wholehearted endorsement of using the procedure.
Rather, the paper should have provided the reader with a
clear understanding of how the analysis of variance
technique can be used should it appear better than other
alternatives which might be used to generate the same kind
of information. It might be chosen because other methods
appear more costly or are not feasible for other reasons
(e.g. data cannot be collected and analysed in the time
available).

An important point to recognize is that the recognition
of how such a model can be used is an important research
step towards developing more sophisticated models. There is
certainly a need to develop such models so that extremely
expensive surveys need not be carried out but so that equally
good results can be obtained by using predictive models
along with census information.

Even now, for all the criticism that can be leveled
against it, the model described here is good given that the
present ability of researchers to make predictions is very
limited. Massive amounts of money can be expended on surveys
when the effort might better have been spent on trying to
develop scenarios of what will happen in the future and thus
to relate current behaviour to the way the population is
likely to behave in the future. It is this point that is
stressed in CORD Study TN 13. This latter note, to a certain
degree, defends the use of the technique described here.


Appendix: Technical Note 12


The following appendix is a summary of the original
version, specifying (1) the Beta Values, (2) the estimated
parameter value and (3) the standard deviation of the
estimated parameter for the given activities, presented for
both males and females.


INTRODUCTION 
A - The 8-M Surveys
B - The Analysis of Variance Model
C - How to Read Tables and Figures
D - How to Interpret Results
1. The General Means
2. Beta Values
3. Standard Deviations of Parameters
4. The Figures
5. Coefficients of Determination, R2
6. Coefficients of Correlation
7. Evaluation of Estimates


APPENDIX *


* Numbers 1-91 indicate the numbers of the Figures in
which Beta Values for the particular activity are plotted
for Males and Females.

Tables 1-91 for all the above Figures contain the actual
estimated parameter values for the given activity for
Males and Females.

For all the above Figures, Tables 1-A to 91-A are where
the standard deviations of estimated parameter values for
the given activity are presented for both Males and
Females.


 1. Beta Values for "Swimming Participation in City"
    - Males and Females.

 2. Beta Values for "Nature Study or Bird Watching
    Participation in City" - Males and Females.

 3. Beta Values for "Outdoor Photography Participation
    in City" - Males and Females.

 4. Beta Values for "Visiting Historic Sites or Parks
    Participation in City" - Males and Females.

 5. Beta Values for "Visiting Other Parks
    Participation in City" - Males and Females.

 6. Beta Values for "Driving for Pleasure Participation
    in City" - Males and Females.

 7. Beta Values for "Sightseeing Participation in
    City" - Males and Females.

 8. Beta Values for "Snow Sledding or Tobogganing
    Participation in City" - Males and Females.

 9. Beta Values for "Picnicking Participation in
    City" - Males and Females.

10. Beta Values for "Walking or Hiking Participation
    in City" - Males and Females.

11. Beta Values for "Golfing Participation in
    City" - Males and Females.

12. Beta Values for "Ice Skating Participation in
    City" - Males and Females.

13. Beta Values for "Bicycling Participation in
    City" - Males and Females.

14. Beta Values for "Swimming Participation in
    Country" - Males and Females.


15. Beta Values for "Nature Study or Bird Watching
    Participation in Country" - Males and Females.

16. Beta Values for "Visiting Historic Sites or Parks
    Participation in Country" - Males and Females.

17. Beta Values for "Visiting Other Parks Participation
    in Country" - Males and Females.

18. Beta Values for "Driving for Pleasure Participation
    in Country" - Males and Females.

19. Beta Values for "Sightseeing Participation in
    Country" - Males and Females.

20. Beta Values for "Snow Sledding or Tobogganing
    Participation in Country" - Males and Females.

21. Beta Values for "Picnicking Participation in
    Country" - Males and Females.

22. Beta Values for "Walking or Hiking Participation"
    - Males and Females.

23. Beta Values for "Golfing Participation in
    Country" - Males and Females.

24. Beta Values for "Ice Skating Participation in
    Country" - Males and Females.

25. Beta Values for "Bicycling Participation in
    Country" - Males and Females.

26. Beta Values for "Swimming Participation" - Males
    and Females.

27. Beta Values for "Tent Camping Participation" -
    Males and Females.

28. Beta Values for "Trailer Camping Participation"
    - Males and Females.

29. Beta Values for "Pick-up Camping Participation"
    - Males and Females.

30. Beta Values for "Hunting Participation" - Males
    and Females.

31. Beta Values for "Power Boating Participation" -
    Males and Females.


32. Beta Values for "Canoeing Participation" -
    Males and Females.

33. Beta Values for "Sailing Participation" -
    Males and Females.

34. Beta Values for "Water Skiing Participation" -
    Males and Females.

35. Beta Values for "Nature Study or Bird Watching
    Participation" - Males and Females.

36. Beta Values for "Outdoor Photography Participation"
    - Males and Females.

37. Beta Values for "Visiting Historic Sites or Parks
    Participation" - Males and Females.

38. Beta Values for "Visiting Other Parks
    Participation" - Males and Females.

39. Beta Values for "Driving for Pleasure Participation"
    - Males and Females.

40. Beta Values for "Sightseeing Participation" -
    Males and Females.

41. Beta Values for "Climbing Participation" - Males
    and Females.

42. Beta Values for "Snow Skiing Participation" -
    Males and Females.

43. Beta Values for "Snowmobiling Participation" -
    Males and Females.

44. Beta Values for "Snow Sledding or Tobogganing
    Participation" - Males and Females.

45. Beta Values for "Picnicking Participation" -
    Males and Females.

46. Beta Values for "Walking or Hiking Participation"
    - Males and Females.

47. Beta Values for "Golfing Participation" - Males
    and Females.

48. Beta Values for "Ice Skating Participation" -
    Males and Females.

49. Beta Values for "Horseback Riding Participation" -
    Males and Females.

50. Beta Values for "Bicycling Participation" - Males
    and Females.

51. Beta Values for "Tennis Participation" - Males and
    Females.

52. Beta Values for "Fishing Participation" - Males
    and Females.


53. Beta Value for "Hunting or Fishing Participation".
54. Beta Value for "Small Game Hunting Participation".
55. Beta Value for "Large Game Hunting Participation".
56. Beta Value for "Waterfowl Hunting Participation".
57. Beta Value for "Cross-Country Skiing Participation".
58. Beta Value for "Downhill Skiing Participation".
59. Beta Value for "Salt Water Fishing Participation".
60. Beta Value for "Freshwater Fishing Participation".
61. Beta Value for "Using Canals in a Commercial
    Boat Participation".
62. Beta Value for "Using Canals in a Private Boat
    Participation".
63. Beta Value for "Using Canals Non-Boating Participation".
64. Beta Value for "Hunting Frequency".
65. Beta Value for "Swimming Frequency".
66. Beta Value for "Tent-Camping Frequency".
67. Beta Value for "Trailer-Camping Frequency".
68. Beta Value for "Pick-up Camping Frequency".
69. Beta Value for "Power Boating Frequency".
70. Beta Value for "Canoeing Frequency".
71. Beta Value for "Sailing Frequency".
72. Beta Value for "Water Skiing Frequency".
73. Beta Value for "Nature Study or Bird Watching
    Frequency".
74. Beta Value for "Outdoor Photography Frequency".
75. Beta Value for "Visiting Historic Sites or Parks
    Frequency".
76. Beta Value for "Visiting Other Parks Frequency".
77. Beta Value for "Driving for Pleasure Frequency".
78. Beta Value for "Sightseeing Frequency".
79. Beta Value for "Climbing Frequency".
80. Beta Value for "Snow-Skiing Frequency".
81. Beta Value for "Snowmobiling Frequency".
82. Beta Value for "Snow Sledding or Tobogganing
    Frequency".
83. Beta Value for "Picnicking Frequency".
84. Beta Value for "Walking or Hiking Frequency".
85. Beta Value for "Golfing Frequency".
86. Beta Value for "Ice-skating Frequency".
87. Beta Value for "Horseback Riding Frequency".
88. Beta Value for "Bicycling Frequency".
89. Beta Value for "Tennis Frequency".
90. Beta Value for "Fishing Frequency".
91. Beta Value for "Hunting or Fishing Frequency".


CORD STUDY TECHNICAL NOTE 6


THE ACCURACY OF ESTIMATES OF PARTICIPATION
AND FREQUENCY OF PARTICIPATION
USING THE ANALYSIS OF VARIANCE MODELS


J. Beaman and M. Alvo


ABSTRACT 


The purpose of this paper is to present the derivations
of general expressions for the error to be expected when
making predictions of frequency of participation and number
of participants in recreational activities using linear
regression and analysis of variance models.

The case of concern is when an analysis of variance
model is developed on the basis of statements made by
individuals about how frequently they participate in certain
activities. Well known results on the variance of estimable
functions are used to obtain estimates of the variance to be
expected in making predictions of the total number of male
participants in hunting and in fishing that can be expected
in Quebec. The estimates made are for a given year taking
into account the socio-economic characteristics that the
provincial residents are estimated to have in 1980.

Details on the actual numerical computations are
provided and illustrated so that the reader recognizes some
of the operational problems that arise in estimating the
accuracy of predictions. The difficulties are shown to be
related to the way that computers obtain parameter estimates.


PURPOSE 


The purpose of this note is to show how the accuracy of
estimates made using an analysis of variance model can be
assessed, when the model used can be accepted as being
structurally sound.


INTRODUCTION 


One of the models developed as part of the Canadian
Outdoor Recreation Demand Study can be expressed as
indicated in the formula below:


Probability of participating in an outdoor recreation
activity = General mean + Income effect + Age effect +
Education effect + Family Composition effect + Urbanization
effect.

In a mathematical form the equation is as follows:

(1)  $P = U + B(1,J) + B(2,K) + B(3,L) + B(4,M) + B(5,Q)$


WHERE

P = the probability of an individual participating in a
particular activity and

U = the constant for a particular activity (general mean)

B(1,J) ... B(5,Q) = the beta coefficients that apply to each
category that defines the socio-economic
status of the person being considered.
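
As a minimal sketch of how Equation (1) is applied, the following Python fragment adds the general mean and one beta coefficient per socio-economic variable. The coefficient values, the category numbering and the function name `participation_probability` are hypothetical and chosen only for illustration; they are not the estimates reported in TN 12.

```python
# Hypothetical coefficients for one activity (illustration only).
# Within each variable the betas are chosen to sum to zero.
GENERAL_MEAN = 0.20
BETA = {
    "income":       {1: -0.04, 2: 0.01, 3: 0.03},
    "age":          {1: 0.06, 2: 0.00, 3: -0.06},
    "education":    {1: -0.02, 2: 0.02},
    "family":       {1: 0.01, 2: -0.01},
    "urbanization": {1: 0.05, 2: -0.05},
}


def participation_probability(general_mean, beta, person):
    """Equation (1): general mean plus one beta per variable."""
    return general_mean + sum(beta[var][level] for var, level in person.items())


# One person described by the category he or she falls in for each variable.
person = {"income": 3, "age": 1, "education": 2, "family": 1, "urbanization": 2}
p = participation_probability(GENERAL_MEAN, BETA, person)
print(round(p, 2))
```

Nothing in this additive form keeps P between 0 and 1, so an implementation may wish to check the result before using it.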


The values of the coefficients that were estimated for
hunting are shown in Figure 1. On how to use the
coefficients and how to read the Figure, should this be a
problem, one may refer to TN 12.

The best known use of the analysis of variance
technique, which resulted in the kind of model just
described in relation to making predictions of participation
in recreation activities, is the application of the
technique by Mueller and Gurin (see Reference 9). More
recently there have been a number of applications of the
technique. In the Canadian Outdoor Recreation Demand Study
it was originally recommended that the technique be applied
by Hendry (see Reference 8). Subsequently TN 12 was prepared
and as well two works have been prepared in Quebec by Renoux
(see References 11 and 12).

What is important to this research report is
recognizing that one of the equations for making predictions
which is in TN 12 is what statisticians call an estimable
function. In the jargon of statistics, any linear function
of linear regression parameters where the coefficients of
these parameters are numbers not considered subject to
error, is an estimable function, so that the equation given
below is an estimable function with known constants N, X(1),
X(2), etc.


(2)  $E = UN + \sum_{i} B(i)\,X(i)$

The equation suggested for making estimates of the
amount of participation in a certain activity in a city, as
presented in TN 12, was as follows:


(3)  $E(T) = UN + \sum_{j} B(1,j)\,n(1,j) + \sum_{k} B(2,k)\,n(2,k) + \sum_{l} B(3,l)\,n(3,l) + \sum_{m} B(4,m)\,n(4,m) + \sum_{q} B(5,q)\,n(5,q)$


WHERE the sums are on j, k, l, m and q respectively;


E(T) = expected number of participants in a particular
activity

U = the general mean for a particular activity

N = the total population of a geographical area

B(1,j) ... B(5,q) = the effects of being in a category
(j ... q) of a socio-economic variable (1 ... 5)

n(1,j) ... n(5,q) = the number of people in the
population who are in a category (j ... q) of a
socio-economic variable (1 ... 5)
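
The computation in Equation (3) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `expected_participants`, the two variables used and every number below are hypothetical, and only two of the five socio-economic variables are included to keep the example short.

```python
def expected_participants(general_mean, beta, counts):
    """Equation (3): E(T) = U*N + sum over variables and categories of B*n."""
    # Every variable classifies the same population, so the category
    # counts of each variable must add up to the same total N.
    totals = {sum(by_level.values()) for by_level in counts.values()}
    if len(totals) != 1:
        raise ValueError("every variable must classify the same population N")
    population = totals.pop()
    effects = sum(
        beta[var][level] * n
        for var, by_level in counts.items()
        for level, n in by_level.items()
    )
    return general_mean * population + effects


# Hypothetical coefficients and population counts (not from TN 12 or a census).
beta = {"income": {1: -0.05, 2: 0.0, 3: 0.05}, "age": {1: 0.10, 2: -0.10}}
counts = {"income": {1: 300, 2: 500, 3: 200}, "age": {1: 400, 2: 600}}
e_t = expected_participants(0.20, beta, counts)
print(round(e_t, 1))
```

Because the counts n(i,j) and N are treated as known constants, the only quantities subject to estimation error in this sum are U and the B's.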


In this paper a point is made of referring to E(T)
rather than saying that T and the variance in T are
estimated. This is because (1) for each individual one does
not estimate a zero or one for participation or
non-participation and (2) estimates of T are not made.

The expected probability of participating is for the
same population for which parameters are derived, so it seems
most appropriate to say that the estimates are "expected
participation for the population considered." Actual
participation in a given geographic area in a given year
could vary from an expected value for any number of reasons,
yet the expected value which was predicted using a model
could be more appropriate for planning or policy than a
predicted T that was really appropriate to a given place in
a particular year (Expo 67 year or Olympic Games year).
Regardless, the main point is that when E(T) is estimated
for a city it has the variance which is estimated here. If,
however, one is interested in the variance in T, the actual
total participation, then each person does not contribute
the variance in his predicted E(y), a variance that
decreases with the size of survey on which the estimate is
based. Each person contributes a variance which, if p is
his estimated probability of participating, is p(1-p) and is
a variance that does not decrease as p is estimated based on
larger and larger surveys. The matters just raised, if not
clear to the reader, may be pursued by studying the ideas
presented in TN 35.
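
To illustrate the per-person variance p(1-p) mentioned above, the short Python sketch below sums it over a few individuals. It assumes, as an added simplification not stated in the text, that individuals participate independently, so that their variances add; the probabilities are hypothetical.

```python
def variance_of_actual_total(probabilities):
    """Sum of the per-person variances p(1 - p), assuming independence."""
    return sum(p * (1.0 - p) for p in probabilities)


# Hypothetical estimated probabilities of participating for three people.
probs = [0.5, 0.2, 0.1]
var_t = variance_of_actual_total(probs)
print(round(var_t, 2))
```

This quantity depends only on the probabilities themselves, which is why it is not reduced by basing the estimates on a larger survey.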

By comparing Equations 2 and 3, one can see that
Equation 3 satisfies the definition for an estimable
function and therefore the statistical theory that applies
to estimable functions can be applied to find out what the
accuracy in a prediction is which is made using this kind of
a function to make an estimate.

Now, before proceeding to presenting details on the
statistics involved, and still in the way of an
introduction, it is important to note that the theory that
is reviewed here is only appropriate if the model that is
used is structurally appropriate to the data on which the
model is based (to the situation that is being modelled).
This matter has been taken up in CORD Study TN 20.
Actually, the conclusion reached there is that the simple
kind of model dealt with here is not usually structurally
adequate. Fortunately, the theory presented here is quite
appropriate to more complicated "generalized linear models"
in which the interactions between the various socio-economic
variables that must be considered to determine one's
participation in an activity are taken into account so that
the model defined is structurally adequate.

For the reader who is not fully aware of what makes a
model linear, it is, from the point of view of the theory of
concern here, that the model is linear in the parameters
that are estimated. The coefficients of these parameters
can be the squares of observations or almost anything one
can imagine. One who is concerned about this matter may
wish to pursue it further in such references as Neter and
Wasserman (see Reference 10) or Scheffe (see Reference 13).
Also, they may wish to note that there are innumerable ways
of carrying out the kind of analysis that is described here.
One of these is referred to by economists as a dummy
variable analysis. For example, Goldberger (see Reference 7)
indicates something about both what has been traditionally
called an analysis of variance with normal nominal variables
and dummy variable analysis, and indicates why these two
forms of analysis are just algebraic equivalents when
interpreted in the right way.


THEORY AND METHOD 
Appropriate Measures of Error 


The theory which is introduced subsequently gives a
procedure by which one can estimate how much error may be
expected in a prediction made using an estimable function
type model. In fact what one finds from the theory is an
estimated standard deviation, s, or estimated variance, s²,
for the estimate made. Sometimes by chance an estimate may
be very close to the expected value that would be observed
if one had perfect knowledge of a population. Other times,
simply by chance, there may be a quite substantial error in
a prediction. This means that once the computational
procedure for estimating the amount of participation in an
activity has been defined it is reasonable to question the
accuracy of this estimate. Accordingly, let E(T) represent
the expected amount of participation in an activity for a
given population. A degree of confidence expressed as a
percentage can be used to indicate how sure the researcher
wants to be of his estimated value. Thus, to assess
accuracy one must determine an interval as including
estimates which are acceptably close to E(T).

The width of such an interval is a function of the
degree of confidence chosen as well as of the variability of
the estimate. Generally speaking, the higher the level of
confidence, the larger the interval will be. Also, the
larger the variation in the estimate, the larger the
interval. As an illustration let the "width" as shown in
Figure 2 of a symmetric interval around E(T) be defined as c
times s (cs) where c is a constant determined by the degree
of confidence specified as well as by the sampling
distribution of estimated values of E(T). This means that,
if the estimate of E(T) has a normal distribution and if one
wants to be 95% confident about the interval in which
E(T) lies, one should let c = 1.96. A convenient measure for
accuracy is the value of cs, or if possible, the percentage
of error defined by:


(4)  % error = 100 * cs / (estimate of E(T))


But caution must be exercised in using (4). If s is large
compared to the estimate of E(T), the % error as defined
above will be unreliable. Still, the advantage in using
equation (4) as a measure of error is that the measurement
is made relative to an estimate of E(T).
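
The two accuracy measures just described, the width cs and the percentage error of Equation (4), can be sketched as follows. The estimate and standard deviation used are hypothetical, and the default c = 1.96 corresponds to the 95% normal case mentioned in the text.

```python
def interval_half_width(s, c=1.96):
    """The 'width' c*s of the symmetric interval around the estimate."""
    return c * s


def percent_error(estimate, s, c=1.96):
    """Equation (4): 100 * c*s divided by the estimate of E(T)."""
    if estimate <= 0:
        raise ValueError("the estimate of E(T) must be positive")
    return 100.0 * c * s / estimate


# Hypothetical estimate of E(T) and its estimated standard deviation.
estimate, s = 1000.0, 50.0
half_width = interval_half_width(s)
pct = percent_error(estimate, s)
print(round(half_width, 1), round(pct, 1))
```

The guard on a non-positive estimate reflects the caution in the text: the ratio is only meaningful when the estimate is well away from zero relative to s.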


The Variance in Estimates


It would be a simple matter to explain how to compute
the variance in an estimable function if it were not for the
fact that computers do not estimate parameters in models in
the way that one might expect that they do. Some of the
information generated by the computer while it carries out
its computations is essential in estimating how much
variance there is going to be in a prediction.
Consequently, the following discussion deals specifically
with the way in which a particular computer program computes
the coefficients in the kind of equation introduced earlier.
The method is actually very similar to that used in a number
of standard computer programs so that the discussion should
be sufficient to indicate in all the detail necessary what
can be done to utilize information which can be extracted
from most computer programs (if a systems analyst locates
the appropriate point in the program and inserts a
subroutine to dump the various arrays needed).


So, suppose that the amount of participation in hunting
by the male population of Quebec is to be predicted for the
year 1975. The data used are the 1972 Study National Survey
information on Canadians' participation in outdoor
activities (see Volume III), which contains information on
respondents' participation in hunting and fishing and, as
well, information on their income, the size of place in
which they lived when they were interviewed and on their age
when they were interviewed. Table 1 shows how the data on
1,271 males from throughout Canada were recoded for the
purposes of the analysis reported here. The original data
had more codes for the three variables income, urbanization
and age. As well, information on fishing and hunting was
the actual number of times that the person interviewed
stated that they went hunting or fishing in the past year.

Table 1 actually presents much more information than an
indication of the different levels of the socio-economic
variables. It is set up in such a way as to facilitate the
description of how a computer finally arrived at the
estimates of the regression coefficients which are given in
the table. When the computer had read the data, which one
may think of as indicated by X's in the second column of the
table, a number of statistics were computed and the data
were transformed into the variables indicated by the x's for
the purposes of actually carrying out regressions.

But before proceeding to comment on the means and
standard deviations computed it is important to note that
the headings under "Variable Type" indicate that income,
urbanization and age were treated as nominal variables.
This means that though the variables were read in as having
values such as 1, 2 and 3, they were transformed as
indicated into variables which indicate whether a person had
a given characteristic or not. Specifically, for income, if a
person has income between 0 and $5999 then X(1,1) = 1 and
otherwise it is 0. The third level of this variable is
indicated as X(1,E) and has the indication after it
-X(1,1)-X(1,2) because this variable is not kept in the
actual estimation of parameters. Rather, wherever a person
is noted as having an income above 10,500 dollars per year
(as of 1972), this is recorded by indicating that X(1,1)
and X(1,2) have the value of -1. What this does is cause
the information on all individuals to be stored in such a
way that the regression coefficients estimated will be such
that the sum of the first two income coefficients will be
equal to minus the third coefficient. A simpler way of
stating this is that for each variable income, urbanization
and age, the way that data are defined results in the sum of
the regression coefficients being equal to zero.

In actual computations this means that if a variable
has three levels, only two regression coefficients are
computed and the third is determined by adding the other two
together and making the third coefficient equal to minus
their total. Much is made of this here because it is very
important in understanding what follows.
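
The recoding described above can be sketched in Python as follows. The sketch assumes that the level dropped from estimation is always the last level of a variable, as in the income example in the text; the function names are hypothetical.

```python
def effect_code(level, n_levels):
    """Return the n_levels - 1 transformed values for one nominal variable.

    A retained level j gives 1 in position j and 0 elsewhere; the
    eliminated (last) level gives -1 in every position.
    """
    if not 1 <= level <= n_levels:
        raise ValueError("level out of range")
    if level == n_levels:
        return [-1] * (n_levels - 1)
    return [1 if j == level else 0 for j in range(1, n_levels)]


def all_coefficients(estimated):
    """Append the eliminated coefficient as minus the sum of the others."""
    return list(estimated) + [-sum(estimated)]


print(effect_code(1, 3))   # a person in the first of three income levels
print(effect_code(3, 3))   # a person in the eliminated third level
print(all_coefficients([0.10, -0.30]))
```

With this coding the coefficients of each variable sum to zero by construction, which is the property the text relies on in what follows.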

From the preceding one can see that the capital X and
x variables are different. This is why the results shown in
the table indicate that the untransformed variables and the
transformed variables have different means and standard
deviations. However, the reader who is examining Table 1
may be confused by the fact that there is not just one set
of transformed means. There are four sets of such means.
The reason for this is a consequence of the actual
computation of regression coefficients. The first
computation of a coefficient was carried out assuming that
each observation had the same importance and should have the
same importance in determining coefficients, so each
observation was given a weight of one. However, when the
second estimation step was carried out, the weights of
observations were defined by the equation:


(5)  $w(i) = 1 / [ESY\,(1 - ESY)]$   if .01 < ESY < .99
and
     $w(i) = 100$   otherwise


WHERE ESY is the estimated probability of the individual
participating, estimated in the first iteration.
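
A minimal Python sketch of the weighting rule follows. It assumes that the weight is the reciprocal of the binomial variance ESY(1 - ESY); this reading is an assumption, chosen because it agrees with the later statement that the variance of an observation is the inverse of its weight and with the fallback value of 100, which is close to 1/(.01 x .99). The function name is hypothetical.

```python
def observation_weight(esy, cap=100.0):
    """Second-iteration weight for one observation.

    esy is the probability of participating estimated in the first
    iteration. Assumption: the weight is 1 / (esy * (1 - esy)) inside
    the interval (.01, .99) and the fixed value `cap` outside it.
    """
    if 0.01 < esy < 0.99:
        return 1.0 / (esy * (1.0 - esy))
    return cap


print(observation_weight(0.5))    # mid-range probability
print(observation_weight(0.005))  # below .01, so the cap applies
print(observation_weight(1.2))    # a linear model can predict above 1
```

The cap also handles first-iteration predictions below 0 or above 1, which a linear probability model can produce and for which the variance formula would be meaningless.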


One may see the Smith and Cicchetti paper on using
weighted regression which appears as an appendix to this
volume. The reader may find it interesting to note in Table
1 that the means and standard deviations for fishing and
hunting are exactly the same for the first iteration, the
iteration with the observation weights equal to one. They
will be the same for any activity. Only the means and
variances for the dependent variables change, for the obvious
reason that a different dependent variable is being
considered in the two cases. Also, as indicated in a
footnote in the table, the values listed for "standard
deviations" for the transformed variables are not really
standard deviations because they have not been divided by
the appropriate weight to transform them into standard
deviations. The reason for this is that all that is needed
by the computer is the sum of crossproducts, not variance
estimates or standard deviations.


The computer actually solves for the B(.) in the
following equation:


(6)  $y(.) = U + B(1,1)\,x(1,1) + B(1,2)\,x(1,2) + B(2,1)\,x(2,1) + B(2,2)\,x(2,2) + B(3,1)\,x(3,1) + B(3,2)\,x(3,2) + B(3,3)\,x(3,3)$


WHERE y(.) indicates an observation for a person with
certain socio-economic characteristics. If the person being
considered had income level 3, urbanization level 1 and age
level 2, by note (1) on Table 1 one can see that:

(7)  $y(.) = U + B(1,1)(-1) + B(1,2)(-1) + B(2,1)(1) + B(2,2)(0) + B(3,1)(0) + B(3,2)(1) + B(3,3)(0)$

     $\phantom{y(.)} = U + (-B(1,1)) + (-B(1,2)) + B(2,1) + B(3,2)$


In a matrix notation one may write the general equation
above in the form (see Reference 13) as:


(8)  $y = U + xB$

For the initial computation of parameters called
Iteration 1 in Table 1, the covariance matrix for the
observations is considered to be a diagonal matrix with all
diagonal elements being equal. On the second computation,
Iteration 2, the variance in each observation is defined by
the inverse of the weights w(i) introduced earlier. The
covariance matrix of the y's, D, is considered to be a
diagonal matrix with ones or 1/w(i) on the diagonal depending
on the iteration.

Now in matrix terms the least squares estimates of the
B(.)'s are computed by:


(9)  $B = (x'\,DI\,x)^{-1}\,x'\,DI\,y$

WHERE $B(.,e) = -\sum_{i} B(.,i)$ and D is the covariance matrix
of the observations with an inverse DI.
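
The weighted least squares computation in Equation (9) can be sketched in pure Python. Since D is diagonal with 1/w(i) on the diagonal, its inverse DI is diagonal with w(i), and the two matrix products reduce to weighted sums. The sketch assumes x and y are already expressed about their (weighted) means; the function names and the small data set are hypothetical, and Gaussian elimination is used here only for brevity, not because it is the method of the program described in the text.

```python
def solve(a, b):
    """Solve the linear system a z = b by Gaussian elimination."""
    n = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        # Partial pivoting: bring the largest remaining entry to the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    z = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(aug[i][j] * z[j] for j in range(i + 1, n))
        z[i] = (aug[i][n] - tail) / aug[i][i]
    return z


def weighted_least_squares(x, y, w):
    """Equation (9): B = (x' DI x)^-1 x' DI y with DI = diag(w)."""
    k, n = len(x[0]), len(y)
    xtdx = [[sum(w[i] * x[i][p] * x[i][q] for i in range(n)) for q in range(k)]
            for p in range(k)]
    xtdy = [sum(w[i] * x[i][p] * y[i] for i in range(n)) for p in range(k)]
    return solve(xtdx, xtdy)


# Hypothetical effect-coded rows for one three-level variable, with y
# constructed exactly as 2*x1 - 1*x2 so the true coefficients are known.
x = [[1, 0], [0, 1], [-1, -1], [1, 0], [0, 1]]
y = [2 * r[0] - r[1] for r in x]
w = [1.0, 4.0, 2.0, 1.0, 3.0]
b_hat = weighted_least_squares(x, y, w)
print([round(v, 6) for v in b_hat])
```

Setting every weight to one gives the ordinary least squares estimates of the first iteration.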


It is a well known result (see Reference 13) of least
squares estimation that the covariance matrix for the
parameters B(.) estimated is proportional to M, which is the
inverse of (x'DIx). The covariance matrix is s²M where s²
is the regression residual sum of squares divided by its
degrees of freedom. But the computer does not compute M.
(x'DIx) is converted to a correlation matrix by removing the
standard deviation values given in the "Means and Standard
Deviation" part of Table 1 for hunting and fishing. On the
first iteration computation for both hunting and fishing, for
example, (26.32)², (27.02)² etc. appear on the diagonal of
(x'DIx) but these are removed to get the correlation matrix
C given at the top of Table 2. The following shows two
relations of importance:


(10)  (Inverse of M) = SCS' = x'DIx


WHERE S is the row vector of "standard deviations" for the
x's for the appropriate iteration, and


(11)  M = (1/s.d.) IC (1/s.d.)'


WHERE (1/s.d.) is the row vector with each element being the
inverse of the same element in S and IC is the inverse of C.
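
The second relation can be sketched element by element: each entry of M is the corresponding entry of IC divided by the two "standard deviations" involved. The Python fragment below uses only the leading 2 x 2 block for the two income variables, with the first-iteration figures as read from this note (1.26, -.38 and 1.13 for IC, and 26.32 and 27.02 for the s.d. values); the function name is hypothetical.

```python
def rescale_inverse_correlation(inverse_corr, sd):
    """M[i][j] = IC[i][j] / (sd[i] * sd[j])."""
    k = len(sd)
    return [[inverse_corr[i][j] / (sd[i] * sd[j]) for j in range(k)]
            for i in range(k)]


# Leading 2 x 2 block only (INCOME 1 and INCOME 2), first iteration.
inverse_corr = [[1.26, -0.38],
                [-0.38, 1.13]]
sd = [26.32, 27.02]
m = rescale_inverse_correlation(inverse_corr, sd)
for row in m:
    print([round(v, 4) for v in row])
```

Because the rescaling is element-wise, a block of M can be recovered from the matching block of IC without the remaining rows and columns.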


For Iteration 1 for both fishing and hunting one may
note that in terms of the last equation:


(12)  $$(x'\,DI\,x)^{-1} = [1/s.d.] \begin{pmatrix}
1.26 & -.38 & .24 & -.04 & -.14 & .21 & .29 \\
-.38 & 1.13 & -.01 & -.02 & -.05 & -.13 & -.04 \\
.24 & -.01 & 1.67 & -1.00 & -.09 & -.07 & .12 \\
-.04 & -.02 & -1.00 & 1.61 & -.06 & .03 & -.02 \\
-.14 & -.05 & -.09 & -.06 & 1.96 & -.77 & -.75 \\
.21 & -.13 & -.07 & .03 & -.77 & 2.01 & -.72 \\
.29 & -.04 & .12 & -.02 & -.75 & -.72 & 1.99
\end{pmatrix} [1/s.d.]'$$


and using the Iteration 1 s.d. values from Table 1, one has:


(13)  $$(x'\,DI\,x)^{-1} = \begin{pmatrix}
.0018 & -.0005 & .0003 & -.0001 & -.0002 & .0003 & .0004 \\
-.0005 & .0015 & -.0 & -.0 & .0 & -.0002 & .0 \\
.0003 & -.0000 & .0016 & -.0013 & .0 & .0 & .0014 \\
-.0001 & -.0 & -.0013 & .0027 & -.0001 & .0 & .0 \\
-.0002 & -.0 & -.0001 & .0 & .0027 & -.0011 & -.0011 \\
.0003 & -.0002 & .0 & .0 & -.0011 & .0029 & -.0010 \\
.0004 & -.0 & .0014 & .0 & -.0011 & -.0010 & .0028
\end{pmatrix}$$


This is only shown to 4 decimals but six significant figures
were used in actual computation.
TABLE 2


THE FULL CORRELATION MATRIX
AND THE INVERSE OF THE CORRELATION
USED IN ESTIMATING THE REGRESSION COEFFICIENTS
GIVEN IN TABLE 1


Full Correlation Matrix Giving Correlations Between
All Independent and Dependent Variables


1-0: Full Correlation Matrix Used in
First Iteration Computations

            FISHING    HUNTING    INCOME     INCOME
               0          0          1          2
FISHING 0    1.0
HUNTING 0    0.37058    1.0
INCOME 1    -0.05115   -0.11747    1.0
INCOME 2     0.03686    0.06794    0.28862    1.0
URBAN 1     -0.13626   -0.00746   -0.19660   -0.02572
URBAN 2     -0.01882    0.05220   -0.09563    0.00943
AGE 1        0.19061    0.21162    0.12346    0.10232
AGE 2        0.16562    0.21074   -0.23054    0.09472
AGE 3        0.11979    0.18283   -0.26111    0.05287

            URBAN      URBAN      AGE        AGE        AGE
              1          2          1          2          3
URBAN 1      1.0
URBAN 2      0.61507    1.0
AGE 1        0.11825    0.10514    1.0
AGE 2        0.11135    0.07753    0.63012    1.0
AGE 3        0.06117    0.05680    0.61940    0.62916    1.0


1-1: First Iteration C**-1 Matrix for Hunting
and Fishing

            INCOME     INCOME     URBAN      URBAN
              1          2          1          2
INCOME 1     1.25901
INCOME 2    -0.37745    1.12887
URBAN 1      0.23854   -0.00673    1.67108
URBAN 2     -0.04053   -0.02474   -0.99660    1.61452
AGE 1       -0.14297   -0.04776   -0.09181   -0.05987
AGE 2        0.20741   -0.13470   -0.06921    0.03039
AGE 3        0.29447   -0.04209    0.11743   -0.02205

            AGE        AGE        AGE
              1          2          3
AGE 1        1.95767
AGE 2       -0.77463    2.00615
AGE 3       -0.75101   -0.71860    1.98048


1-2: Second Iteration C**-1 Matrix for Fishing

            INCOME     INCOME     URBAN      URBAN
              1          2          1          2
INCOME 1     1.48289
INCOME 2    -0.19112    1.07480
URBAN 1     -0.16122    0.12685    2.03237
URBAN 2      0.05679   -0.04735   -1.24041    1.67588
AGE 1        0.00896   -0.12623    0.24138   -0.17243
AGE 2        0.35306   -0.15277    0.08812   -0.01102
AGE 3        0.47955   -0.01466    0.16574   -0.01409

            AGE        AGE        AGE
              1          2          3
AGE 1        4.17277
AGE 2       -2.01214    4.24506
AGE 3       -1.69177   -1.59875    4.02737


2-2: Second Iteration C**-1 Matrix for Hunting

            INCOME     INCOME     URBAN      URBAN
              1          2          1          2
INCOME 1     1.27098
INCOME 2    -0.35399    1.12045
URBAN 1      0.20855    0.01114    1.70824
URBAN 2     -0.01738   -0.03822   -1.05457    1.67592
AGE 1       -0.10803   [illegible] -0.05452   -0.08860
AGE 2        0.24050   -0.15088   -0.04705    0.01560
AGE 3        0.31928   -0.04290    0.12654   -0.02587

            AGE        AGE        AGE
              1          2          3
AGE 1        2.23492
AGE 2       -0.92815    2.30860
AGE 3       -0.88856   -0.86127    2.28141


But now, to finish with the matter of parameter
estimation, note that the parameters obtained for the
different computations using Equation (9) are presented in
the right hand part of Table 1 and could be plotted, as were
the more detailed analysis results shown in Figure 1. One
will also note that the estimated standard deviations in the
B(.)'s are given so that one may have a feel for how accurate
the individual coefficients are. Finally, "standard"
regression statistics for the various computations carried
out are presented in Table 3, where one sees that the
percentage of variance explained was not very high. All of
the results were highly significant as indicated by the
F-test results given: F greater than 2.66 has a probability
of less than .01 of occurring by chance.

However, it is not part of the purpose of the authors to
use this paper as a vehicle to discuss the regression
results, though some points are raised in the "Discussion"
section of the paper. The important result in terms of this
paper is that in making an estimate of the number of Quebec
males participating in hunting and fishing, a computation
must be made using the B(.)'s in Table 1 and the population
distribution figures given in Table 4. The equation used to
make the estimates given in the next to the bottom line of
Table 4 was:


(14) $E = NU + \sum_{i}\sum_{j} n(i,j)\,B(i,j)$
WHERE the sum is over all appropriate i and j.


But this equation contains B(i,j)'s for which there
are no covariance elements in M. Furthermore, the elements
in M are for observations about their mean. But consider
the following, which is derived from the well-known
relation shown in the last line of Table 1:


(15) $E = N\left(\bar{y} - \sum_{i}\sum_{j} B(i,j)\,\bar{x}(i,j)\right) + \sum_{i}\sum_{j} n(i,j)\,B(i,j)$


(16) $E = N\bar{y} + \sum_{i}\sum_{j} B(i,j)\left(n(i,j) - N\bar{x}(i,j)\right)$
WHERE if weighted regression results are used, the y mean
and the x(i,j) means must be the appropriate weighted means.


Now the measurements are about the means, but the fact
that $B(i,.) = -\sum_{j \neq .} B(i,j)$, the sum being over j
not equal to ".", must be introduced to remove the B(i,.)'s
from the equation. This results in:


(17) $E = N\bar{y} + \sum_{i}\sum_{j \neq .} B(i,j)\left[\left(n(i,j) - n(i,.)\right) - N\left(\bar{x}(i,j) - \bar{x}(i,.)\right)\right]$


In matrix notation one can write:
(18) $E = N\bar{y} + mB$


WHERE m is the vector with elements
$n(i,j) - n(i,.) - N\left(\bar{x}(i,j) - \bar{x}(i,.)\right)$ and


B = (B(1,1), B(1,2), B(2,1), ... etc.)
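
The step from Equation 16 to Equation 18 can be checked
numerically. The sketch below is only an illustration: every
number in it is hypothetical (none comes from Tables 1 or 4),
and it assumes that the effects of each variable sum to zero
so that the omitted "." level can be eliminated.

```python
# Hypothetical two-variable illustration; none of these numbers
# come from the Note. The omitted "." level is listed last.
N = 1000.0                                            # target population
y_bar = 0.30                                          # sample mean of y
n = {1: [400.0, 350.0, 250.0], 2: [600.0, 400.0]}     # counts n(i,j)
x_bar = {1: [0.35, 0.40, 0.25], 2: [0.55, 0.45]}      # sample means x(i,j)
b_free = {1: [0.05, -0.02], 2: [0.04]}                # B(i,j) for j != "."


def full_effects(free):
    """Append B(i,.) = -(sum of the other effects)."""
    return free + [-sum(free)]


def estimate_eq16():
    """E = N*ybar + sum of B(i,j) * (n(i,j) - N*xbar(i,j)) over all levels."""
    total = N * y_bar
    for i in n:
        b = full_effects(b_free[i])
        total += sum(bj * (nj - N * xj)
                     for bj, nj, xj in zip(b, n[i], x_bar[i]))
    return total


def m_vector():
    """Elements n(i,j) - n(i,.) - N*(xbar(i,j) - xbar(i,.)) for j != "."."""
    m = []
    for i in n:
        n_ref, x_ref = n[i][-1], x_bar[i][-1]
        for nj, xj in zip(n[i][:-1], x_bar[i][:-1]):
            m.append((nj - n_ref) - N * (xj - x_ref))
    return m


def estimate_eq18():
    """E = N*ybar + m.B using only the free coefficients."""
    b = [bj for i in n for bj in b_free[i]]
    return N * y_bar + sum(mj * bj for mj, bj in zip(m_vector(), b))
```

Both functions return the same estimate for these
hypothetical inputs, which is the point of the substitution.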


Regardless of whether one uses the matrix equation just
specified or the other form of the equation, one has an
estimable function as defined earlier in the paper. What is
more, since one also has an estimated covariance matrix for
the B(.)'s, one could use the formula for estimating the
variance in E, except that there is variance in a prediction
due to variance in the y mean, and the parameter covariance
matrix only interrelates the B(.)'s. But this is because
the B(.)'s were computed in a space from which the variance
related to y has been extracted, so the appropriate
covariance matrix to use with the estimable function is:


(19) $s^2 V = \begin{pmatrix} s^2 / \sum_i W(i) & 0 \\ 0 & s^2 M \end{pmatrix}$


WHERE, as before, s² is an estimate of error variance.


Then by the standard formula for the variance in an
estimable function (see Reference 13):


(20) $\mathrm{Var}(E) = s^2 N^2 / \sum_i W(i) + s^2\, m M m'$

For iteration 1, the sum of the W(i) is 1271 (see Table 1,
column 6 or 10) and the n's are from Table 4.


Using the X(.)'s from column 4 of Table 1 and the n's
from Table 4, one notes that with population figures in
thousands, for hunting or fishing first iteration:


m = (-146, -117, 263, 178, 138, 228, 288)
WHERE m is the vector of m(i,j)'s.


Now, the above m is appropriate for both first
iteration hunting and first iteration fishing because the
means used in computing it do not depend on the dependent
variable. For example, the first element in "m" has to do
with the first level of the first variable, m(1,1). From
Table 4, n(1,1) = 685 while n(1,.) = 485. The related means
from column 4 of Table 1 are .3753 and .2006 so that:


m(1,1) = 685 - 485 - 1980 (.3753 - .2006)
       = -146
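
The arithmetic for a single element of "m" can be sketched as
follows. The function name and argument names are illustrative
only; the numbers are the ones quoted above from Tables 1 and
4 (populations in thousands).

```python
def m_element(n_level, n_reference, population, mean_level, mean_reference):
    """n(i,j) - n(i,.) - N * (xbar(i,j) - xbar(i,.))."""
    return (n_level - n_reference) - population * (mean_level - mean_reference)


# First level of the first variable, first iteration.
m_11 = m_element(685, 485, 1980, 0.3753, 0.2006)
```

Rounded to the nearest thousand people this gives the first
element of the vector m.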


Actually the difference in the means, .3753 - .2006 =
.1747, can be read from Table 1 in columns 6 or 10. This
is important because to obtain the vector "m" for a weighted
analysis, Iteration 2, one should not use the means from
column 4 of Table 1 because these are unweighted. Relevant
weighted means only occur in columns 8 and 12 so one has:


weighted y mean for hunting (col. 12) = .366
weighted y mean for fishing (col. 8) = .0455
weighted x(1,1) mean minus x(1,.) mean for hunting
(col. 12) = .24856
weighted x(1,1) mean minus x(1,.) mean for fishing
(col. 8) = .-90883


The values just cited, and not the columns 4, 6 and 10
values, must be used in computing "m" for a second iteration.
So, for the second iteration hunting with numbers in
1000's one has:


      685 - 485 - 1980 ( .249) = -292
      810 - 485 - 1980 ( .196) =  -64
     1093 - 554 - 1980 ( .145) =  252
m' =  333 - 554 - 1980 (-.216) =  207
      398 - 581 - 1980 (-.272) =  351
      448 - 581 - 1980 (-.290) =  441
      553 - 581 - 1980 (-.269) =  504


For fishing or hunting first iteration working with
populations in 1000's, one has:


(21) $\mathrm{VAR}(E) = (.381)^2 \left( (1980)^2/1271 + 218.32 \right)$


standard deviation in E = 21.89


WHERE the .381 is the value of s obtained from the standard
error row of Table 3 and the 218.32 comes from m'Vm using
the V defined in Equation 13.


The V-matrix for second iteration hunting was computed
using the "standard deviations" given in column 13 of Table
1 and the last matrix in Table 2 so that for second
iteration hunting:


(22) $\mathrm{VAR}(E) = (1.002)^2 \left( (1980)^2/5956 + 99.02 \right)$


standard deviation in E = 27.55
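
The two variance computations have the same form, so they can
be sketched with one helper. This is only a check of the
arithmetic under the reading that the weight sum is 1271 for
the first iteration and 5956 for the second; the function and
argument names are assumptions made for illustration.

```python
import math


def sd_of_estimate(s, population, weight_sum, quadratic_form):
    """Square root of s**2 * (N**2 / sum of weights + quadratic form in m)."""
    variance = s ** 2 * (population ** 2 / weight_sum + quadratic_form)
    return math.sqrt(variance)


# First iteration (s = .381) and second iteration hunting (s = 1.002),
# populations in thousands.
sd_first_iteration = sd_of_estimate(0.381, 1980, 1271, 218.32)
sd_second_iteration_hunting = sd_of_estimate(1.002, 1980, 5956, 99.02)
```

The results agree with the standard deviations quoted in
Equations 21 and 22 to within rounding of the inputs.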


Now, applying the accuracy formula given in Equation 4
with c = 1.96:


% error in hunting at .05 level = (1.96)(27.55)(100)/774
= 6.98%, where 774 is obtained from Table 4.


% error in fishing at .05 level = (1.96)(21.89)(100)/386 =
11.13%, where 386 is obtained from Table 4.
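
The percentage error computation can be sketched as below.
The function name is illustrative; the formula is the one
applied in the text (c times the standard deviation, as a
percentage of the estimate), with c = 1.96.

```python
def percent_error(standard_deviation, estimate, c=1.96):
    """Half-width of the confidence band as a percentage of the estimate."""
    return 100.0 * c * standard_deviation / estimate


# Estimates (in thousands) are the Table 4 values quoted in the text.
hunting_error = percent_error(27.55, 774)
fishing_error = percent_error(21.89, 386)
```

Hunting comes out near 7% and fishing near 11%: the smaller
the share of the population that participates, the larger
the percentage error for a comparable standard deviation.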

Actually, since one can get the first iteration VAR(E)
for hunting by replacing (.381) by the similar regression s
for hunting of (.471) (see Table 3), one may note that
the first iteration VAR(E) for hunting =
(.471)²(21.89)²/(.381)², or in standard deviation terms for
hunting first iteration the standard deviation in E = 27.06.
The estimate would have been just as accurate if the second
iteration had not been carried out. This point is pursued
subsequently (in the discussion).

Noting that first iteration estimates MAY BE AS
ACCURATE AS second iteration estimates, one may wonder how
accurate estimates would be if it was assumed that
socio-economic factors need not be considered and thus the
participation figures shown in the last line of Table 4 were
simply computed by E(T) = mean of the dependent variable
times N, the total number of people being considered.

Now if one is interested in the accuracy of these
estimates then one is assuming that all people have the same
probability, the y mean, of participating. One can ignore
the M matrix and, by the above, compute:


$E = N\bar{y}$


$\mathrm{VAR}[E] = [N]\,(S^2/1271)\,[N]' = N^2 S^2/1271 = (N^2/1271)(TSS/1270)$


WHERE S² is now TSS, the total sum of squares, divided by
1270 because only 1 degree of freedom is involved in
computing the mean of the dependent variable. Actually,
because Y has a binomial distribution one may just as well
use:


$\mathrm{VAR}(E) = N^2\,\bar{y}(1-\bar{y})/1271$


This is based on the variance in the mean of a sample
of size n from a (0-1) variable being estimated by
p(1-p)/n, and on the fact that K times a random variable
with variance x has variance K²x.


Using either formula one obtains:


standard deviation hunting = Var(E for hunting)**(1/2)
= (1980)(.391(1-.391)/1271)**(1/2) = 27.11


standard deviation fishing = Var(E for fishing)**(1/2)
= (1980)(.194(1-.194)/1271)**(1/2) = 21.97
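
The binomial shortcut can be verified with a few lines. The
function name is an assumption made for illustration; the
inputs are the participation means, sample size (1271) and
population (1980 thousand) used in the text.

```python
import math


def sd_from_mean_only(population, p, sample_size):
    """N * sqrt(p * (1 - p) / n): standard deviation of N times a sample mean."""
    return population * math.sqrt(p * (1.0 - p) / sample_size)


sd_hunting = sd_from_mean_only(1980, 0.391, 1271)
sd_fishing = sd_from_mean_only(1980, 0.194, 1271)
```

These values differ from the regression-based standard
deviations only in the third significant figure.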


The standard deviations for the estimates made using
the mean do not differ from those obtained when
socio-economic effects are considered except in the third
significant figure. One can reasonably ask: Why should
socio-economic effects be considered if accuracy of
estimates is not increased?


DISCUSSION 


Given the results presented in the last section, the
first question which seems appropriate for discussion is the
one just raised: "Why consider socio-economic effects?" As
indicated by the results presented, the predictions made
using the general mean are as accurate or more accurate than
projections made considering socio-economic effects. Why
bother with socio-economic effects? Using the general mean
to make estimates is based on the assumption that
socio-economic effects remain constant and on the assumption
that the population for which an estimate is made is the
same as the population for which the general mean was
obtained. If in fact the desire is to make an estimate for a
population based on a biased sample, that is, on a sample
that for some reason is distributed differently than the
population for which estimates are being made, it is
necessary to take into account the differences in
distribution as reflected by the kinds of effects computed
and considered using the model presented here. If this is
not done, the estimates made using the general mean may
appear to be very reliable but they may not be valid in the
sense that they are biased. The bias can be very substantial
compared to the error that may be related to making an
estimate. Obviously, if the general mean is used to make an
estimate for a population that is much more aged than the
population on which the general mean is based, then
participation in most activities would be overpredicted. The
important factor in this error is the bias due to failure to
consider age, rather than any inaccuracy in the general mean
which was used.
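
The size of such a bias is easy to illustrate. The sketch
below uses entirely hypothetical participation rates and age
distributions (none is taken from the CORD data) to compare
an estimate made from the general mean with one that allows
for the age mix of the target population.

```python
# Hypothetical participation rates by age group (not from the Note).
rates = {"young": 0.50, "old": 0.20}
sample_shares = {"young": 0.60, "old": 0.40}   # age mix of the survey sample
target_shares = {"young": 0.30, "old": 0.70}   # age mix of the target population
target_size = 1000.0                           # people in the target population

# General mean observed in the sample.
general_mean = sum(rates[g] * sample_shares[g] for g in rates)

# Estimate ignoring age versus estimate using the target age mix.
estimate_from_mean = target_size * general_mean
estimate_by_group = target_size * sum(rates[g] * target_shares[g] for g in rates)
bias = estimate_from_mean - estimate_by_group
```

With these made-up figures the general mean overpredicts
participation by 90 people in 1000, far more than the
sampling error of either estimate would suggest.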

For the data used, the bias involved in using the mean
to make predictions is so small that one can accept that
differences are "strictly statistical". As one can see from
Table 4, the differences in predictions made using the mean
and ones considering socio-economic factors are under 1.56
times the appropriate standard deviation in E(T).

Actually, the socio-economic characteristics would play
a more important role if they contributed more to the R² for
the model. However, the fact is that they do not contribute
much, and this is simply a reflection of the fact that
socio-economic characteristics are not a terribly strong
determinant of people's participation in activities. They
are still a determinant which can be taken into account to
improve predictions. Using them guards against differences
between a sample and the group for which estimates are made
resulting in a bias.

Turning to another point, the fact that sample size
gives a good estimate of the variance of estimates in which
socio-economic effects have been considered means that
sample sizes needed for certain accuracy can be computed.
Consider for example that the maximum % error acceptable
(AE) at a given level of confidence is:


$AE = 100\,c\,\left(N^2\,\bar{y}(1-\bar{y})/\text{sample size}\right)^{1/2} / E(T)$


WHERE all variables are as defined earlier.


From the equation above, given that E(T)
will be close to pN in value, and taking the y mean equal
to p:


$\text{Sample size} = 100^2 c^2 N^2\,\bar{y}(1-\bar{y}) / \left((AE)^2 p^2 N^2\right) = 100^2 c^2 (1-\bar{y}) / \left((AE)^2 p\right)$


So, if one desires AE = ±5% accuracy at the .05 level
(c = 1.96), one is not concerned with the size of population,
N, for which estimates are to be made, but must also accept
that the accuracy is only needed for p greater than some
value, say, .1. Then one has:


$\text{Sample size} = 100^2 (1.96)^2 (1 - .1) / \left((5)^2 (.1)\right) = 3600 \times 1.96^2 = 13{,}829.76$

The reader may recognize the formula just introduced as
the well known one used in selecting sample sizes in surveys,
but should recall that its relevance here arises because of
the fact that correcting estimates for the socio-economic
characteristics of populations considered does not result in
estimates which are much more "accurate" if socio-economic
effects need not be used to correct for bias. Certainly if
the population for which estimates are made differs
drastically from the population on which an estimation model
is based, the accuracy of estimates may not be as good as
implied by the formula even though the correct sample size
was used in estimating regression coefficients. In line with
the points made earlier, one can see that sample size has
been computed on the assumption that considering S-E effects
does not affect the variance that would occur. However, if
models with higher R² can be developed, then a less
conservative estimate of necessary sample size could be
used (for other sample size considerations see the appendix
to TN 29).

Moving to a rather more technical matter, one may have
thought it was a matter of not over-illustrating a point
which resulted in the choice to present material on
iteration 1 for fishing and on iteration 2 for hunting.
Actually, the choice was a very conscious one, based on
the fact that one can say in an unambiguous way that the
model arrived at through iteration 2 on fishing is an
invalid model. At the end of the review material for
Chapter VII of this volume there is a brief introduction to
the matter of how to determine whether one can accept the
kind of analysis of variance model used here as structurally
valid. The point made there is that when the iteration 2
run is made, and one supposedly corrects for the variance in
the observations, then the predictions of each observation
should be distributed in a well defined way. When one
computes an average value of the residual, one should expect
this average value to come out close to 1. Now if one looks
at Table 3, one sees that the average value of the residual
for iteration 2 fishing, there called standard error of the
residual, is 1.8. One can compare this value of 1.8 with the
value of 1.002 for the second iteration of the hunting
model. As described in the review of Chapter VII, the
residual sum of squares values should be distributed as
chi-squared, so the approximation shown in the first
equation can be used to get the significance test shown in
the two equations which follow it:


$(2\chi^2)^{1/2} - (2(NO - NP) - 1)^{1/2} \approx \text{normal}(0,1)$


WHERE NO = the number observed and
NP = the number predicted.


For iteration 2 fishing:

$(2(4104))^{1/2} - (2(1263) - 1)^{1/2} \approx 40$


which is much greater than the 1.96 deviations that have
less than .001 probability of occurring. On the other hand,
for iteration 2 hunting one has:


$(2(1269))^{1/2} - (2(1263) - 1)^{1/2} < .2$


and one can accept the hunting "model" as valid because a
deviation greater than .2 must be accepted as one of the
many values having a high probability of occurring by
chance.
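
The two test statistics can be reproduced directly. The
function name is an assumption made for illustration; the
formula is the normal approximation to chi-squared quoted
above, with 1263 degrees of freedom.

```python
import math


def chi_squared_normal_deviate(chi_squared, degrees_of_freedom):
    """sqrt(2 * chi2) - sqrt(2 * df - 1), approximately normal (0,1)."""
    return math.sqrt(2.0 * chi_squared) - math.sqrt(2.0 * degrees_of_freedom - 1.0)


z_fishing = chi_squared_normal_deviate(4104, 1263)   # iteration 2 fishing
z_hunting = chi_squared_normal_deviate(1269, 1263)   # iteration 2 hunting
```

The fishing deviate is about 40 standard deviations, while
the hunting deviate is well under .2.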

One should recognize that accepting the hunting model
is not a strong test. There is a distinct possibility of a
type 2 error, that is, accepting the hunting model when
there are still structural problems with it. Two points are
important in this regard. Firstly, results presented in TN
29 show that the hunting model is probably not valid because
the effect of supply on participation is not considered in
the model. Yet it is noted in TN 29 that the effect of
supply on participation in hunting (in fact for all 19
activities for which information was available in the data
used in this study) could not be demonstrated with data on
4,000 people. There it is shown that (1) supply distribution
is important in explaining participation in hunting and
fishing and (2) all that was needed to demonstrate this was
a larger data set. In this other technical note the great
importance of considering supply factors if large biases in
estimates are to be avoided is illustrated. Secondly, a
point has been made here of the value of using weighted
regression. Work carried on in revising this note was
critical in clarifying that there was less reason to use
weighted regression than had been thought. This matter is
pursued in an addition to the review of Chapter VII that
describes why the chi-squared tests of model structural
adequacy presented earlier can be made without having the
results of a weighted regression.

Finally, from a planning, management and research
planning perspective the results presented here must be
considered very important. It is not hard to find planning
reports or documents prepared for management where
projection techniques less sophisticated than the one
described here are applied with no mention of the fact that
there may be some error in the results which are produced.
There is often great inefficiency in carrying out research
because the proper questions are not asked about how
accurate results need to be, and thereby the question does
not arise as to whether projections could be made by a
method like the one described here rather than going through
the laborious, expensive and time consuming job of primary
data collection. How many times is it that data are
collected but are not applied in a certain situation because
the universe for a study was not quite right, because the
data are a couple of years out of date, etc.? So, a new
survey is carried out, but still there is no adequate
consideration of what accuracy is going to be gained by
having the new survey or even if the new survey will give
results which are as accurate as are needed by planners or
managers. Often budget dictates sample size.

Very often researchers work under the misimpression
that planners and managers need results which are far more
accurate than what they actually need. Accurate figures are
needed on how many people actually do use facilities so that
use trends etc. may be plotted to justify program
expenditures, but in terms of planning for traffic, setting
up adequate facilities to begin operation of a park, etc.,
there is no need to be accurate within plus or minus 10%.
Even in many cases much lower levels of accuracy are quite
adequate. Most road designs have a leeway of about 50%.

In part the points made above relate to using the kind
of model presented in designing good research. A part of
designing good research is not carrying out research when it
will not produce results, and not going through extra labor
when rather simple computations will give adequate results
for planning or management decision making. In this regard
it is important to note that the kind of results presented
here are important in making a decision as to whether
estimates should be made or a survey should be carried out.
What level of accuracy can be achieved given the funds that
are available must be considered. Obviously, a $5,000 data
collection effort is not a good expenditure for making a
decision about carrying out a $5,000 project. To reiterate
a point, if a researcher could elicit a good feel for
necessary accuracy for management and planning, he would
usually find out that a very small scale study of the likely
use of a facility to be built is quite sufficient to
provide information on which decisions should be taken.
Furthermore, whether millions or thousands of dollars are
involved, the question which must ultimately be asked is
whether a survey which goes beyond a certain scale is
producing any more worthwhile information than a smaller
study.

Of particular importance in relation to the last point
is that the production of more accurate results by bigger
and better surveys doesn't occur beyond a point. Till now,
in this paper, accuracy has been discussed as if the X(.)'s,
N, n(.)'s etc. were known and constant. But one of the
greatest problems faced in making projections is that one
cannot even project total population accurately, much less
project the number of people who will have certain levels of
income or be in cities or households of a given size (see TN
12). There is no point in paying a high price to have a
model which gives ±1% results under assumptions about what
the total population will be in 19 years when the prediction
of this has a high probability of being in error by ±10%.
At most the aim should then be to have a model that gives
estimates to around ±5%. That is, unless data for the "more
accurate" model must be collected to meet some other need.
CONCLUSION


The results presented in this article show it is not a
terribly difficult matter to obtain estimates of the
accuracy of predictions that can be made using the kind of
analysis of variance equations that have been used in
recreation research in the past. It has been pointed out
that the reason for getting involved in the use of such
equations is that, at least in theory, results from National
Surveys can be used to make predictions for particular
geographical areas. This means that the mammoth cost
of doing detailed surveys in small areas might be avoided.

However, a word of caution has been added in this
regard. This was because of the known difficulties in
developing structurally adequate models.

From another perspective the importance of being able
to make accuracy estimates relates to the researcher being
able to indicate to the planner or manager for whom he is
doing research the importance of knowing the general level
of accuracy needed for certain purposes. Often when data
are "needed" there is no choice of carrying out a survey (or
of gathering any primary data) to determine what the
participation level of a population is because there is no
time. Estimates must be made. There is however a choice of
how sophisticated one should get in terms of computations
that are carried out to arrive at estimates. It is going to
be a matter of experience to know when interaction effects
can be ignored and when supply factors such as those
discussed in TN 29 can be ignored, and a model such as that
used here still be employed and be relatively accurate.
But still, using an analysis of variance model one cannot
expect to exceed the accuracy in E estimated by the methods
given here. If this accuracy is not acceptable for planning
and/or management purposes then the nature of the problem
faced can be stated clearly: it must be accepted that the
decision will have to be made with little or no information
in which one can put faith if there is no time to collect
data.


Finally, possibly the most important consequence of the
research presented here is the rather clear conclusion that
the level of accuracy that can be obtained in predictions is
largely determined by the percentage of the population that
participates in an activity, so that a good guideline has
been provided for survey design. If researchers can speak to
planners and managers and find out what accuracy they really
need in information about participation and activities, they
have a good guide to use in deciding what sample size is
needed in surveys. They know that predictions will only get
worse when the results attained in the survey are applied to
populations which differ drastically from the population on
which data were collected.




TN 6 CHAPTER IV page 292 


CORD STUDY TECHNICAL NOTE 20 


ANALYSIS OF VARIANCE MODELS WITH INTERACTION EFFECTS 
AND THEIR POTENTIAL ROLE IN UNDERSTANDING 
AND PREDICTING RECREATION BEHAVIOUR 


J. Arseneault, A. Dionne, J. Beaman, M. Renoux


ABSTRACT 


This paper focuses on certain issues that are important
in understanding the value of analysis of variance models in
recreational research, particularly whether a simple
analysis of variance model is structurally sound and whether
its use may lead to errors in estimating behaviour.

The results of the Michigan Automatic Interaction
Detector (AID) program and of regression analyses of the
1969 and 1972 Canadian Outdoor Recreation Demand Study
National Survey Data on Canadians' Participation in Outdoor
Activities are included to illustrate how interaction
effects affect analysis when trying to explain
participation in an activity using socio-economic variables
as the independent variables. The results of the analysis of
simulated data are presented to show the degree to which AID
explains data as compared to using a correct analysis of
variance model. Conclusions are that:


(1) significant interaction effects exist when one tries to
explain Canadian residents' participation in outdoor
activities in terms of their socio-economic
characteristics;


(2) a simple main effect analysis of variance model is not
adequate to explain most recreational behaviour;


(3) the use of the AID analysis program gives one an idea of
the magnitude of the sum of squares associated with
interaction effects but its use does not provide a
systematic way of identifying these effects; and


(4) repeated application of traditional regression methods
to identify interactions does result in "finding"
interaction terms that improve a simple linear model but
the improvement achieved by introducing ten interaction
terms is usually only the explanation of 1/5 of the
interaction sum of squares that should be explained;
what is more, the possibility of type II errors in
selecting interaction effects raises serious questions
about the use of this approach to improve (define
appropriate) models.
INTRODUCTION AND PURPOSE


The best known use of analysis of variance modelling
techniques for predicting participation in recreation is
attributable to Mueller and Gurin (see Reference 9). More
recently other examples of applying this technique have
become frequent. In the Canadian Outdoor Recreation Demand
(CORD) Study, Hendry (see Reference 8) suggested using
analysis of variance (in the form of dummy variable
analysis); this technique was actually pursued using a 1969
National Survey to develop differentials related to age,
sex, family status, etc. Subsequently, TN 12 of the Canadian
Outdoor Recreation Demand Study showed how the
socio-economic differentials for 26 activities could be used
in making projections. Renoux (Reference 11, 12) used this
methodology to develop a hunting model and other models for
the Province of Quebec.

An important result of previous CORD Study work and
Renoux's research has been the identification of a number of
problems associated with the use of analysis of variance
techniques. The purpose of this paper is to focus on one of
these problems in particular. The structure of the note
reflects somewhat the history of the research on the problem
that interaction effects pose in developing analysis of
variance or other regression models. It is convenient to
present some definitions and then describe some early
research from which no results are presented. This research
led naturally into other "successful" steps in analyzing
data to find the existence of interaction effects and
finally led to the few positive results that are presented
in this paper. Through this strategy of showing the
background of research the authors believe that the reader
will get the best "feel" possible for the multitude of
problems involved in developing and improving the kind of
models of concern here.

The data used in the various analyses presented are not
described in any detail but are documented in Volume III.


DEFINITIONS

The term "a simple analysis of variance model" is used
here to refer to a model in which participation or
non-participation in an activity can be expressed as the
cumulative sum of socio-economic effects. It is the kind of
model defined and described in detail in TN 12 (see also TN
15). In equation form such a model is:


(1) $Y(i,J,K,L,M,Q) = U + B(1,J) + B(2,K) + B(3,L) + B(4,M) + B(5,Q) + C(i)$


WHERE Y(i,J,K,L,M,Q) is 0 or 1 depending on whether
individual i participated in the activity being
considered; J,K,L,M,Q refer to the socio-economic
categories that person i is in, where J refers to a
category of the socio-economic variable 1, K refers to
a category of the socio-economic variable 2, say Age;
L refers to a category of the socio-economic variable
3, etc. U is a general level of participation in the
activity under consideration.


B(1,J) is the effect of being in category J of the
socio-economic variable 1, B(2,K) is the effect of being in
category K of the socio-economic variable 2, and
B(3,L) is the effect of being in category L of the
socio-economic variable 3, etc. Finally C(i) is an
error term.
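
A minimal sketch of how a prediction is formed under such a
main-effects model is given below. The variables, category
names and effect values are all hypothetical (they are not
the differentials of TN 12); the sketch only shows that the
predicted participation is the general level plus one effect
per socio-economic variable.

```python
# Hypothetical general level and effects; not values from TN 12.
U = 0.30
effects = {
    "income": {"low": -0.05, "middle": 0.01, "high": 0.04},
    "age": {"young": 0.10, "old": -0.10},
}


def predicted_participation(person):
    """U plus the effect of the category the person occupies on each variable."""
    return U + sum(effects[variable][person[variable]] for variable in effects)


example = predicted_participation({"income": "high", "age": "old"})
```

Note that the effect of being old is the same at every
income level, which is exactly the restriction that
interaction terms relax.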


It is possible that the kind of model just defined
over-simplifies the interrelationships between the variables
used to predict behaviour. It is likely that relationships
between age and education exist that help one understand
hunting participation. For example, having a high level of
education and being old may mean something quite different
from having a high level of education and being young
insofar as participation in hunting is concerned. A person
with a high level of education who is old may tend to come
from an urban non-hunting background and thus have a very
low probability of hunting. The "simple model" cannot be
used to reflect the interaction effect between education and
age affecting hunting just described. (TN 27 pursues the
effects of interactions in another context.)

Consider a very simple model involving interactions. It
is assumed that the participation in an activity can be
explained by means of two socio-economic effects and by the
interaction between them. Stated mathematically, individual
behaviour is represented by the following equation:


(2) $Y(i,j,k) = U + A(i) + B(j) + G(i,j) + e(k) = U + B(i,1) + B(j,2) + B(i,j,3) + e(k)$


WHERE Y(i,j,k) is the dependent variable,


B(i,1) is the differential effect of factor one
taken at the level i;


B(j,2) is the differential effect of factor two
taken at the level j;


B(i,j,3) is the differential effect due to the
interaction at the ith level of factor one
of the jth level of factor two;

e is a random error term, and its subscript


k indicates a particular individual.


There are conditions which the B( ) coefficients must
satisfy which the reader may review in a number of sources
(e.g. Reference 10, 12). However, the important point here
is that to have a great deal of freedom in how interaction
effects are defined, the kind of formulation introduced here
can be used. The formulation is such that many unknowns are
usually computed to explain behaviour. U and (m-1), (n-1)
and (m-1)(n-1) values of B( ,1)'s, B( ,2)'s and B( , ,3)'s
respectively must be computed to define the model when all
of the usual constraints on the model parameters are
considered.
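
The parameter count for the two-factor model can be sketched
as follows. The function name is illustrative; the counts
are those stated above for factors with m and n levels.

```python
def two_factor_parameter_count(m, n):
    """U, plus (m-1) and (n-1) main effects, plus (m-1)(n-1) interactions."""
    general_level = 1
    main_effects = (m - 1) + (n - 1)
    interaction_effects = (m - 1) * (n - 1)
    return general_level + main_effects + interaction_effects


# Example with 3 levels of factor one and 4 levels of factor two.
count = two_factor_parameter_count(3, 4)
```

The total always equals m times n, the number of cells in
the cross classification, so the full interaction model has
one free parameter per cell.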

Now if in Equation 2, instead of just having three
types of B( )'s, one wants to have the 10 socio-economic
variables for which effects were calculated in TN 12, then
to have a model like the one just introduced with G(i,j)
type interaction terms between every pair of variables there
must be (10)(9)/2 = 45 such terms. For N variables the
number of terms is (N)(N-1)/2.


Furthermore, just as for the one (i,j) interaction term
there were (m-1) times (n-1) unknown B( , )'s to be
calculated, the need for a "complete" model with 10
variables indicates that to define the model the number of
parameters to be computed is the sum of (n-1)(m-1) over the
45 second order interaction terms. If one has estimated the
effects just referred to, one has not necessarily dealt with
interactions adequately: one can consider three-way
interactions that depend on the values that three
variables in a model take, so there must be the (1,2,3),
(2,3,4), (1,3,4) etc. interactions. If all three-way
interactions between variables are to be considered when
there are N variables in a model, the number of 3-way
interactions is N(N-1)(N-2)/(3)(2)(1), which equals 120 for
N = 10. As one might guess, for each of these there are
(n-1)(m-1)(p-1) interaction B( , , )'s to be estimated,
where n, m and p refer to the number of levels of the three
variables of a particular third order term. So it should be
clear that one cannot simply insert all possible first,
second, third etc. order interaction terms into a model and
proceed to estimate parameters. When very many
socio-economic variables are considered, the number of
coefficients that would have to be estimated is so large
that estimates could not be made on any existing computer. A
more practical consideration is that when large numbers of
parameters are to be estimated and these involve complicated
interactions, the situation readily arises where, even when
a very large data set is available, there are only one or
two people who fit into certain classes on which there must
be information to estimate parameters. What is even more of
a problem is that there are often no people in certain
classes. Still, there must be information if all the
coefficients of a model are to be estimated.
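
The counts quoted above can be checked with a short sketch. It is only an illustration: the assumption that every one of the 10 variables has 4 levels is hypothetical (the text does not fix the number of levels), and the function names are invented for this example.

```python
from math import comb


def interaction_term_count(n_vars, order):
    """Number of distinct interaction terms of a given order among n_vars variables."""
    return comb(n_vars, order)


def parameters_for_term(levels):
    """Free parameters for one interaction term: the product of (levels - 1)."""
    count = 1
    for k in levels:
        count *= (k - 1)
    return count


n_vars = 10
pairs = interaction_term_count(n_vars, 2)    # N(N-1)/2
triples = interaction_term_count(n_vars, 3)  # N(N-1)(N-2)/(3)(2)(1)

# Hypothetical case: every variable has 4 levels (an assumption for illustration only).
levels = 4
second_order_params = pairs * parameters_for_term([levels, levels])
third_order_params = triples * parameters_for_term([levels, levels, levels])

print(pairs, triples, second_order_params, third_order_params)
```

Even under this modest assumption the third order terms alone would call for several thousand coefficients, which is the practical difficulty the paragraph describes.
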


Beaman and Renoux recognized the kind of problem just
described when they began work on the research which
eventually led to this Note. They tried to examine the
differences between the model defined by Equation 1 and a
much more adequate model which potentially takes into
account interaction effects. The more adequate model that
they chose to use was a model defined by the Michigan AID
Computer Program. (It is assumed that the reader knows how
this program works. If this is not the case, reference may be
made to either TN 4 or TN 27, which give examples
of its use. The reader may also refer to the original writings
about the AID Program by Sonquist and Morgan (Reference
16).)

What was done by Renoux and Beaman was to try to determine
which interactions should be considered if a model was to
fit a given set of data. Using the model defined by Equation
1 and using an AID model, predictions for individuals were
written out on magnetic tape and became the focus for
analysis in subsequent computer processing. The idea was
that if cases could be identified where there were large
differences between AID predictions and the analysis of
variance predictions, these would provide a clue as to what
interaction terms would need to be incorporated into a
non-AID analysis of variance model. It was planned that the
difference between these two predicted values could (for
example) be examined by looking at its average value
for various cross classifications of socio-economic
variables.

Unfortunately, much work led to few results. As one
might guess from the variance-explained values that have
been reported in Table 1, the differences between AID
predictions and analysis of variance predictions were highly
variable. After much work, what was produced was a
collection of AID analyses which showed that the Michigan
AID Program explained much more variance than the simple
model defined by Equation 1. There was almost no success in
finding out which interaction terms should be introduced
into the simple model to explain more variance.

The reader may find it interesting to look at Table 1,
which gives quite a comprehensive set of percentage
of variance-explained values in which AID results on the
1969 CORD Study National Survey Data on Canadian Residents'
Participation in Outdoor Activities are compared with the
simple analysis of variance results. Analysis of variance
results for 1972 are also given for reference purposes.
Still, the R² values for 1972 ANOVA are comparable with the
1969 values because they were produced for persons 18 years
of age and over, which was the sampling universe in the 1969
study. Also they were produced for participation versus
non-participation as the dependent variable and with the same
independent variables. What one should notice from Table 1 is
that, as a rule of thumb, use of AID resulted in explaining
about twice as much variance as the use of analysis of
variance (ANOVA). If one looks at the list of activities in
column one of Table 1, one first sees swimming participation
in a city and may note that for males 17% of the variance was
explained using AID. The use of ANOVA resulted in 13% of the
variance being explained. This is obviously not a 2 to 1
ratio but, then, for bird-watching for males, one notices
the balance shifting, as with ANOVA 2.5% of the variance is
explained compared to 14% for AID. For outdoor photography
for males, there is 12% for AID and 6% for ANOVA, which is
very close to the 2 to 1 ratio suggested earlier. A similar
ratio holds for male use of Historic Sites, and when one
examines R² for the female use model for Historic Sites one
sees that there is a ratio of .08 to .04, or 2 to 1. The
results continue in a similar manner. In the odd case the
ANOVA model is not too much "poorer" than the AID model,
while most results show that there was a great deal of
variance to be explained which the ANOVA model does not
explain.


TABLE 1

COEFFICIENTS OF DETERMINATION, R², FOR PARTICIPATION IN
EACH ACTIVITY IN 1969 AND 1972, FOR MALES AND FEMALES,
OBTAINED THROUGH ANOVA AND AID

                                     Males                  Females
                              1969    1969   1972    1969    1969   1972
                              AID     ANOVA  ANOVA   AID     ANOVA  ANOVA

PARTICIPATION IN CITY
 1. Swimming                 (.176)   .132    ?     (.194)   .139   ---
 2. Nature/Bird Watching     (.142)   .024    ?     (.075)   .016   ---
 3. Outdoor Photography      (.119)   .062    ?     (.106)   .053   ---
 4. Visit Historic Sites     (.117)   .062    ?     (.084)   .042   .036
 5. Visit Other Parks        (.108)   .060    ?     (.108)   .049   ---
 6. Drive for Pleasure       (.086)   .050    ?     (.077)   .042   .045
 7. Sightseeing Urban        (.105)   .066    ?     (.090)   .057   .025
 8. Toboggan/Sledding        (.191)   .066    ?     (.165)   .039   ---
 9. Picnicking               (.111)   .035    ?     (.102)   .049   .019
10. Walk/Hiking              (.146)   .091    ?     (.125)   .079   .059
11. Golfing                  (.117)   .069    ?     (.151)   .034   ---
12. Ice Skating              (.208)   .159    ?     (.203)   .146   .059
13. Bicycling                (.171)   .081    ?     (.194)   .096   .102

PARTICIPATION IN COUNTRY
14. Swimming                 (.202)   .155    ?     (.168)   .144   ---
15. Nature/Bird Watching     (.075)   .014    ?     (.088)   .032   ---
16. Visit Historic Sites     (.096)   .056    ?     (.101)   .058   .065
17. Visit Other Parks        (.092)   .052    ?     (.094)   .063   ---
18. Drive for Pleasure       (.081)   .057    ?     (.095)   .060   .071
19. Sightseeing              (.101)   .073    ?     (.128)   .088   .063
20. Toboggan/Sledding        (.180)   .088    ?     (.164)   .087   ---
21. Picnicking               (.114)   .082    ?     (.133)   .096   .100
22. Walk/Hiking              (.121)   .065    ?     (.099)   .061   .063
23. Golfing                  (.128)   .077    ?     (.141)    ?     ---
24. Ice Skating              (.139)   .056    ?     (.151)   .055   .072
25. Bicycling                (.148)   .046    ?     (.142)   .076   .066

OTHER PARTICIPATION
26. Swimming                 (.280)   .244    ?     (.260)    ?     ---
27. Tent Camping             (.120)   .069    ?     (.088)   .043   .100
28. Trailer Camping          (.061)   .022    ?     (.084)   .082   .036
29. Pickup Camping           (.156)   .019    ?     (.125)   .022   .024
30. Hunting                  (.118)   .084    ?     (.150)   .035   .030
31. Power Boating            (.127)   .088    ?     (.112)   .064   .065
32. Canoeing                 (.146)   .080    ?     (.094)   .036   .066
33. Sailing                  (.131)   .053   .058   (.217)   .032   .061
34. Water Skiing             (.186)   .155   ---    (.215)   .067   ---
35. Nature/Bird Watching     (.068)   .017   ---    (.081)   .029   ---
36. Outdoor Photography      (.122)   .079   ---    (.091)   .055   ---
37. Visit Historic Sites     (.112)   .078   .092   (.101)   .073   .074
38. Visit Other Parks        (.101)   .076   ---    (.105)   .077   ---
39. Drive for Pleasure       (.102)   .066   .069   (.104)   .067   .095
40. Sightseeing              (.112)   .088   .062   (.130)   .099   .060
41. Climbing                 (.095)   .037   ---    (.130)   .029   ---
42. Snow Skiing              (.208)   .125   .090   (.223)   .102   .087
43. Snowmobiling             (.160)   .103   .131   (.127)   .074   .103
44. Toboggan/Sledding        (.173)   .107   ---    (.158)   .102   ---
45. Picnicking               (.133)   .090   .101   (.148)   .115   .104
46. Walk/Hiking              (.142)   .086   .091   (.114)   .078   .089
47. Golfing                  (.174)   .121   ---    (.141)   .043   ---
48. Ice Skating              (.233)   .193   .176   (.226)   .171   .128
49. Horseback Riding         (.198)   .102   .108   (.213)   .109   .114
50. Bicycling                (.156)   .083   .135   (.182)   .118   .147
51. Tennis                   (.235)   .139   ---    (.266)   .117   ---
52. Fishing                    ---    ---    .084     ---    ---    .053
53. Hunting/Fishing            ---    ---    .104     ---    ---    .058
54. Small Game Hunting         ---    ---    .077     ---    ---    .032
55. Large Game Hunting         ---    ---     ?       ---    ---     ?

? marks a value that is missing or illegible in the source copy.

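
The "about twice as much variance" rule of thumb can be checked against the four cases discussed in the text. The sketch below uses the R² values as read from Table 1 (so bird watching is entered as .024 rather than the rounded 2.5% quoted in the text); the dictionary keys are labels invented for this illustration.

```python
# (AID R^2, ANOVA R^2) pairs for the 1969 survey, as read from Table 1.
r2 = {
    "swimming in city, males": (0.176, 0.132),
    "bird watching in city, males": (0.142, 0.024),
    "outdoor photography in city, males": (0.119, 0.062),
    "historic sites in city, females": (0.084, 0.042),
}

# Ratio of variance explained by AID to variance explained by ANOVA.
ratios = {name: aid / anova for name, (aid, anova) in r2.items()}

for name, ratio in ratios.items():
    print(f"{name}: AID/ANOVA = {ratio:.2f}")
```

The ratios range from well under 2 (swimming) to well over 2 (bird watching), which is why the text presents 2 to 1 only as a rule of thumb.
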
When this situation was recognized the decision was
made to involve other researchers in the attempt to find
interaction effects of the magnitude that (it appeared clear
from the difference between AID and ANOVA analyses) it
should be possible to find. The researchers who took on this
task were confronted with two problems. One was becoming
familiar with CORD Study data, and the other was developing
a strategy for estimating interaction effects that would
explain something like the amount of variance that it seemed
clear was possible to explain by interaction effects. This
thrust of the research effort began with an exploratory
analysis of the CORD Study 1972 National Survey data on
Canadian Residents' Participation in Outdoor Activities,
which data had become available since Renoux and Beaman had
begun their work. The variables used from these data and
their coding are shown in Table 2.

TABLE 2

VARIABLES USED IN ANALYSES REPORTED IN THIS PAPER
1972 CANADIANS' PARTICIPATION IN OUTDOOR ACTIVITIES

                                                      Recoded values
                                                      used in analysis
                                          Original    reported in
Variable  Description of the Variable     Value       Table 3

I.   AGE
          10 to 11 years                     1           1
          12 to 14                           2           1
          15                                 3           1
          16 to 17                           4           1
          18 to 19                           5           1
          20                                 6           2
          21 to 24                           7           2
          25 to 29                           8           2
          30 to 34                           9           3
          35 to 39                          10           3
          40 to 44                          11           4
          45 to 49                          12           4
          50 to 54                          13           4
          55 to 64                          14           4
          65 and over                       15           4

II.  EDUCATION
          No formal                          ?           1
          Some public school                 ?           1
          Finished public                    ?           1
          Some High School                   ?           2
          Finished High School               ?           3
          Some tech-Senior College           ?           3
          Graduate of tech-Senior College    ?           3
          Some university                    ?           3
          Graduate of university             ?           3

III. FAMILY SIZE (number of persons)
          One                                ?
          Two                                ?
          Three                              ?
          Four                               ?
          Five                               ?
          Six                                ?
          Seven                              ?
          Eight                              ?
          Nine                               ?
          Ten and over                       ?

IV.  INCOME
          0 - $ 2,999                        1
          3,000 - 4,499                      2
          4,500 - 5,999                      3
          6,000 - 7,499                      4
          7,500 - 8,999                      5
          9,000 - 10,499                     6
          10,500 - 11,999                    7
          12,000 - 13,999                    8
          14,000 and over                    9

V.   CITY SIZE
          500,000 and over                   1
          100,000 - 500,000                  2
          30,000 - 100,000                   3
          10,000 - 30,000                    4
          1,000 - 10,000                     5
          Rural                              6

VI.  GENDER
          Male                               1           1
          Female                             2           2

? marks a code that is illegible in the source copy. Recoded
values are shown only where they can be read from the control
variable groupings of Table 3.


Initially, a number of equations were derived to give
the researchers a feel for what second order interaction
effects were relatively important (see Reference 1). The
methodology adopted for doing this was as follows:


(1) Each of several control variables, sex, age and
    education, was chosen in turn (see the "Control"
    Variables column of Table 3).


(2) Regressions were carried out to determine how the form
    of an equation to explain participation in hunting
    depended on the value of the control variable. For
    example, with age as the control variable, participation
    in hunting was predicted for persons 10 to 19 with a
    first independent variable, sex = X(1), then for each of
    the other independent variables X(2) to X(5), resulting
    in the equations which follow and the others that would
    be written if one followed across the first line under
    the heading age:

    P(of person 10 - 19) = C - .222 X(1)   sex relation
              "          = C + .088 X(3)   education relation
              "          = C + .000 X(4)   household size
                                           relation
              "          = C + .003 X(5)   income relation

    WHERE C is a constant.


(3) Similar results were derived for other levels of the
    control variables. Specifically, for the age group 20 to
    29, regressions were made giving equations like the ones
    above, with the first three such equations having
    coefficients of X(1), X(3) and X(4) of -.223, .021 and
    .007 respectively (see Table 3).
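
The screening steps above can be sketched as follows. This is a minimal illustration on synthetic data, not the CORD survey: the records are generated so that the true probabilities follow the sex-specific lines reported in Table 3 (intercepts .341 and .064, age slopes -.052 and -.012), the age codes are restricted to 1..5 so that probabilities stay between 0 and 1, and the function and field names are invented for the example.

```python
import random


def ols_slope_intercept(xs, ys):
    """Least-squares slope and intercept of y on a single regressor x."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx


def slopes_by_control(records, control, regressor, outcome):
    """Fit outcome ~ regressor separately within each level of the control variable."""
    result = {}
    for level in sorted({r[control] for r in records}):
        subset = [r for r in records if r[control] == level]
        xs = [r[regressor] for r in subset]
        ys = [r[outcome] for r in subset]
        result[level] = ols_slope_intercept(xs, ys)
    return result


# Synthetic 0/1 hunting data (sex: 1 = male, 2 = female; age codes 1..5).
random.seed(0)
records = []
for _ in range(20000):
    sex = random.choice([1, 2])
    age = random.randint(1, 5)
    p = 0.341 - 0.052 * age if sex == 1 else 0.064 - 0.012 * age
    records.append({"sex": sex, "age": age, "hunt": 1 if random.random() < p else 0})

# One (slope, intercept) pair per level of the control variable.
fits = slopes_by_control(records, control="sex", regressor="age", outcome="hunt")
for level, (slope, intercept) in fits.items():
    print(level, round(intercept, 3), round(slope, 3))
```

Clearly different slopes across the levels of the control variable are the signal of interaction that the screening was designed to reveal.
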


To comment further, as shown in Table 3, when age is the
independent variable and sex is the control variable, the
equation obtained to estimate the probability of hunting for
females is:


(3) p = 0.064 - 0.012 X(2)


and for males,


(4) p = 0.341 - 0.052 X(2)


Because the lines defined by Equations 3 and 4 are not
parallel and have slopes that are significantly different at
the .05 level (the t-test was applied), it may be concluded
that there is an interaction effect between sex and age. If
there were no interaction effect then the difference between
the sexes could be accounted for by a sex effect as in the
two equations following:


(5) p (for example) = constant + male effect + B X(2)


(6) p (for example) = constant + female effect + B X(2)


WHERE B is a regression coefficient of age that applies to
both sexes. But since, as one can see from Table 2, variable
X(2) has more than two values (e.g. 1, 2, 3, 4, 5, etc.), one
can write the following based on Equations 3 and 4:


p = .064 - .012    = .052 for X(2) = 1 for a female

p = .064 - (.012)2 = .040 for X(2) = 2 for a female
... etc. for all levels of X(2)

p = .341 - .052    = .289 for X(2) = 1 for a male

... etc. for all levels of X(2)
The reader can readily confirm that the system of
equations given above cannot be solved so that parameters
are determined which make Equations 3 and 4 compatible with
Equations 5 and 6. Having the two coefficients of .012 and
.052 in Equations 3 and 4 makes it possible to reflect the
fact that age has a much more pronounced effect on male
participation in hunting than it does for females, in the
sense that young males may have a very high probability of
hunting while older males have a very low probability
similar to the general level of hunting for females. For
females, what is necessary to reflect behaviour is that
there be a quite drastic peak in probability of
participating, from almost nothing to maybe a .10 probability
of participating. However, this peak in relative terms is
not drastic in absolute terms since, compared to males, one
sees that old males have almost no probability of
participating whereas a young male has a 50% chance of
participating.


TABLE 3

METHOD Ul: RESULTS OF PREDICTING HUNTING PARTICIPATION*
USING 1972 CORD NATIONAL SURVEY DATA

Independent Variables Used in Regression
with Selected Control Variables

                                                   Household         City
                           Sex     Age   Education   Size    Income  Size
"Control" Variables        X(1)    X(2)    X(3)      X(4)     X(5)   X(6)

Sex
  Males              (1)           -.052    .022     .031     .020   .343
  Females            (2)           -.012    .003     .006     .012   .009
Age
  10 to 19 years     (1)  -.222             .088     .000     .003   .045
  20 to 29 years     (2)  -.223             .021     .007     .004   .026
  30 to 39 years     (3)  -.216             .021     .019     .022   .027
  40 years & over    (4)  -.1?1             .003     .006     .022   .016
Education
  No Formal-Finished
    public school    (1)  -.135   -.026              .014     .034   .025
  Some high school   (2)  -.250   -.054              .034     .027   .033
  Finished high
    school and +     (3)  -.159   -.029              .006     .005   .021

* See the text for material on how to read the Table. Also
one may note that significance tests on differences between
the B's were calculated but are not presented here because
they play no role in the discussion or in arriving at the
conclusions reached in this paper. A "?" marks a digit that
is illegible in the source copy.
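
The incompatibility of Equations 3 and 4 with Equations 5 and 6 can be made concrete with a few lines of arithmetic. Under Equations 5 and 6 the male-female gap in predicted probability is the same at every age (male effect minus female effect); under Equations 3 and 4 it is not. The sketch below takes the coefficients of Equations 3 and 4 as written and uses age codes 1 to 5 for illustration; the function names are invented here.

```python
def p_female(x2):
    """Equation 3: estimated probability of hunting for females at age code x2."""
    return 0.064 - 0.012 * x2


def p_male(x2):
    """Equation 4: estimated probability of hunting for males at age code x2."""
    return 0.341 - 0.052 * x2


# Male-female gap at each age code; a common-slope model (Equations 5 and 6)
# would require this gap to be the same constant at every age.
gaps = [round(p_male(x) - p_female(x), 3) for x in range(1, 6)]
print(gaps)
```

Because the gap shrinks by .040 with each step in the age code, no single pair of "male effect" and "female effect" constants can reproduce both lines.
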

This should make it clear why the great number of
drastically differing slopes in Table 3 presents clear
evidence that there are interactions that should be
considered in developing models to explain people's
participation in outdoor activities. One could present
statistical tests for the difference in coefficients, which
is what was done in an earlier report on the data presented
in Table 3 (see Reference 1). But this is not done here since
there are problems in comparing regression coefficients
because they are inter-correlated, because the results of
including third and fourth variables are not considered, and
also because subsequent results presented in this paper are
more important in confirming the magnitude and significance
of interactions. Those who may wish to look at more
material on what interactions there are and on their
detection may refer to Renoux (References 11, 12) for
specific examples that have to do with the data of concern
here. More general discussion is found in Reference 16.


A FIRST ATTEMPT TO DERIVE 
A FAIRLY GENERAL MODEL WITH INTERACTIONS 


The step taken after the screening procedure just
described, which showed the value in pursuing the matter of
detecting interactions, was one of introducing cross product
terms into a linear model. Unless theory provides a clear
guide, one starts with a model which offers some chance of
success yet is also relatively manageable. In econometric
research, and in some other areas where concerns with
interactions arise, interactions are often first introduced
by defining them in terms of a cross product of variables.
So, if one assumes (as is done below) that 5 socio-economic
variables are needed in an equation to explain people's
behaviour in outdoor activities, one may write the
following equation:


Y = B(0) + B(1)X(1) + ... + B(5)X(5) + B(1,2)X(1)X(2) + ...
      + B(4,5)X(4)X(5) + e

WHERE X(1) is the age variable, X(2) is a variable giving
information on education, X(3) is a variable giving
household size, etc. as indicated in Table 2, the B's
are the regression coefficients for the different
terms in the equation, and e is an error term.
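
A cross-product model of this kind is fitted by ordinary regression once each respondent's row of regressors has been extended with the pairwise products. The sketch below shows only that construction; the function name and the respondent's codes are hypothetical.

```python
from itertools import combinations


def cross_product_row(x):
    """Return the main-effect terms followed by all pairwise products X(i)X(j), i < j."""
    products = [x[i] * x[j] for i, j in combinations(range(len(x)), 2)]
    return list(x) + products


# Hypothetical respondent: codes for the five variables X(1)..X(5).
row = cross_product_row([7, 5, 4, 6, 3])
print(len(row), row)
```

With five variables there are (5)(4)/2 = 10 products, so each row carries 15 regressors in addition to the constant.
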


In the preceding equation one obvious reason for
keeping only the variables indicated (of those included in
the 1972 Survey) is that these are variables which one can
argue to be interval variables. If variables are to be
multiplied as indicated in the equation it is important that
the multiplication means something, and it would obviously
not mean something if (for example) one of the variables
considered were (say) 10 arbitrary codes for occupation. If
any of the variables are to be considered to be nominal
variables one must return to the kind of framework
introduced earlier, which makes allowance for interactions
where some or all of the variables in an analysis are
nominal variables.

The preceding comment makes it clear that in certain
respects the model defined above is not as general as the
one introduced earlier. If variables are actually interval
variables there is justification for multiplying. Then and
only then can it be argued that the multiplicative model
specified is the most appropriate model for the data. But even
when variables are interval one does not usually know
whether the age-education effect can really be modeled by
taking a product of age and education. Taking a product
lacks the flexibility built into an analysis by simply
grouping data into age-education levels in which one is
willing to accept that behaviour will be fairly homogeneous.


The multiplicative interaction terms which appear in
the cross-product equation above and which are obtained by
taking the product of, for example, X(1) and X(2) are only
one guess (of an infinite number of guesses) as to how
interactions should be specified. One could in fact introduce
a number of terms of the type X(1) to the power A times X(2)
to the power B, where A and B are exponents. But a move in
this direction is simply a move to increase the number of
regression coefficients. As indicated earlier, arbitrary
decisions of this nature that increase the number of
coefficients ultimately result in complications in analysis
that mean estimation problems cannot be handled
computationally: when there are too many regression
coefficients, they either cannot be estimated or are
estimated very inaccurately. So, without theoretical
justification, there is little point in considering powers of
the variables, or considering products of the variables
including three, four or five variables in a product. These
products are simply analogous to the higher order
interactions that it was indicated could be considered
in elaborating on the more general model introduced
initially.

Returning to the main theme, the hope was that when
regressions were carried out to determine the unknowns in
the cross-product equation, enough interactions would have
been considered that the variance explained would be
increased substantially over what was explained in the
Equation 1 regressions. Table 4 shows the results that were
obtained. One sees in the right-hand column that there was a
nominal increase in the value of R² when the interaction
terms were included in the model. For example, when one looks
at the results for a male model for tent camping, one sees
that R² was increased from about .16 to .18.

In the first regression for which R² is reported five
parameters were estimated, whereas in the second 15 were
estimated. If the interaction effects were not important,
the increase in variance explained would be that explained
when the coefficients of the X(i)X(j) terms were not
related to systematic differences in behaviour but simply
"explained" random error. In practical terms the
introduction of these terms should then have explained about
10/(number of cases - 15), which is about .5%, of the
variance that remained to be explained. But in fact the 2%
explained is much in excess of the approximately .5% of the
variance that would have been explained by chance. An
appropriate F test for the significance of this variance
explained is the F test with 10 and an infinite number of
degrees of freedom:


F(10,infinity) approx = (2000/10)(.015/.985) = 3.04

which is significant since it exceeds the .05 level of 2.54.
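
The arithmetic of this test can be retraced in a few lines. The sketch assumes about 2000 cases, as the formula in the text does, and the variable names are invented for the illustration. As a cross-check not taken from the source, standard tables give roughly 1.83 for the 5% point of F with 10 and infinite degrees of freedom (the chi-square 5% point for 10 degrees of freedom, 18.307, divided by 10); the statistic exceeds both that figure and the 2.54 quoted in the text.

```python
n_cases = 2000       # approximate number of cases, as used in the text's formula
added_terms = 10     # cross-product terms added to the 5 main-effect terms
r2_gain = 0.015      # 2% explained less the roughly .5% expected by chance

# Share of the remaining variance the 10 extra terms would "explain" by chance alone.
chance_share = added_terms / (n_cases - 15)

# Approximate F statistic with 10 and (effectively) infinite degrees of freedom.
f_stat = (n_cases / added_terms) * (r2_gain / (1 - r2_gain))

# 5% point of F(10, infinity): chi-square(10) 95th percentile divided by 10.
f_crit_inf = 18.307 / 10

print(round(chance_share, 4), round(f_stat, 3), round(f_crit_inf, 3))
```
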
In a sense there is no need to go into this kind of
statistical test to see that the results of introducing the
interactions are significant. The very fact that many of the
regression coefficients for the product terms are twice
their standard deviation indicates significance at the .05
level. One may notice that the coefficient with value
-.00291 for the tent camping model for males is almost twice
its standard deviation, which is .00160. Similarly the
coefficients for the X(1)X(4) term, for the X(2)X(4) term
and for the X(3)X(5) term are also substantial in
comparison to their standard deviations. Also, if there are
co-linearity problems, the standard deviations are
"exaggerated", causing the values of coefficients to be
small in comparison.

Obviously there is the odd coefficient for the
interaction terms that has a magnitude more than three times
its standard deviation and which one can accept as
significant with more confidence. This can be considered to
relate to the conservative two times rule that Draper and
Smith (see Reference 3) have suggested be used in some tests
of significance in doing regressions where the distributions
are in doubt.

It could still be argued that since so many
coefficients were computed, the significant ones only
reflect chance occurrences. This can be disproved, but the
arguments are not presented here. The fact that far more
than 5% of the interaction coefficients are significant at
the .05 level is really all the proof that is needed.

Table 4 shows the parameters of some models that are
highly statistically significant improvements over the
regression models without interaction terms. However, one
may wonder how significant the improvement of an R² from .16
to .18 is, compared to what could be achieved. When one
compares the results of using the AID program with the
"simple" regression results, there may be some surprise that
introducing the interaction has explained so little
variance. There is certainly no basis for a feeling of
elation because the interaction results are statistically
significant. This is particularly important in understanding
why developing the kind of equations for which coefficients
are presented in Table 4 was not pursued. The researchers,
who were trying to improve on the simple model, saw that the
improved model, though it offered a significant improvement,
did not appear to offer the improvement to be expected if
the variance that was available to be explained by
interaction was being explained. R² was (on the average)
being increased by 10 to 20%, whereas getting in the proper
interaction terms should have increased it by 100%. Something
else needed to be done. Or did it?


VALIDATION OF AID RESULTS


If it does not appear that a predictive model is
explaining the variance that it should, an obvious first
step might appear to be to incorporate more terms into the
model. But concerns about doing so have already been raised.
Another line of inquiry is to determine whether in fact the
model is doing well but the limit of the R² which might
be attained has been assessed incorrectly. The possibility
that the AID program, because of the way it is set up to
search for variance, finds variance even if it cannot be
explained by a model that is perfectly appropriate to the
data must be considered. By going to a simulation
approach one knows what the true model is because one has
been used to generate observations, and one can then
determine by how much (if at all) AID indicates an excess of
variance explained over what can be expected to be explained
by a model that is appropriate to the data.

It was decided to generate a dependent variable Y
having 0 and 1 values indicating participation or
non-participation and of the general form of Equation 1. In
the simulation, five variables with four levels of each
variable were defined. Values of Y around a grand mean of
one-half were generated for 1500 cases. The simulation
formula for regression coefficients was:


B(i,j) = (1/4)((i - 2.5)/1.5)/2^[j-1]


WHERE j indicates the variables 1 to 5 and i indicates the
level of the variable that an individual has, 1 to 4.
These are the coefficients in:

E(y) = U + SUM B(i,j) X(i,j)   when U = 1/2


A random number routine was used to independently and
randomly generate the levels of the 5 independent variables
that characterize an observation. For example, (1, 3, 4, 1,
2) could define a person for whom an observation was made.
For this person:


E(y) = (1/2) + (1/4)[(-1) + (1/3)(1/2) + (1)(1/4) +
       (-1)(1/8) + (-1/3)(1/16)]


In the above one has (1/3)(1/2) as what could be described
as the third term in E(y) because the person has level 3 of
variable 2. The (1/4) which is in each B(i,j) appears as a
factor that multiplies all five B(i,j)'s. An observed Y was
generated using random numbers so that an observation of 1
had a probability of E(y) and 0 a probability of 1-E(y).

In generating collections of numbers like (1,3,4,1,2)
for all variables, it was considered that people were in
levels 1 to 4 of each variable in the ratios 4/3/2/1 so that
4 times as many people were assigned to level 1 of a
variable as to level 4. This was done using a random number
routine so that for a variable X(i), if the random number
generated was under 1/10 a person was assigned to level 4 of
the variable X(i). If the random number was between 1/10 and
3/10 the person was assigned to level 3; if it was between
3/10 and 6/10 a person was assigned to level 2; and if it
was between 6/10 and 1 the person was assigned to level 1.
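
The generating procedure just described can be sketched as below. The coefficient formula is read here as B(i,j) = (1/4)((i - 2.5)/1.5)/2^(j-1), a reading that reproduces each term of the worked example for the person (1, 3, 4, 1, 2); the random number generator, its seed and all function names are assumptions of this illustration and not the original routine.

```python
import random


def coefficient(level, variable):
    """B(i, j) = (1/4) * ((i - 2.5) / 1.5) / 2**(j - 1), level i = 1..4, variable j = 1..5."""
    return 0.25 * ((level - 2.5) / 1.5) / 2 ** (variable - 1)


def expected_y(levels, grand_mean=0.5):
    """E(y) for a person whose levels on variables 1..5 are listed in order."""
    return grand_mean + sum(coefficient(i, j) for j, i in enumerate(levels, start=1))


def draw_level(rng):
    """Assign levels 1/2/3/4 with probabilities 4/10, 3/10, 2/10, 1/10."""
    u = rng.random()
    if u < 0.1:
        return 4
    if u < 0.3:
        return 3
    if u < 0.6:
        return 2
    return 1


def simulate(n_cases=1500, seed=1):
    """Generate (levels, y) pairs, y being 1 with probability E(y) and 0 otherwise."""
    rng = random.Random(seed)
    data = []
    for _ in range(n_cases):
        levels = [draw_level(rng) for _ in range(5)]
        y = 1 if rng.random() < expected_y(levels) else 0
        data.append((levels, y))
    return data


example = expected_y([1, 3, 4, 1, 2])   # the person used as an example in the text
data = simulate()
mean_y = sum(y for _, y in data) / len(data)
print(round(example, 4), round(mean_y, 3))
```

Because level 1 is four times as likely as level 4 and carries a negative coefficient, the mean of the generated Y falls below the grand mean parameter of one-half, which is consistent with the mean of y of about one-third reported in Table 5.
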

The results of the simulation study are shown in Table
5, where the ANOVA figures are calculated on the basis of
theory (because there was no need to estimate these
results). The results for AID analysis are the average
results for 100 analysis runs. As can be seen, the AID model,
when applied to data to which another model is
structurally appropriate, explains only slightly more
variation than the model which is actually appropriate to
the data, the difference in explanatory power only being
noticeable in the third figure of R². The difference in R²
is truly minimal and certainly much less than the difference
between the ANOVA model and AID results reported in Table 1.

Thus the difference between the R² suggested by an AID run
and the R² found using regression models should not be large
if the regression models are truly appropriate to the data.
So, it can be concluded that there is good evidence that for
models very similar to those developed using the CORD Study
data, the AID program detects relatively large sums of
squares which almost certainly do not relate to spurious
interactions. It also appears safe to say that the results
provide a clear indication that there is a great deal more
variance to be explained in the CORD Study data than was
explained by using the model with interaction terms for
which results are presented in Table 4. Introducing the
further complication of interaction terms only explained
about 20% of the variance that should have been explained if
appropriate interaction terms had been considered. If the
models had been good, R² should have gone up 100% on the
average, not just by 20% as was the case. There may be many
more interactions to be considered and/or the interactions
may be of a different type from those which are implicit in
the formulation that was used.

TABLE 5


SUMMARY RESULTS OF AID AND ANOVA ANALYSES
OF SIMULATED DATA TO WHICH AN ANOVA MODEL
IS STRUCTURALLY APPROPRIATE


                                        MODEL
                                   AID         ANOVA
(1) Total sum of squares         336.2792    336.2792
(2) Between sum of squares        59.3586     56.8900
(3) Within sum of squares        276.9206    279.2892
    Mean of y                      0.32733     0.3393
    R² = (2)/(1)                   0.170       0.168



The validation of AID results has only brought one back 
to the point of seeing that little was gained by cross 
product analysis but that much must be achieved if models 
are to adequately explain the relation between socio- 
economic variables and participatione 


DISCUSSION 


The preceding commentary in some sense presents a
logical sequence which has occurred in considering how models
should be developed that may be used to explain people's
participation in activities in terms of their socio-economic
characteristics. However, one very important practical
question remains. When the logical sequence has been built
up it shows that interaction terms should be considered in
developing the kinds of model for which applications have
been introduced in TN 12 and 13. Should a simple Equation 1
model be applied in the way indicated in TN 12? In other
words, should estimates be made for sub-areas of Canada
based on National or Provincial data using simple ANOVA
models, or is this a dangerous practice? If an area of Canada
is very similar to the nation as a whole then one can see
why one could use a model which is deficient in certain
respects. Even if interactions are not considered,
predictions would be corrected in a way that "relatively"
appropriate data suggest they should be. However, when one
recognizes the disparities in Canada in terms of what
activities can be carried out, what differentials there are
in terms of age, income etc., then the dangers inherent in
using parameters for a National model in making predictions
for a sub-area of the country are obvious. The area for
which predictions are made may well be such that the
National parameters are not relevant. One must be very
concerned that the National parameters are only aggregate
parameters with no particular relevance to any sub-areas
that deviate substantially from the national average.

In the context of this paper the crux of the concern is
not whether there are disparities within Canada but whether
the effects of these disparities on people's participation
in activities can be adequately modelled considering only
first order effects; or, if second order effects are
considered, whether regional differences are "explained" by
these and first order effects; and so on for third order
effects.

One is confronted with the fact that the simple
analysis of variance model appears to tap only half the
variance that should be explained by socio-economic
characteristics. At this point in time it is impossible to
say how much this deficiency of a model biases the
predictions that are made. Actually, there is one area of
model deficiency on which comment can be made. In TN 29
supply factors are derived that show that, for at least some
activities, ANOVA equations should include not only
socio-economic variables, but a measure of supply in the
various areas in which people live for whom predictions are
made.


These supply factors can be visualized by:


    Probability of participating
        = U + socio-economic effects + regional supply factor


Now, even though supply factors may account for only 1/5 the
variance that socio-economic characteristics do, one need
only look at TN 29 to see that (for example, for skiing) the
supply factors for Alberta and B.C. are very important in
making correct predictions of participation.
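
The additive form above can be sketched in a few lines of Python. This is only an illustration of how a regional supply factor shifts a prediction; the category names and every number below are invented for the example and are not estimates from TN 29 or any other Note.

```python
# Hypothetical illustration of the additive form
#   P(participating) = U + socio-economic effects + regional supply factor
# All numbers are invented for the example; they are not CORD Study estimates.

GRAND_MEAN = 0.10  # "U": overall participation rate (assumed)

# First-order socio-economic effects, one per category level (assumed values)
AGE_EFFECT = {"18-24": 0.06, "25-44": 0.02, "45+": -0.05}
INCOME_EFFECT = {"low": -0.03, "middle": 0.00, "high": 0.04}

# Regional supply factors (assumed values)
SUPPLY_FACTOR = {"Alberta": 0.05, "B.C.": 0.07, "Ontario": 0.00}


def predicted_probability(age, income, region):
    """Add the grand mean, the socio-economic effects and the supply factor."""
    p = GRAND_MEAN + AGE_EFFECT[age] + INCOME_EFFECT[income] + SUPPLY_FACTOR[region]
    # A linear probability model can stray outside [0, 1]; clip for reporting.
    return min(1.0, max(0.0, p))


# Same socio-economic profile, with and without the regional supply factor.
without_supply = GRAND_MEAN + AGE_EFFECT["18-24"] + INCOME_EFFECT["high"]
with_supply = predicted_probability("18-24", "high", "B.C.")
print(f"without supply factor: {without_supply:.2f}")
print(f"with supply factor:    {with_supply:.2f}")
```

Under these made-up values the supply factor moves the prediction by about a third of its size, which is the kind of shift the skiing example in the text refers to.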

The problem with supply effects is that they should be
considered but they may not be known or may be known only
inaccurately. As indicated in TN 29, massive amounts of data
are required to estimate supply effects from participation
data and there is at present no known way to calculate them
from inventory information on what facilities there are.
Developing formulae for computing supply factors based on
"resource inventory" data would appear to be very important
if good use is to be made of ANOVA models.

On another matter, the preceding comments on AID ignore
an important point: there are kinds of
interactions which it is convenient to consider and there
are kinds which it is extremely inconvenient to consider. The
reader may well ask himself why it is not proposed that one
forget about using the ANOVA models to make predictions
(Cesario gives an example of using an AID model to make
predictions in TN 4). The reason not to use AID models in
making predictions is that they require detailed
multivariate information on every individual and data on
every characteristic of individuals. Such information is
available in Canada on the Census GRDS System but to process
this information on an individual basis involves making
special arrangements with Statistics Canada so that
confidentiality is not violated. It would be a costly
operation. It would be particularly cost-inefficient if the
gains in accuracy of predictions were not large enough to
justify it. This could occur because, though the
predictions were slightly more accurate than ANOVA
predictions in terms of their variance, they might still be
so biased by the inability to consider supply effects that
the answer predicted is not significantly closer to the true
participation values than for less reliable predictions: the
problem may be validity, not reliability.

Incidentally, from a cost-effectiveness perspective one
would never consider making predictions using all available
census data unless the parameters in a model were accurate
enough to justify knowing population characteristics very
accurately. In making predictions one needs data only on a
sub-set of the population that defines the characteristics
of the population much more accurately than the regression
parameters. But this point is not pursued here because it
is taken up later in this discussion in a more general
context.

Even if it is not desirable to use AID to make
predictions, one may not see what the problem is in
considering any arbitrary interaction between 2, 3 or more
variables. The problem is that if there is an age-education
interaction effect for males, one must be able to specify
how many males there are in a specific age-education group
to compute the product B(i,j) n(i,j). This may not be
especially difficult in some cases. Census data may possibly
be used if one's concern is with the past. However, to get
information on males by education for small areas of Canada
is not something trivial. When other variables are
considered on which data are collected on a sample basis the
problem is compounded because even if the Census table
desired is produced for provinces (and many three and four
variable tables would not be) the data for small areas would
have to be obtained by special requests for tabulations. If
Statistics Canada agrees that these should be produced it
takes time and the cost is high. Furthermore, it does not
take much background in demography to know that predictions
of the number of males in a particular age-education group
for some time in the future are not something that is made on
a routine basis. In fact, it is only recently that models
have been developed in which these kinds of projections play
a role. Usually, rather awkward formulae have been
developed to generate the kind of multivariate
characteristics called for if one is to make projections of
the number of males in certain age-education groups in (say)
the year 1990. One certainly should be careful in
introducing interactions into a model if they are ones for
which one cannot make reasonable projections, that is, if the
object of the exercise is to use a model to predict
participation at some future time.

The preceding paragraph raises an issue taken up in TN
6. The accuracy of a model's parameters is not all that
is of concern in using a model. If one is concerned about
both the accuracy of the B(i,j)'s and the n(i,j)'s, the numbers
of people in certain socio-economic groups, one should not
concentrate on the B(i,j) and problems with interactions
when the really large inaccuracy is in the n(i,j)'s, which
must be estimated!
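
A rough way to see which source of error dominates is to propagate both through the sum of B(i,j) n(i,j). The sketch below is not the procedure of TN 6; it uses a standard first-order variance approximation and assumes that errors in the coefficients and in the group counts are independent of each other and across groups. The group labels and all figures are invented.

```python
import math

# Hypothetical illustration: predicted participants = sum over groups of
# B(i,j) * n(i,j), with a first-order variance approximation.
# ASSUMPTIONS (not from the source): errors in B and n are independent of each
# other and across groups; all numbers are invented.

# (B estimate, std. error of B, n estimate, std. error of n) per age-education group
groups = {
    ("young", "secondary"): (0.30, 0.02, 12000, 1500),
    ("young", "university"): (0.45, 0.03, 4000, 900),
    ("older", "secondary"): (0.10, 0.01, 20000, 2500),
}


def predict_with_error(groups):
    """Return (total, std. error, variance share due to B, share due to n)."""
    total = var_from_b = var_from_n = 0.0
    for b, se_b, n, se_n in groups.values():
        total += b * n
        var_from_b += (n * se_b) ** 2  # error contributed by the coefficient
        var_from_n += (b * se_n) ** 2  # error contributed by the group count
    variance = var_from_b + var_from_n
    return total, math.sqrt(variance), var_from_b / variance, var_from_n / variance


total, se, share_b, share_n = predict_with_error(groups)
print(f"predicted participants: {total:.0f} +/- {se:.0f}")
print(f"share of variance from B: {share_b:.2f}, from n: {share_n:.2f}")
```

With these invented inputs most of the variance comes from the projected group counts rather than from the coefficients, which is the situation the paragraph warns about.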

When it comes to the matter of modelling using a small
sample, one should look at the results presented in TN 6 and
recognize that unless sample sizes are in the order of 4,060
or larger, predictions made using the regression results
are going to be extremely inaccurate. If some kind of
statement is to be made about participation by people in a
small area in a certain activity, a reasonable choice may be
to use a telephone survey or some other means of obtaining
information quickly rather than making predictions using
modelling results. Even given all the objections that can be
raised to telephone surveys, etc., little is gained by
replacing the results of such work with results produced
using a theoretical model when it can be shown that these
results have errors which are probably far greater than any
errors that arise in a well planned telephone survey.
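
For comparison, the sampling error of a simple survey estimate is easy to gauge. The sketch below uses the textbook standard error of a proportion and assumes simple random sampling with no non-response or coverage problems, which a real telephone survey would not fully satisfy; the sample size and rate are invented.

```python
import math


def proportion_standard_error(p, n):
    """Standard error of a sample proportion under simple random sampling."""
    return math.sqrt(p * (1.0 - p) / n)


# Invented figures: a survey of 400 respondents in which 20% report participating.
se = proportion_standard_error(0.20, 400)
print(f"estimated participation: 0.20 +/- {1.96 * se:.3f} (approx. 95% interval)")
```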

Turning to a quite different and less practical matter,
an analyst often wants to use regression results to draw
some kind of conclusions about what is happening in the
world or in the universe that he is considering. The failure
to introduce interaction terms into a model when in fact
they relate to about 50% of the variance that could be
explained by the model can certainly be expected to distort
the picture that an analyst would draw from the coefficients
which he estimates.


In closing this discussion one should note, as
indicated in other Notes, that if one is calculating
people's expected probability of participation in an
activity then the very fact that probabilities are being
estimated suggests that each individual observation has a
unique variability associated with it. This
heteroscedasticity problem, which is encountered in dealing
with dependent variables which cannot be accepted as having
a constant variance, is the topic of concern in the
Cicchetti and Smith paper included as an appendix to this
volume, and there is also useful commentary in the review of
Chapter VII.
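
One way to see why the variance cannot be constant is to code each person's participation as a 0/1 outcome y_i with expected probability p_i. This notation is introduced here for illustration only and is not taken from the Notes.

```latex
\[
  \operatorname{Var}(y_i) = p_i\,(1 - p_i)
\]
```

The variance therefore depends on the very probability being estimated: it is 0.25 when p_i = 0.5 but only 0.09 when p_i = 0.1, so observations on people with different expected probabilities cannot share one error variance.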
CONCLUSION


This article has presented some rather distressing
findings about the structure of models commonly used for
predicting participation and frequency of participation in
outdoor activities. It is clear that interaction effects
play an important role in explaining people's participation
in outdoor activities. Neglecting such factors could result
in errors which would mean that the estimates made have
substantial biases. However, as pointed out, there is no
evidence as to whether (once supply factors are taken into
account) biases tend to be very small because the people to
whom interaction effects apply tend to be very homogeneously
distributed among the population. There has been no research
which shows whether or not there are some sectors of the
population for which interaction effects are extremely
important and others for which a simple model would be quite
appropriate. Until such research has been carried out to
clarify this matter it must be recognized that there are
dangers in making predictions using an ANOVA model.


REVIEW


J. Beaman and R. Gillespie


The major complication in measuring recreation demand
is that it does not exist as an entity independent of
existing supply: there is a functional relationship between
the demand-controlling forces and recreation opportunity.
Understanding this relationship is a difficult matter,
complicated by the fact that even the opportunity side of
the equation is not clearly defined. There is no established
means of measuring "opportunities" for activities.
In part, this relates to a lack of adequate means of dealing
with the problem of differences in quality of resources and
opportunities. But even if this problem of quantification
could be overcome, an effective measurement of opportunity
must be a measure of perceived opportunity, which could
differ widely from what opportunity (in fact) exists.

The recreation demand of a given community is never
static, but constantly changing in response to variations in
the physical and technological, institutional, and
socio-economic forces which are its determinants. Demographic
variables are only a few of the multitude of causal factors
affecting participation. "Faddism", rapid taste changes, and
new technological innovations are just a few of the many
other factors which make the determination of the level of
recreation demand a very elusive objective.

The final complicating factors in the measurement of
demand for recreation facilities, according to Burton (see
Reference 5), are the concepts of substitute demand, diverted
demand, and latent demand as they apply to recreation.
Substitute demand occurs when demand for a certain kind of
facility is drawn away to a completely different facility
which, for some reason, attracts the users of the first (see
TN 37). Diverted demand, on the other hand, comes into play
in the situation where demand is diverted from one certain
kind of facility to a new source of supply. Latent demand is
that which exists but is not being satisfied due to
restricting factors such as income or lack of opportunity.

Having read the preceding comments the reader may well
ask: "What value is TN 12 when it has to do with applying a
model to estimate use at some location and possibly at some
future time, ignoring most of the demand factors just
cited?". As the authors admit in the Note, the general
procedure they endorse is not an original contribution by
them. The importance of the Note to the CORD Study is that
it provides an example of how something may be done using
the CORD Study data and that familiarity with the ideas
presented is particularly important in understanding TN 6,
13, 20, 29 and 36.

This suggestion that TN 12 presents a rather simplistic
model does not imply a similar "condemnation" of TN 6, which
chose a simple example of how to calculate accuracy of
estimates to illustrate a general principle. The Note is not
profound mathematically and possibly the authors should be
faulted for not making clear that the approach for assessing
error can be used in the way described for any model that is
linear in terms of its coefficients. This includes the kind
of models described in TN 13 and 20. As well, the method can
be used with models such as the Rutgers model (Chapter I) or
the models described in TN 29, both of which have supply
factors.

One can fault TN 20 because it deals only with gaining
a better explanation by introducing interactions between
socio-economic variables. Possibly it would have been more
relevant to try to determine whether institutional factors,
supply factors or technological change were more deserving
of attention. Certainly the notes in this chapter do not
adequately bring to the reader's attention the fact that
other Technical Notes bring out the following
considerations:


(a) Faddism and technological change in predicting activity
(TN 13).


(b) The importance of supply (TN 29).


(c) The importance of substitutability or alternatives (TN 3


(d) The importance of the group a person belongs to (TN 10,
TN 13 and TN 32).


Probably the most serious fault of the above notes is
that the models proposed are formulated ignoring the
multidimensional nature of people's participation in
activities: identification of "types of recreation" may
facilitate the prediction of future demand for various
recreation facilities. To quote Burton, "If participation in
any particular pursuit can be linked to participation in
other pursuits, and to certain socio-economic
characteristics, then, as the socio-economic characteristics
of a given population change, it should become possible to
predict, at least in general terms, how participation in
given pursuits and groups of pursuits will change". Burton's
idea (but not method) is endorsed here. Groups of people
with common participation patterns (as derived in TN 10) are
(as claimed in TN 13 and TN 32) a more stable base for
forecasting than participation in individual activities,
because participation rates in various activities for such a
group reflect definitive behavioural characteristics of
people with that "activity package". If, for example, due to
a reduction in personal disposable income, an individual
changes his participation from one activity to another, it
is not very likely that he will totally abandon his activity
package but rather he will concentrate his activity more in
his less expensive activities. This change, then, would not
affect forecasts based on groups as much as it would those
based on individual activities.

Despite the negative comment on the articles in this
chapter, it is still the view of the reviewers that the
chapter is a valuable collection of articles. No easy
do-it-yourself guide is known to be available for making use
of analysis of variance results in predicting participation
in activities (e.g. as in TN 12). And having an article (like
TN 6) that points up the importance of making error
estimates is constructive. The numerous studies where
predictions of the volume of park use or some other activity
are made with no hint of the error involved are abundant
evidence of the need for a method of error estimation.


Finally, although TN 20 is not
definitive in any way on when a model is good enough, it has
at least shown how bad some models are and how they can be
improved. It is to be hoped that this first step will result
in other research that will produce really good models.


CHAPTER V


SUPPLY ANALYSIS


INTRODUCTION


Through reading Chapter I, it is possible to see that
Knetsch, in presenting a design for the Canadian Outdoor
Recreation Demand Study and in developing a further proposal
for data analysis, saw the "analysis of supply" as very
important. So it will not be surprising to find out that
(contrary to what its number may suggest) TN 16,
"Considerations in Defining a Methodology for Calculating
the Supply of Outdoor Recreation", was completed early in
the CORD Study. However, it may be surprising to find out
that this study was not really prompted by Knetsch's design
for the CORD Study but rather reflects Ontario's work on the
TORPS, Tourism and Outdoor Recreation Planning Study.
Incidentally, this note does not reflect the current status
of "research" on measuring supply. Provincial committees on
supply measurement have continued to meet (to 1976) and in
1974 one full-time research officer was appointed to be
responsible for Ontario's supply measurement project.

CORD Study Volume III (as is commented on in TN 5)
makes it clear that lack of compatibility in CORD Facility
Inventory data between the Provinces was a factor that
forced a particular direction on "national scale" supply
analyses. As the reader has seen in Chapter I, Knetsch
proposed that Canada follow the "Rutgers methodology" in
developing models. This involves introducing measures of the
level of supply into "origin" models such as that presented
in TN [illegible]. But, in 1971, after expert advice from
several persons including a member of the Rutgers research
group (Cicchetti), it was concluded that because of (1)
variation in types of facilities on which information was
collected and (2) variation in the years to which data were
relevant (from 1968 to 1971), using the Rutgers methodology
to develop a Canadian model had questionable value.

By 1971 Cheung's development of a day-use model had
raised questions about how the availability of "alternative
supply" around a given city or park affects the use of a
given unit of supply, e.g. use of a given park. The need to
answer such questions prompted the preparation of TN 5.
Strangely enough, work on this note started in early 1972,
but only in 1974 did the interrelationships between a number
of issues become clear enough that the authors felt free to
complete the note in its present form.

Only when the reader of this volume has completed
Chapter X will he see how the CORD Study researchers who
prepared TN 29, by detailed study of the Cesario model,
eventually saw a unity between origin and destination
models. Work on TN 5 followed by work on TN 11 and 33 made
clear that there was a need for a work like TN 29. However,
there is no need to pursue details. It has long been
recognized (e.g. see Veal's comments in chapter 13 of
Reference 2) that people do not react linearly to the
"amount of supply" for participating in an activity
available to them. Cheung's use of an alternative factor
involves the recognition that geographic configuration of
supply and the "attractivity" of different units of supply
affect behaviour. To cite but two examples, TN 30 and 37
introduce analysis and literature on the effect that the
availability of alternative activities in which to
participate has on participation in a given activity. So
there was good reason to try to get information from an
"origin perspective" about how people responded to the
complex supply situation in which they lived out their daily
lives. The search for such information was undertaken in the
way described in TN 29.


CORD STUDY TECHNICAL NOTE 16


CONSIDERATIONS IN DEFINING
A METHODOLOGY FOR CALCULATING
THE SUPPLY OF OUTDOOR RECREATION


M. Cox


ABSTRACT




The purpose of this paper is to suggest ways of
measuring supply, with supply defined as the number of user
days of recreation that facilities or a land area are
capable of supporting in a year. Relevant terms are defined,
such as: space standard, turnover rate, institutional factor
(constraint), etc. Then, in terms of the concepts introduced,
various "supplies" are defined and explained. These are:


1. Theoretical Potential Supply - This is the supply of
recreation that would exist in an area if all land and
facilities were developed to handle the participation
that the area is capable of supporting while continuing
to maintain the quality of the environment.


2. Present Potential Supply - Present potential supply is
calculated in the same fashion as theoretical potential
supply except that it is the present degree of
development which is considered in the definition.


3. Present Usable Public Supply - Not all of the potential
supply of recreation in an area can be considered
available for consumption by the public because of either
accessibility or ownership constraints. Present usable
supply is, therefore, calculated in the same fashion as
present potential supply but includes only land and water
that is both open to the public and accessible.


4. Effective Supply - Effective supply is present usable
supply adjusted for (i) the institutional constraint
imposed by present working habits, (ii) legal constraints
which restrict the days available for certain types of
recreation, and (iii) competing land or water use.


Examples of how to compute the supply measures proposed
are presented.

This report was prepared early in a continuing Ontario
project on supply measurement. Many problems in measurement
and refinements to the basic ideas behind the supply project
were being documented even as the paper was completed in
1972.


INTRODUCTION


When deciding where, and in what activities, recreation
dollars should be invested, it is important to look at
present supply, participation and demand. The purpose of
this paper is to suggest ways of measuring supply, with
supply defined as the number of user days of recreation that
the facilities or land area are capable of supporting in a
year. When interpreting what is meant by a user day, it
should be kept in mind that it is the kind or type of
experience rather than the duration of the outing that is
important and that, therefore, the time dimension attached
to a user day is flexible.


Definition of Terms 


Before discussing definitions of supply, it is
necessary to introduce a few terms.


1. Space Standard


This is the term used in referring to the number
of recreationists that can be accommodated per
unit of area without appreciably destroying the
quality of the recreation experience and/or the
quality of the environment (for some class of
user). These standards can run the entire gamut of
sophistication from, for example, a standard of X
linear feet of beach per swimmer to a standard
that varies with the OLI standard of the beach.
(OLI refers to the Ontario Land Inventory, which is
an inventory that involves some basic
modifications to the Canada Land Inventory.)


It may be that the environmental quality standard
is reached before the point where the quality of
the experience is destroyed. If this is the case,
the space standard established for the quality of
the environment should be the standard used in the
supply calculation.


2. Turnover Rate


More than one person or group can engage in the
same activity at the same location during
different periods of a day. For example, one group
can occupy a picnic table in the morning, another
in the afternoon and possibly a third group in the
evening, for a turnover rate of 3. In other words,
there may be three user days of picnicking
available on a single day.


3. Institutional Factor


Most people who are employed follow the normal
pattern of working during the week and taking the
weekend off. Therefore, although a similar number
of user days of recreation are usually available
on each day of the week, user days of supply on
weekends are of more value than user days on
weekdays since most people are not free to use
them during the week. In the author's view, this
implies that a reduction factor should be used to
deflate the weekly estimate of supply (K factor).


4. Ownership and Accessibility (Location)


There are many types of ownership of recreation
land and/or facilities and this can affect the
supply of recreation available to the public. The
ownership categories that have been used in the
1973 Ontario Tourism and Outdoor Recreation
Planning Study (see Reference 21) for the
Household Recreation Survey are as follows:


A. Crown land.


B. Provincially operated:
(i) Parks, recreation areas, public
hunting and fishing areas;
(ii) Other public facilities.


C. Conservation Authority operated:
(i) Parks,
(ii) Open spaces.


D. Municipally operated:
(i) Parks and open space,
(ii) Other municipal facilities.


E. Federally operated:
(i) Parks,
(ii) Other public facilities.


F. Private land open to the public.
G. Private land not open to the public.


Similarly, many recreation opportunities are
inaccessible because of locational constraints.
For example, a lake might potentially provide user
days of fishing; however, if it cannot be reached
without great difficulty then this should be
recognized in the definition of usable supply. On
accessibility and travel, see TN 14.


5. Legal, Seasonal and Weather Constraints


Not every day of the year is available for each
activity because of seasonal and legal
restrictions. For example, swimming and boating
are summer activities. Snowskiing and snowmobiling
are winter activities. Most types of hunting and
fishing are restricted by law to certain periods
of the year. Weather conditions also restrict the
days available for particular activities even when
these activities are in season. Weather actually
involves a different nature of constraint on
supply than legal restrictions but for the
purposes of this preliminary discussion these
factors are considered together. Basically, if one
wants he can often participate legally in spite of
the weather. The same cannot be said regarding
legal constraints.


6. Competing Land or Water Use


There is often a conflict amongst recreation uses
on a given tract of land or water. For example,
water used for waterskiing interferes with the
simultaneous use of that water for boat fishing.
Multiple use of areas is a topic that has received
substantial discussion. Pearse's (see Reference
16) article "Principles for Allocating Wildland
Among Alternative Users" is an example of a
growing literature. There are a variety of issues
regarding non-consumptive use to which one may
refer. (See References 13, 14 and TN 23.)


Definitions of Supply 


The author believes that four methods of measuring
supply should be considered.


1. Theoretical Potential Supply


This is the supply of recreation that would exist
in an area if all land and facilities were
developed to handle the participation that the
area is capable of supporting while continuing to
maintain the quality of the environment. In other
words, theoretical potential supply is the supply
that would be forthcoming if an area were
developed to the maximum that its OLI capability
ratings allowed. The possibility of maintaining
the quality of the environment at a given level is
the fundamental issue. Theoretical potential
supply is calculated for non-consumptive types of
recreation, such as boating, swimming, picnicking,
as the product of the area available for a
recreation activity, the turnover rate, the space
standard appropriate to the highest degree of
development, and length of the season in days, with
length of season being defined as the number of
days the activity can be participated in each
year. (The length of season definition for
theoretical potential supply includes bad weather
days during the season but ignores legal
constraints.) Non-consumptive is used here in a
special way because in the long run even
picnicking and boating can be consumptive. The
definition of potential capacity assumes that
management can preserve a non-consumptive balance
by applying sufficient capital to maintain a given
quality of the environment.


In the case of consumptive activities such as
hunting and fishing, the theoretical potential
supply calculation recognizes biological
productivity and catch per user day based on the
assumption that the lands and waters are stocked
to their biologically maximum level. Finally, it
is assumed that all land and facilities are
physically accessible and open to the public.


2. Present Potential Supply


Present potential supply is calculated in the same
fashion as theoretical potential supply except
that it is the present degree of development which
is considered in the definition. (OLI suitability
is considered to be a valid index of present
degree of development.) Space standards
appropriate to the present degree of development
are used in the calculation.


3. Present Usable Public Supply


Not all of the potential supply of recreation in
an area can be considered available for
consumption by the public because of either
accessibility or ownership constraints. Present
usable supply is, therefore, calculated in the
same fashion as present potential supply but
includes only land and water that is both open to
the public (i.e. TORP categories A to F) and
accessible (this would usually mean the area can
be reached by car and/or after a short walk).


4. Effective Supply


Effective supply is present usable supply adjusted
for (i) the institutional constraint imposed by
present working habits as discussed at the
beginning of this paper under item 3
(Institutional Factor), (ii) legal constraints
which restrict the days available for certain
types of recreation, especially hunting and
fishing, and (iii) competing land or water use.
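
The four definitions can be chained in a short Python sketch for a non-consumptive activity. The product of area, space standard, turnover rate and season length is the form stated in the text. Treating accessibility, ownership, the K factor and competing use as multiplicative shares, and rescaling the season for legal limits, are assumptions made here for illustration; the paper does not give those formulas. All field names and figures are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Site:
    """Hypothetical inputs for one activity at one site (names are illustrative)."""
    area: float                         # units of area available for the activity
    space_standard_max: float           # users per unit area at maximum development
    space_standard_present: float       # users per unit area at present development
    turnover_rate: float                # user groups per unit of capacity per day
    season_days: int                    # season length, bad-weather days included
    legal_season_days: int              # season length once legal limits apply
    share_accessible: float             # fraction of the area that can be reached
    share_public: float                 # fraction open to the public (categories A to F)
    k_factor: float                     # institutional deflator (weekday vs weekend)
    share_free_of_competing_use: float  # fraction not lost to competing uses


def user_days(area, space_standard, turnover_rate, days):
    """Product form stated in the text for non-consumptive activities."""
    return area * space_standard * turnover_rate * days


def supplies(s: Site) -> dict:
    """Return the four supply measures, in user days per year."""
    theoretical = user_days(s.area, s.space_standard_max, s.turnover_rate, s.season_days)
    present = user_days(s.area, s.space_standard_present, s.turnover_rate, s.season_days)
    # ASSUMPTION: accessibility and ownership enter as multiplicative shares.
    usable = present * s.share_accessible * s.share_public
    # ASSUMPTION: legal limits rescale the season; K and competing use multiply.
    effective = (usable * s.legal_season_days / s.season_days
                 * s.k_factor * s.share_free_of_competing_use)
    return {
        "theoretical_potential": theoretical,
        "present_potential": present,
        "present_usable": usable,
        "effective": effective,
    }


# Invented picnic-area example.
example = Site(area=10, space_standard_max=40, space_standard_present=25,
               turnover_rate=3, season_days=100, legal_season_days=100,
               share_accessible=0.8, share_public=0.5,
               k_factor=0.6, share_free_of_competing_use=0.9)

for name, value in supplies(example).items():
    print(f"{name}: {value:,.0f} user days")
```

Each measure is no larger than the one before it so long as the present space standard does not exceed the maximum and every share lies between 0 and 1, which matches the way the definitions successively add constraints.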




CONCLUSION 


This paper presented a discussion of a number of
concepts that are of importance in developing an
understanding of the supply of outdoor recreation. The
ideas presented here have played an important role in the
development of the Ontario Day-Rec and TORP models and are
used in "A Method of Allocation of Recreational Supply to
Urban Centres". (See TN 17.)


Ontario's planning depends very heavily on an
increasingly sophisticated understanding of the nature of
recreation supply. As analysis proceeds according to the
requirements of a planning-programming-budgeting system,
consideration of tradeoffs between alternative supplies for
a given activity and tradeoffs between activities having
common or disjoint supply is becoming increasingly
important. It is imperative that the recreation planner
develop a thorough understanding of the nature and
composition of supply.


APPENDIX 


The purpose of this appendix is to summarize the
definitions of supply that have been presented and to show
by means of example how these alternative supplies might be
calculated. Factors shown in capital letters distinguish a
method of measuring supply from the preceding method.


Theoretical Potential Supply


- development to level that OLI capability rating
  indicates


- function of area available, SPACE STANDARD
  CONSISTENT WITH MAXIMUM DEVELOPMENT,
  turnover rate, length of season (excluding
  legal constraints)


Present Potential Supply


- present degree of development considered


- function of area available, SPACE STANDARD
  CONSISTENT WITH PRESENT DEGREE OF
  DEVELOPMENT, turnover rate, length of
  season (excluding legal constraints)


Present Usable Supply


- present degree of development considered


- function of area available, space standard
  consistent with present degree of
  development, turnover rate, length of
  season (excluding legal constraints),
  ACCESSIBILITY and LAND AND WATER TENURE


Effective Supply


- present degree of development considered


- function of area available, space standard
  consistent with present degree of
  development, turnover rate, length of
  season (INCLUDING LEGAL CONSTRAINTS),
  accessibility, land and water tenure,
  INSTITUTIONAL CONSTRAINTS and COMPETING
  USES ON LAND AND WATER


Table 1 is a tabular presentation of the considerations
above. Table 2 is an example of how different types of supply
are calculated.
TABLE 1


SUPPLY CALCULATIONS CONSIDERATIONS*


                                     Supply Categories

Considerations             Theoretical   Present     Present
                           Potential     Potential   Usable    Effective

 1. Potential Stage            X
    of Development
 2. Present Stage                            X          X          X
    of Development
 3. Area Available             X             X          X          X
 4. Space Standard             X             X          X          X
 5. Turnover Rate              X             X          X          X
 6. Length of Season           X             X          X          X
    (includes bad
    weather considerations)
 7. Accessibility                                       X          X
    (% accessible)
 8. Ownership                                           X          X
    (% open to public)
 9. Institutional                                                  X
    Constraints
10. Length of Season                                               X
    Recognizing Legal
    Constraints
11. Competing Land                                                 X
    & Water Use


* X indicates that an item is considered in the calculation
  of supply.

TABLE 2


SUPPLY CALCULATION: AN EXAMPLE


                                Area     Space     Turn-  Season  Acces-  Open to  Institu-      User days
Supply Category         Class  (linear  standard   over   (days)  sible   public   tional     By class    Total
                                feet)                                              factor

Theoretical Potential     1      100       5         2      90      -       -        -         90,000
Supply                    2      200       3         2      90      -       -        -        108,000   198,000

Present Potential         1      100       3         2      90      -       -        -         54,000
Supply                    2      200       2         2      90      -       -        -         72,000   126,000

Present Usable Supply     1      100       3         2      90     .75     .50       -         20,250
                          2      200       2         2      90     .75     .50       -         27,000    47,250

Effective Supply          1      100       3         2      90     .75     .50      .75        15,188
                          2      200       2         2      90     .75     .50      .75        20,250    35,438

A dash indicates that the consideration does not enter the calculation for that category.


In other words, the area could support 198,000 user days of bathing if developed to the limit
consistent with its OLI ratings. It is presently providing 126,000 potential user days which,
when ownership and accessibility are considered, reduce to 47,250. Finally, if institutional
constraints, legal constraints and competing use are accounted for, the effective supply of the
area is 35,438 user days of bathing.
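
The arithmetic behind Table 2 can be checked with a short sketch. The function and variable names below are illustrative only and do not come from the report. The sketch assumes that user days are the simple product of area, space standard, turnover rate, season length and the percentage reductions, and that the higher space standards (5 and 3 people per linear foot) apply to maximum development.

```python
# Sketch of the Table 2 arithmetic; all names are illustrative assumptions.

def user_days(area_ft, space_standard, turnover, season_days,
              accessible=1.0, open_to_public=1.0, institutional=1.0):
    """User days = area x space standard x turnover x season x reduction factors."""
    return (area_ft * space_standard * turnover * season_days
            * accessible * open_to_public * institutional)

SEASON = 100 - 10          # 100-day season less 10 bad-weather days
TURNOVER = 2               # people per day
AREA = {1: 100, 2: 200}    # linear feet of beach, by class
DEVELOPED = {1: 5, 2: 3}   # people per linear foot, assumed maximum development
NATURAL = {1: 3, 2: 2}     # people per linear foot, no development

def total(space, **factors):
    """Sum user days over both beach classes for one supply category."""
    return sum(user_days(AREA[c], space[c], TURNOVER, SEASON, **factors)
               for c in AREA)

theoretical = total(DEVELOPED)
present_potential = total(NATURAL)
present_usable = total(NATURAL, accessible=0.75, open_to_public=0.50)
effective = total(NATURAL, accessible=0.75, open_to_public=0.50,
                  institutional=0.75)

print(theoretical, present_potential, present_usable, effective)
```

Under these assumptions the four totals are 198,000, 126,000, 47,250 and 35,437.5 user days; the last figure rounds to the 35,438 shown in the Total column of Table 2.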


(The assumption numbers correspond to those used in Table 1) 


1. The area is in its natural state.

2. The area is in its natural state.

3. Area available - 100 linear feet of Class 1 and 200 linear feet of Class 2 bathing beach.

4. Space standards - with no development, Class 1 = 3 and Class 2 = 2 people/linear foot
   of beach; with maximum development, Class 1 = 5 and Class 2 = 3 people/linear foot of beach.

5. Turnover rate is 2 people/day.

6. Season is 100 days with weather conditions preventing use 10 days/year.

7. 75 percent of the area in each Class is physically accessible.

8. 50 percent of the accessible area for each Class is open to the public.

9. Institutional factor ("KR") is .75.

10. No legal constraints. 

11. No competing land or water uses.


CORD STUDY TECHNICAL NOTE 3


THE DEFINITION AND EVALUATION
OF A CLASS OF ALTERNATIVE-SITE FUNCTIONS


J. Beaman and S. Smith


ABSTRACT 


The effect of the availability of alternative locations
at which to participate in an activity has long been
recognized as affecting participation at a given site. In
fact, it has been recognized that either having a variety of
alternatives around a person's origin or having a variety of
destinations around a possible destination means that origin
and destination alternative factors should be considered in
modelling travel behaviour.

A number of desirable characteristics of alternative
functions are discussed in this paper and it is indicated
that the particular class of alternative functions being
considered satisfies each condition under specific
conditions. Ways are shown in which the approach to defining
alternative factors has been unnecessarily ad hoc.

The paper presents results that demonstrate that the
coefficients (exponents of distance) that some researchers
have used in defining alternative factors mean that distant
alternatives are more important in determining people's
behaviour than facilities that are close. The range of
values of exponents that result in close alternatives being
more important to people's behaviour than distant
alternatives is defined.


PURPOSE 


The purpose of this note is to discuss the formulation
of, and the problems associated with, an alternative-site
function.


BACKGROUND 


The seminal work on the use of alternative sites in
travel modelling is probably Stouffer's intervening
opportunity model (see Reference 22). Stouffer argued that
the reduction in the number of trips to increasingly distant
destinations was not a reaction to distance costs, but
rather a reflection of the successful competing for
customers by nearer facilities over farther facilities.

More recently, Grubb and Goodwin (see Reference 10)
developed a technique for operationalizing the concept of
competing facilities as a component in recreation travel
models. In their work on visitation to reservoirs
distributed around Texas, Grubb and Goodwin formulated a
general visitation equation:


(1)  Y = A{X(1)**(b(1)) X(2)**(b(2)) X(3)**(b(3)) X(4)**(b(4))
         X(5)**(b(5))}


WHERE


Y = the number of visitor days from origin i to reservoir j
    per unit of time;

X(1) = population of origin i;
X(2) = round trip travel costs between i and j;
X(3) = per capita income at i;
X(4) = surface area of j;
X(5) = a variable measuring the effect of competing
       reservoirs available to users at origin i on
       attendance at reservoir j; and
A, b(1), ..., b(5) = parameters to be estimated.

In the context of this article the last variable, X(5),
represents the major contribution of the Grubb and Goodwin
study. Their competing-site or alternative-site function, to
measure the impact of competition on reservoir j, was
operationalized as:


(2)  X(i) = Σ (log S(k) / D(i,k))


WHERE

X(i) = alternative-site factor for origin i, reflecting the
       existence of alternatives to destination j;

S(k) = surface area of competing reservoir k;

D(i,k) = distance between i and k;

n = number of reservoirs within 100 miles (an arbitrary
    cut-off distance) of origin i; and the summation is
    over n reservoirs.


The expected sign of X(5) in a least squares regression
solution is negative. Thus, attendance at reservoir j can be
expected to decrease as: (1) the number of alternative
reservoirs (within 100 miles) increases; (2) the surface
area of alternative reservoirs increases; and/or (3) the
distance to the alternative reservoirs decreases. Thus X(5)
is intended to reflect the competition from all other
available reservoirs.
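
The behaviour of Equation 2 can be illustrated with a minimal sketch. The reservoir data below are hypothetical, the function name is illustrative, and the base of the logarithm is not stated in the text, so base 10 is assumed here.

```python
import math

def alternative_site_factor(reservoirs, cutoff_miles=100.0):
    """Equation 2: sum of log S(k) / D(i,k) over reservoirs within the cut-off.

    reservoirs: iterable of (surface_area, distance_from_origin) pairs.
    Base-10 logarithms are an assumption; the text does not give the base.
    """
    return sum(math.log10(s) / d
               for s, d in reservoirs
               if 0 < d <= cutoff_miles)

# Hypothetical (surface area, miles from origin) pairs.
base = [(1000.0, 50.0), (10000.0, 80.0)]

x_base = alternative_site_factor(base)
x_more = alternative_site_factor(base + [(1000.0, 40.0)])      # one more alternative
x_larger = alternative_site_factor([(10000.0, 50.0), (10000.0, 80.0)])  # larger area
x_closer = alternative_site_factor([(1000.0, 25.0), (10000.0, 80.0)])   # shorter distance
x_far = alternative_site_factor(base + [(1000.0, 150.0)])      # beyond the cut-off
```

Each of the three changes listed above (more reservoirs, larger reservoirs, closer reservoirs) raises the factor, while a reservoir beyond the 100-mile cut-off leaves it unchanged. With a negative estimated coefficient, a larger factor corresponds to lower predicted attendance at reservoir j.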

A number of authors have furthered this investigation
by introducing alternative-site factors into their
recreation travel models. One of the most recently presented
models that utilized such a factor is a day-use visitation
model developed by Cheung (see TN 11). Other formulations
which have been included in travel models were developed by
Cesario, Goldstone and Knetsch (Reference 3), Pankey and
Johnston (Reference 18) and Elsner (Reference 8). The effect
of alternative sites has also been approached by examining
them in a trade-off context. This approach considers the
effect of alternative sites on the distribution of users
among a system of competing facilities. A few relevant
examples of this approach include the Ontario Tourism and
Outdoor Recreation Plan Systems Model (Reference 24),
Wennergren and Nielsen (Reference 27), Ullman and Volk
(Reference 25) and Knetsch and Cesario (Reference 12).


THEORY 


Figure 1 is used to present a simple graphic
illustration of the alternative-site problem and some of the
notations used in the following discussion of theoretical
issues relating to the development of alternative-site
functions. The volume of visitor flow from i to j is usually
predicted on the basis of variables such as the population
of i, the attractivity of j and the distance between them. A
closer modelling of reality requires the inclusion of some
measure of the competition of other facilities offering
similar recreation experiences to the same user group. This
is the purpose of introducing an alternative-site factor.

The omission of such a factor in, for example, the
simple gravity model (which includes only measures of origin
and destination characteristics, including distance, but not
of alternative destinations) leads to the following logical,
but unrealistic conclusion. Consider a simple unconstrained
gravity model which predicts per capita use of a facility.
If five more identical facilities are developed at the same
distance from an origin, the gravity model implies that
total per capita visitation at all sites will be six times
as high as with one (i.e., it implies that demand is
infinitely elastic with respect to supply). An
alternative-site function embodies the more realistic
assumption that total per capita visitation may increase,
but not at the same rate as the proliferation of new
facilities.
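
The sixfold result can be reproduced with a small sketch of an unconstrained gravity model. The functional form (attractivity divided by a power of distance), the scale constant g and the exponent b are hypothetical choices made for illustration; the text does not specify them.

```python
def gravity_per_capita(attractivity, distance, g=1.0, b=2.0):
    """Per capita visits to one site under a simple unconstrained gravity model.

    g (scale) and b (distance exponent) are hypothetical values.
    """
    return g * attractivity / distance ** b

# One facility, then six identical facilities at the same distance.
one_site = gravity_per_capita(10.0, 20.0)
six_sites = sum(gravity_per_capita(10.0, 20.0) for _ in range(6))

ratio = six_sites / one_site
print(ratio)
```

Because each site's predicted visitation is computed independently of the others, the total grows in exact proportion to the number of sites, whatever values of g and b are used.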

There is an exception to this scenario that deserves
mentioning, especially in light of the very high usage some
recreation facilities receive. If a facility is being used
to capacity for some given activity, then the addition of
one identical facility may result in a doubling in per
capita visitation rates. This can be expected to occur
whenever there is a latent demand for the services of a new
facility. The solution to this apparent problem is the
recognition that if a facility is being used to capacity, a
gravity model is inappropriate. In this situation of very
high demand and very low supply, the appropriate use-level
forecasting techniques should be based on capacity measures.

[Figure 1: an origin, Park j with attractivity Aj, the number
of visitors, and the surrounding alternative sites.]


Another consideration is the fact that the reaction to
alternative sites is largely a perceptual process. Consider
now that a user at some origin is evaluating a large number
of sites around him. It is quite feasible that, while he is
able to distinguish among individual, nearby sites, he is
unable to distinguish individual sites in a distant resort
region. If he decides to go to this more distant region, his
choice is based not on the sum of the attractions of the
individual facilities in this region but rather on his
perception of that region as a single "facility".

However, it is reasonable to suggest that once this
user has arrived in that region and begins to make a choice
between alternatives there, a second alternative-site
function becomes important - a destination alternative-site
function. This function is distinct from the initial
alternative factor, which can be referred to as the origin
alternative-site factor. The former is used to suggest, for
example, that different stretches of a distant, developed
beachfront compete with each other when (and only when) a
user has arrived in the local area. Similarly, at home a
user may decide among several National Parks, but once he
arrives at his chosen park he has to decide among
alternative campgrounds within that park.

It is possible to argue that this problem of perceiving
at a distance differences between closely situated parks is
solved by the attractivity factor included in the Grubb and
Goodwin alternative factor formulation. However, the
subsequent discussion shows that, in terms of the
decision-making strategy outlined by the authors, the way a
decision is made as to how one reacts either to comparable
facilities competing with each other or to sub-units within
a given park or beach is best understood by considering both
origin and destination alternative-site factors.


THE MATHEMATICS OF DEFINING AN ORIGIN 
OR DESTINATION ALTERNATIVE-SITE FACTOR: SOME GENERALITIES 


While the authors have argued for a recognition of the
possibility of a two-stage decision-making process with
regard to the ultimate choice of a recreation site, the
general concepts and mathematics underlying each stage do
not differ. It is possible to begin this section by offering
three basic, important criteria against which any proposed
alternative-site factor must be measured:


1. The competitive strength of any given
   alternative in an alternative-site function
   should be related to: (a) a measure of the
   attractiveness or utility of this alternative
   for a given usage; (b) the relative
   accessibility of the alternative from the
   "origin" of visitors (either their home or an
   equivalent point on the highway);


2. The magnitude of the alternative-site function
   should reflect some aggregate of the individual
   alternatives; i.e., the competitive importance
   of the alternatives as a group is directly
   related to their total number;


3. Any alternative-site factor should reflect the
   addition or subtraction of a new destination
   area within the field of existing
   possibilities.


These criteria, however, are operationally imprecise and
vague in that they do not state even the direction of
relationships between the variables.

Within the context of these general ideas, the aim of
this paper is to describe how alternative factors should be
defined (explicitly) so that irrational assumptions about
the human decision-making process are avoided. The following
mathematics are presented as a context in which to analyse
the alternative-site factors suggested by some researchers.
These mathematics can thus be used as an example of the kind
of matters that must be explicitly considered in order to
define an alternative-site factor in both a mathematically
and behaviourally acceptable manner.

Consider the following function, which is a generalized
Grubb and Goodwin alternative-site function:


(3)  X(i) = Σ (A(k) / F(D(i,k)))


WHERE
X(i) = alternative-site function for some origin i;
A(k) = attractivity of alternative site k;
F(D(i,k)) = a function of the distance between i and k;
n = number of alternatives, and the summation is over n
    alternatives.


This equation suggests that an alternative-site function is
defined by the sum of the ratios of the attractivities of
sites to a function of their distances from some origin. For
an origin alternative-site factor, the origin is the
visitor's residence, a city or the center of population of
some region. In the case of a destination alternative-site
factor the origin may be the point of arrival in the tourist
region: an airport, the edge of town, a visitors'
information office or the entrance to a park.

The attractivities that appear in the numerator can be
measured in any of a number of ways, but current research in
Canada suggests that, at least for the purpose of
main-destination visits to a site, attractivity measures are
best estimated using the Cesario model (see TN 4). The
distance function F(D(i,k)) is usually considered to be
positively monotonic. This function can be based on several
general measures: geographic distance, travel costs, travel
time, or perceived accessibility. (For example, a visitor to
an unfamiliar resort town may impute a smaller distance to a
facility whose location is on the main street, a couple of
miles from him, than to a closer facility whose address is
on an unknown side street.)

Equation 3 clearly satisfies the three criteria
enumerated above. If, for example, a new site is included
within the area of available alternatives, the
alternative-site function increases in strength as the
attractivity-distance ratio for the new site is added to
those of the existing alternatives. Also, the more
attractive a given alternative is, the more important it is
in determining the value of the alternative-site function
compared to other sites equidistant from the point of
reference, either an origin or destination. This statement
is, of course, based on the fact that if two sites enter
into the formula and are at the same distance, the one with
the larger attractivity has a larger ratio of attractivity
to distance and therefore adds a greater amount to the
alternative measure than the less attractive site. As for
the relative accessibility of a given destination, the
inclusion of a distance function in the denominator
indicates that as an alternative destination gets further
from a given origin, the alternative factor calculated is
less influenced by this given destination.
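
The three properties just described can be checked numerically with a short sketch of Equation 3. The site values are hypothetical and the function name is illustrative; the distance function F is taken to be the identity by default, which is only one of the positively monotonic choices the text allows.

```python
def alt_site_function(sites, f=lambda d: d):
    """Equation 3: sum of A(k) / F(D(i,k)) over the alternative sites.

    sites: iterable of (attractivity, distance) pairs.
    f: distance function F, assumed positive and increasing
       (identity by default, an illustrative choice).
    """
    return sum(a / f(d) for a, d in sites)

# Hypothetical alternatives: (attractivity, distance from origin).
existing = [(4.0, 10.0), (2.0, 10.0)]

x0 = alt_site_function(existing)
x_added = alt_site_function(existing + [(3.0, 15.0)])       # a new site is added
x_farther = alt_site_function([(4.0, 20.0), (2.0, 10.0)])   # first site moved away

# Contributions of the two equidistant sites.
contrib_more_attractive = 4.0 / 10.0
contrib_less_attractive = 2.0 / 10.0
```

Adding a site raises the function, the more attractive of two equidistant sites contributes more, and moving a site farther away reduces its influence.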

As a final point, it is desirable to discuss one
apparent limitation of the alternative-site function.
Consider a region with two existing facilities serving one
city, as shown in Figure 2. In this scenario a planner is
attempting to decide between two possible sites for a new
facility - one fairly close to i (site B), the other more
distant (site A) but otherwise identical to site B. In
forecasting expected use levels, the planner uses Equation 3
as a measure of the effect of alternative sites on
attendance at the new facility. Recalling the components of
Equation 3, it is clear that the value of the
alternative-site factor is the same regardless of whether
site A or site B is chosen. Nevertheless attendance at B,
ceteris paribus, would surely be higher than at A. Now, some
planners may feel intuitively that this is at least in part
a reflection of A experiencing greater competition from the
intervening opportunities at sites 1 and 2 than would B,
which is closer to i than sites 1 and 2. This view implies
that an alternative k has a different effect on trip flows
from i to a destination j, depending on whether D(i,j) is
greater or less than D(i,k). This in fact is the basic
assumption in Stouffer's intervening opportunities model, in
which only places that are closer to origin i than j is are
included in the operational definition of intervening
opportunities. All other sites are considered to have no
competitive effect on trips to j. It is to be argued below
that this type of binary treatment of alternatives leads to
conclusions logically inconsistent either with observed trip
patterns or with the assumption of a uniform utility
function which is implicit in Equation 1.


However, before pursuing that argument, a more
sophisticated equivalent to Stouffer's definition of the
alternative-site factor is considered. The United States
Corps of Engineers (1972) suggested:

(4)  X(i) = [1 + Σ (log S(k) / D(i,k))]**2

for all log S(k)/D(i,k) > log S(j)/D(i,j), j ≠ k = 1, ..., n

WHERE the variables are the same as defined for Equation 2.


There are three differences between Equation 4 and Equation
3. First, only those ratios of log S(k) to D(i,k) which are
greater than the ratio of log S(j) to D(i,j) are included.
In other words, if S(k)/D(i,k) is less than S(j)/D(i,j) then
the alternative site k is not considered as an alternative
at all. (It should be noted that one (1) is added to the
restricted summation to prevent X from equalling zero, since
the Corps of Engineers used 1/X(j) as a regressor in their
estimating equation of reservoir use. Also the squaring of
the quantity, one plus the summation, implies that the total
effect of all substitutes increases at an increasing rate.)

The intuitive feeling that sites 1 and 2 constitute
greater competition to site A than site B reflects the
assumption that the competitive strength of an alternative
varies according to the relative location and attractiveness
of the destination j being considered. The implications of
this assumption are as follows. If it is argued that the
volume of trip flows from origin i to j is unaffected by any
site k whose A(k)/D(i,k) is less than A(j)/D(i,j), then in a
behavioural sense, this implies that the population at i do
not consider any site k to be an alternative to j if
A(k)/D(i,k) is less than A(j)/D(i,j). The corollary of this
is that there is no reason for any trips from i to j if
there exists any alternative k such that A(k)/D(i,k) is
greater than A(j)/D(i,j) since the latter, by the above
assumption, does not constitute an alternative to, and,
therefore, competition for, the former. In turn, and more
fundamentally, this implies that for any origin i, one would
expect all park users of the same type and with the same
purpose to patronise the same park, since in all but very
rare cases there is only one site which maximizes
A(m)/D(i,m), m = 1, 2, ..., n. Observation shows that all
users of the same type and with the same purpose do not
patronise the same site and therefore observation
contradicts the above implication of deterministic choice
behaviour. The only way to reconcile the deterministic
behaviour implicit above with the reality of trip flows from
i to more than one destination is to assume interpersonal
differences in people's estimations of sites' attractivities
and/or distances, so that the site with the maximum
A(j)/D(i,j) will vary from person to person, and trip
choices will vary accordingly. However, as presently
written, Equations 2 through 4 do not allow for
interpersonal differences in people's estimations of A(j)
and D(i,j), j = 1, 2, ..., n. Thus Equation 4 and (by
implication) the intervening opportunities model are
internally inconsistent.

[Figure 2: a region with two existing facilities (sites 1 and
2) serving one city, and the proposed sites A and B.]
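
The binary treatment of alternatives in Equation 4 can be made concrete with a short sketch. The reservoir values are hypothetical, the function names are illustrative, and base-10 logarithms are assumed because the text does not state the base.

```python
import math

def ratio(surface_area, distance):
    """log S / D for one reservoir (base-10 logarithm assumed)."""
    return math.log10(surface_area) / distance

def corps_factor(target, others):
    """Equation 4: [1 + sum of log S(k)/D(i,k)]**2, where the sum runs only
    over sites k whose ratio exceeds that of the target site j.

    target, others: (surface_area, distance_from_origin) pairs.
    """
    threshold = ratio(*target)
    included = [ratio(s, d) for s, d in others if ratio(s, d) > threshold]
    return (1.0 + sum(included)) ** 2

target = (1000.0, 60.0)                      # ratio 0.05
others = [(10000.0, 40.0), (100.0, 100.0)]   # ratios 0.10 and 0.02

x = corps_factor(target, others)                    # only the first site counts
x_none = corps_factor(target, [(100.0, 100.0)])     # no site exceeds the target
```

In this example the site with ratio 0.02 receives zero weight, exactly as if it did not exist, while the site with ratio 0.10 receives full weight. When no alternative exceeds the target's ratio the factor falls back to its floor of one.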

It might be argued by some that the idea expressed by
the intervening opportunities model and Equation 4 is
basically sound as a macro model. The values generated are
expected values based on stochastic assumptions rather than
deterministic predictions for individuals. An operational
flaw still in these models, however, is that they give no
weight to sites whose D(i,k) or A(k)/D(i,k) are less than
D(i,j) or A(j)/D(i,j). This could be overcome by giving
smaller, but positive, weight to alternatives with
relatively low D(i,k) or A(k)/D(i,k). But this would require
prior knowledge of the sites' attractivity values such as in
Equation 4 where S(k) is assumed to be a surrogate for A(k).
And this leads to a type of circular reasoning. For if A(k)
(an attractivity factor) is to be estimated using a model
with an alternative-site function (such as a Cesario model),
then the estimates of attractivity depend on a knowledge of
the alternative weighting factors which, in turn, depend on
the attractivity value.

Though some iterative procedure might eventually be
devised to overcome this problem, it is questionable if such
an effort is merited since there is no behavioural evidence
to suggest that this (or any other form) of variable
weighting of alternatives takes place in the human mind.
Therefore, a simpler assumption is made in the following
discussion. Specifically, alternative sites contribute to
the alternative-site factor in direct proportion to the
value of A(k)/D(i,k) (or some similar function). The result
is that a weight of unity can be applied to each term in the
summation in Equation 3. In other words, the competitive
strength of alternatives is assumed to be a quality
intrinsic to them and not something which varies according
to the destination being considered. Given this assumption,
and returning to the hypothetical planning problem, the
greater usage of site B than A can still be explained in
terms of site B having intrinsically greater strength than
site A against the competition from sites 1 and 2, on
account of B's attractivity-distance ratio exceeding A's.


THE CHOICE OF A DISTANCE FUNCTION 
AND A SET OF FACILITIES TO BE CONSIDERED 
AS ALTERNATIVES TO A GIVEN FACILITY 


Now that an alternative-site measure has been
formulated, X(i), which satisfies the criteria for such a
measure, it is tempting to accept it and use it without
considering further the behavioural implications of the
measure. However, it is a simple matter to show that the
function has very different properties depending on the way
F(D(i,k)) is defined. For example if:

(5)  F(D(i,k)) = D(i,k)**a


WHERE a is an exponent,


then the value of the exponent has a profound effect on what
X(i) suggests about the decision-making process. There are,
in fact, two considerations:


(1) how the universe of alternatives is defined (in areal
    terms),


(2) what distance function should be used.


An important point to note is that the two issues cannot
really be dealt with separately.

The question of the areal delimitation of the
population of alternative sites was an issue in both the
Grubb-Goodwin and the Cheung studies. Their arbitrary
resolution was to define the population of alternative sites
as that group of similar facilities located within 100 miles
of the origin. Although arbitrary, Cheung did find empirical
evidence that this radius was a reasonable one for planning
purposes. He found that 90 percent of day-use visitors to a
group of parks travelled less than 100 miles one-way to a
park. From a relatively loose, practical viewpoint one can
ignore the existence of distant parks; but from a more
precise, theoretical viewpoint one can still acknowledge
their influence, even if it is minimal, because of the
visitor's reaction to distance.

Whether people consider alternatives only within a
certain distance is related to a theoretical concern that
has implications for defining alternative factors which are
not pursued here. So the reader is asked to keep in mind
that the arguments presented do not adequately deal with how
people react to different distances. (See TN 14.) Early
location and transportation models typically presented the
view that "economic man" was hyper-sensitive. For example,
consider two equally attractive parks, A and B. If A is
located at 10 kilometers from a potential user, and B is
10.1 kilometers from this user, these models argued that A
would always be chosen over B. A more realistic view of the
perception of distance differentials is one that describes
the rational man as being threshold-sensitive. In the case
just presented, most users would not be sensitive to a
margin of 0.1 kilometers. If, in a different situation, the
facilities were located within a few hundred meters of the
user, a marginal increase of 0.1 kilometers could be
significant. This threshold is referred to in psychology as
the just-noticeable-difference (j.n.d.), and it has been
shown in psychophysical experiments to be proportional to
the magnitude of the stimulus, in this case distance. If
distance is implicitly used to define the set of parks that
serve as alternatives for another park (as it was used
above) any number of distance functions may be considered
relevant.
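
The threshold idea can be sketched as a simple proportional rule. The constant of proportionality (weber_fraction, set here to 5 percent) is a hypothetical value chosen for illustration; the text gives no figure for it, and the function name is likewise an assumption.

```python
def noticeably_farther(d_near, d_far, weber_fraction=0.05):
    """True if the extra distance exceeds a threshold proportional to d_near.

    weber_fraction is a hypothetical constant, not a value from the text.
    Distances are in kilometers.
    """
    return (d_far - d_near) > weber_fraction * d_near

far_case = noticeably_farther(10.0, 10.1)   # 0.1 km added to 10 km
near_case = noticeably_farther(0.2, 0.3)    # 0.1 km added to 200 meters
```

Under this assumed fraction the same 0.1 kilometer margin goes unnoticed at 10 kilometers but is noticed at a few hundred meters, which is the contrast drawn in the paragraph above.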

Figure 3 shows a hypothetical distribution of parks
around an origin. Ten rings, equally spaced, surround an
origin i. In the hinterland, covered by the rings, are a
number of parks. For purposes of this discussion they have
been uniformly spaced on a rectangular grid. The distance to
any circle, j, from the origin, is DR(j); the distance to the
next more distant circle from the center is DR(j+1). The
distance between any two adjacent circles (the width of any
ring) is "w" and this is constant. One unit of area has been
sketched on the map and is sized so that it encompasses four
parks. Thus the density of parks, "d", is four parks per
unit area. If this square area unit has sides of one unit,
application of the Pythagorean Theorem indicates that w, the
width of the rings, is about 0.7 units. The outer circle
encompasses the maximum distance that one is physically able
to drive round-trip in one day, with just a minimal amount
of time spent at a site. The extreme example would be the
case where a user drives 110 kilometers per hour (about 70
mph) for 12 hours, hops out and back into his car in a
second (thereby qualifying as a visitor) and drives home at
110 kilometers per hour for 12 hours. The outer ring, the
physical limit for a day-use trip, is 110 km x 12 hours =
1320 kilometers. In nearly all real-life day-use instances,
this physical limit is not even closely approximated. Its
use, however, avoids drawing a smaller arbitrary boundary
based on intuition or limited knowledge of past behaviour.
The physical maximum also avoids the unrealistic and
mathematically difficult situation of an infinite travel
surface.

One of the first things that is noticed when one
examines the figure is that the successive rings of equal
width, moving away from the origin, include an increasing
area. Given a uniform distribution of parks in space, as
shown, the further one goes away from the origin the greater
will be the number of sites included in a ring. The average
number of parks per unit area is d, so the number of parks
that may be expected to be found between any two circles is:


(6)  N = area of ring * density of parks
       = {(2PI DR(j) + 2PI DR(j+1))/2} (w)(d)

WHERE PI = 3.1416...

AND since 2PI DR(j) approx = 2PI DR(j+1)

IF w is small compared to DR(j),

THEN:


(7)  N approx = 2PI DR(j)(w)(d)
       approx = 2PI DR(j+1)(w)(d)


WHERE N = expected number of parks in a ring;
      DR(j) = radius of inner circle;
      DR(j+1) = radius of next outer circle;
      w = width of ring;
      d = average park density.


FIGURE 3

HYPOTHETICAL DISTRIBUTION OF DAY USE PARKS AROUND AN ORIGIN

[Figure labels: origin; alternatives; unit area; circle m =
maximum physical distance one can drive in a day and return.]
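
The quality of the approximation in Equation 7 can be examined with a brief sketch. The function names are illustrative, and the radii tested are arbitrary values chosen to show rings near to and far from the origin; w = 0.7 and d = 4 follow the description of Figure 3.

```python
import math

def parks_in_ring_exact(r_inner, w, d):
    """Equation 6: mean circumference x ring width x park density."""
    return (2 * math.pi * r_inner + 2 * math.pi * (r_inner + w)) / 2 * w * d

def parks_in_ring_approx(r_inner, w, d):
    """Equation 7: inner circumference x ring width x park density."""
    return 2 * math.pi * r_inner * w * d

w, d = 0.7, 4.0   # ring width and park density described for Figure 3

# Ratio of the approximation to the exact count for three inner radii.
ratios = [parks_in_ring_approx(r, w, d) / parks_in_ring_exact(r, w, d)
          for r in (0.7, 7.0, 70.0)]
print([round(x, 3) for x in ratios])
```

The mean-circumference expression in Equation 6 equals the true area of the ring times the density, so the ratio reduces to DR(j)/(DR(j) + w/2). The approximation is therefore poor for the innermost rings and improves steadily as DR(j) becomes large relative to w.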


If we assume all parks are equally attractive, the
attractivity component in Equation 3 can be taken as a
constant, AA, and the alternative-site function can be
rewritten as:


(8)  X(i) = AA Σ (1/F(D(i,k)))


WHERE AA = attractivity of alternatives, a constant,
and the summation is over n alternatives.


If, however, we let DR(j) >> w, then the distance from the
origin i to all the alternatives in the ring between j and
j + 1 is approximately DR(j) approx = DR(j+1). In other words,
if the ring width is sufficiently narrow, the distance to
all parks in the ring is approximately equal to the distance
to either the inner or the outer boundary of that ring.
Hence DR(j) can be substituted for D(i,k) in Equation 8,
where k is a park in the ring between circles j and j + 1.
Using Equation 7 as an estimate of the number of parks in
any ring, Equation 8 can be rewritten as:


(9) X(i) = AA SUM {2PI DR(j)(w)(d)}/F(DR(j)) 

WHERE m - 1 = number of inner circles; the summation is 
over the m-1 inner circles; and the other variables are as 
defined in Equations 3, 7, and 8. 

Moving the constant terms outside the summation gives: 

(10) X(i) = AA (2PI)(w)(d) SUM DR(j)/F(DR(j)) 

WHERE the summation is over the m-1 inner circles. 

One question that was raised earlier, and remains 
unanswered, is the choice of an appropriate distance function 
F(DR(j)) in the alternative-site function in Equation 10. 
The most common form of F(DR(j)) defines it as an 
exponential function, that is, distance raised to an 
exponent a, so that in the following section: 

(11) X(i) = SUM DR(j)/DR(j)**a 


because F(DR(j)) = DR(j)**a 


with the summation over the m-1 inner circles. 

Table 1 is a summary of the values of X(i) from 
Equation 10 for different values of a. To prepare the Table, 
AA has been set to unity, w = 0.7 units, d = 4 parks per 
square unit, and DR(j) ranges from 0.7 to 7.0 in increments 
of 0.7. 

Examination of the cells in Table 1 permits one to draw 
behavioural inferences about each value of a. For those 
values below unity, the nearer parks contribute relatively 
little to the total strength of the alternative-site factor. 
For a = 0, approximately 50 percent of the competition to 
some park comes from the large numbers of parks in outer 
zones 8, 9 and 10. Cheung suggested using an exponent of 1/2 
in his Saskatchewan day-use model. Table 1 indicates in this 
hypothetical case that, again, the nearer parks play a 
relatively small role in determining the total alternatives 
perceived at the origin. Specifically, less than 50 percent 
of the total day-use competition comes from the first six 
zones. Thus, if an exponent less than one is chosen for 
modelling purposes, the researcher has, explicitly or 
implicitly, stated that the total number of parks within the 
maximum physical driving range for day-use activity is more 
important than just the smaller number of closer, more 
accessible parks. For example, when a equals 1/2, even though 
the attractivity-distance ratio is considerably greater for 
a park in (for example) ring 2 than in ring 10, so that a 
park in ring 2 has two to three times the chance of being 
chosen than one in ring 10, there are about seven times as 
many parks in ring 10 as in ring 2. As a result, there is a 
higher probability of zone 10 being visited than zone 2. 
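
The ring-by-ring shares discussed above can be recomputed from Equation 10 with F(DR(j)) = DR(j)**a. The sketch below is illustrative only: the function name and argument names are hypothetical, and it assumes ten rings with DR(j) = 0.7, 1.4, ..., 7.0, w = 0.7, d = 4 and AA = 1, as stated for Table 1. Values computed this way need not agree with every printed cell of the table to the last digit.

```python
import math


def ring_shares(a, width=0.7, density=4.0, n_rings=10, attractivity=1.0):
    """Return (X(i), list of percent shares per ring) for exponent a.

    Each ring j contributes AA * 2*PI * w * d * DR(j) / DR(j)**a (Equation 10).
    """
    radii = [width * j for j in range(1, n_rings + 1)]
    constant = attractivity * 2 * math.pi * width * density
    terms = [constant * r / r ** a for r in radii]
    total = sum(terms)
    shares = [100.0 * t / total for t in terms]
    return total, shares


if __name__ == "__main__":
    for a in (0, 0.5, 1, 1.5, 2, 3, 5):
        total, shares = ring_shares(a)
        cells = " ".join(f"{s:5.1f}" for s in shares)
        print(f"a={a:<4} {cells}   X(i)={total:7.1f}")
```

With a = 1 every ring contributes the same share, because the growth in the number of parks with distance exactly offsets the decline in each park's weight; with a below 1 the outer rings dominate, and with a above 1 the inner rings dominate.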


An interesting consequence of this would be that a 
graph showing numbers of visitors (on the y-axis) against 
destination distance (on the x-axis) would have a positive 
slope. In the past, students of spatial interaction derived 
such curves and drew conclusions about the friction of 
distance from the slope. It bears emphasis that the slope of 
this curve is so biassed by the spatial distribution of 
alternatives, as is vividly shown in this example, that it 
is quite useless in estimating the friction of distance. In 
the present example the conclusion from the positive slope 
would be that distance had a positive rather than negative 
effect on usage, even though Equation 11 indicates the 
opposite. Even so, some current research based on more 
sophisticated methods than the distance decay curve still 
ignores the biassing effect of the spatial distribution of 
alternatives on trip flow data. 

A different inference can be drawn from the results 
when a is greater than unity. Distance and travel costs play 
much more important roles than previously, and can be 
considered more important than the increasing number of 
parks with distance in influencing the probable maximum 
distance a user will drive. The relative competitive 
advantage of nearer sites increases rapidly with higher 
a-values. For a = 2, most of the competition in 
this example comes from the first three zones; for a = 3, 


TABLE 1 


SUMMARY OF THE EFFECTS 
OF A CLASS OF ALTERNATIVE-SITE FUNCTIONS 


             Percent of Alternative-Site Function 
                Value Attributable to Each Ring 

      (inner)                                      (outer) 
  a      1    2    3    4    5    6    7    8    9   10    X(i) 

  0      2    4    5    7    9   11   13   15   16   18   676.8 
 1/2     4    6    8    9   10   11   12   13   13   14   332.3 
  1     10   10   10   10   10   10   10   10   10   10   175.8 
 3/2    20   13   12   10    8    8    8    7    7    7   105.5 
  2     33   17   12   10  [?]    5    5    5    5    2    73.8 
  3     68   16    6    4    4    *    *    *    *    *    54.5 
  5     93    5    2    *    *    *    *    *    *    *    71.4 


* Less than 1% 



most comes from the first zone; and for a = 5 nearly all 
comes from the very closest parks. 

Further interpretation of the higher a-values suggests 
that they may be appropriate for, for example, the day-use 
activities of small children, the elderly, the handicapped 
and the otherwise immobile. Conversely, very low a-values 
would more logically apply to highly mobile individuals, and 
perhaps to a special class of day-users who may desire to 
visit several facilities in one day. 


CONCLUSION 


The point of the preceding discussion is that there are 
behavioural implications that need to be considered when 
making what may seem to be a strictly empirical decision. 
The use of an alternative-site function in a day-use model 
requires the researcher to have an explicit awareness of 
his study population and activities. The behavioural or 
human forces operating in a recreation system should be 
recognized and used to evaluate the implications of any 
given empirical solution. At the extreme, then, it is 
conceivable that a model with a high R² should be rejected 
for planning purposes because of possible misleading or 
nonsensical behavioural interpretations of the empirically 
elegant model. 


MEASUREMENT OF SUPPLY 
USING NATIONAL INTERVIEW DATA 
ON PARTICIPATION IN OUTDOOR RECREATION ACTIVITIES 


J. Beaman, S. Smith, and Y. Kim 


ABSTRACT 


A recurring theme in outdoor recreation research is the 
need to develop models to forecast future use levels for 
outdoor recreation facilities. However, most aggregate 
demand models describe how demand is related to 
socio-economic characteristics of populations. Consequently 
such models are calibrated from data on observed 
participation. Their weakness is that the "observed use 
levels" are not only a function of demand but also of supply. 

This paper presents an approach to defining the effects 
of supply on the rate of participation in an activity. The 
effect of supply is measured by finding the average 
errors in predictions of the participation in an activity 
that took place in given areas. It was expected that by 
using a model that did not take into account the 
distribution of supply there would, on the average, be 
over-predictions of the amount of participation in areas 
with "low supply" for an activity and vice versa for areas 
with a high level of supply. In other words, residuals from a 
regression in which supply was not considered were expected 
to be significantly positive on the average in some areas 
and significantly negative in other areas. 

Statistically very significant residuals were found and 
do appear to be associated with availability of supply. 
These were found using 1972 Secretary of State leisure time 
data for 25,506 independent individuals, people from 
different households. 

One result of the study was to show that a significant 
effect of supply could not be determined from 1972 CORD 
Study National Survey data. Reasons for this result are 
discussed in a special appendix to the paper where sample 
size formulae and the accuracy of supply factors are 
discussed. It is concluded that to get results that are 
good enough to study how supply factors are determined by 
the supply on the ground 800,000 interviews would be 
necessary from 60 geographic areas. However, it is also 
indicated that a well designed analysis with interviews from 
a few areas (about 8) could be successful with only about 
80,000 interviews. 


PURPOSE 


The purpose of this note is to present research work on 
the development of a quantitative, behaviour-based measure 
of the effects of the supply of facilities for an activity 
on participation in that activity. 


INTRODUCTION 


A central concern in planning for outdoor recreation 
involves the development of models to forecast future use 
levels for facilities that are built. Most forecasting 
models used in recreation research have been destination 
models based on demand factors describing socio-economic 
characteristics of target populations; cost (usually as 
distance); attractiveness or quality of existing and 
proposed sites; and occasionally measures of the numbers of 
alternative facilities. Such models are calibrated from 
observed participation figures (see Chapters I, II, III, VII 
and IX of this volume in which relevant notes are included). 
Origin models have also been developed, but in these it is 
almost invariably only socio-economic variables that are 
considered. A critical weakness of this approach 
is that the models developed do not reflect supply. They 
make participation only a function of demand factors. The 
1969 study by Cicchetti, Seneca and Davidson is one of the 
rare cases where a "supply factor" has been incorporated 
into an analysis so that the level of supply does influence 
the amount of participation predicted. 

The reason that other quantitative studies reported on 
in this volume, such as those on the use of ANOVA and AID 
models (TN 12, TN 20), do not relate people's total behaviour 
to the amount of supply around them is not simple. A major 
problem is the one encountered by Cicchetti, Seneca and 
Davidson: information on supply which was at 
all adequate for use in a model was rarely available. The 
research reported on in TN 16 and the Facility Inventory, 
reported on in Volume III, show that CORD Study research has 
focused on how to inventory supply that is on the ground. 
The lack of success of the Canadian Outdoor Recreation 
Demand Study Facility Inventory (as reported on in Volume 
III) meant that inventory data were not available for 
building the kind of model proposed by Cicchetti, Seneca and 
Davidson, and suggested by Knetsch for the CORD Study demand 
analysis (see Chapter I). 

By way of introduction one final point seems 
important. A traditional approach to modelling supply 
effects on participation has been to decide intuitively on 
some physical resource as a limiting factor for 
participation in a given activity at a given location. For 
example, one may choose as a measure of "camping supply" the 
number of campsites in a ring around a population centre, 
divided by their distance from that centre, summed over a 
number of concentric rings. This number can then be used in 
a model to adjust for probable expansion or contraction of 
supply. A concern raised by this approach, however, is just 
why one measure of supply is used rather than another. 
In modelling it is frequently difficult to say, a priori, 
which model is better among a multitude of alternatives. A 
researcher may even be able to decide which class of models 
is more appropriate from a behavioural point of view, but he 
still has to choose between many possible permutations of a 
given model. An empiricist may not share this problem of 
choosing the "best" model since his decision can be made on 
a straightforward and objective basis--the best model is the 
one which most closely fits observed data. This 
choice-making tactic has a major weakness for planning 
purposes--the model chosen on an empirical basis is not 
necessarily generalizable to other situations of recreation 
supply-demand systems. This difficulty of deciding on a 
most appropriate model is also discussed for related topics 
in TN 3, 12, 19, 20, 27, 30 and 35. 

The supply factor computation described in this paper 
has the desirable quality of avoiding the difficulty of 
having to select among several formulations. Further, once 
a supply factor is obtained it may be possible to explain 
this empirically derived factor by using conventional supply 
measure formulae. For those regions in which there is 
evidence of supply effects on participation it may be 
possible to examine the levels of facility and resource 
development logically or intuitively associated with those 
activities exhibiting possible supply effects. Ideally this 
analysis would lead to the further identification of what 
natural and man-made resources (in terms of quantity, 
quality, location, costs, advertising, management policies 
and so on) define supply for a given activity. 


HISTORY OF THE PRESENT WORK 
OF DEVELOPING A MEASURE OF SUPPLY ON PARTICIPATION: 
INTRODUCTION TO THE METHODOLOGY 


Work on a version of this paper began in 197[?]. This 
work began using 1972 National Survey information broken 
down so that observations were associated with 75 different 
geographical areas in Canada (on these data and their 
geographic codings see Volume III). As work on various 
Canadian Outdoor Recreation Demand Study projects proceeded, 
it became clear that considerations of the way that people 
reacted to alternative facilities made it important to 
derive a measure of how people perceived supply. It was 
recognized that this should not be some ad hoc formulation 
that a researcher dreamed up, such as the following supply 
measure for an origin area for some activity: 


(1) Supply Measure = SUM {(attractiveness of a site)**A / 
                          (distance from the origin to the site)**B} 

WHERE the sum is over all of an origin's sites for a given 
activity. 

The general kind of supply measure just defined is 
called an alternative factor elsewhere (TN 1, TN 3) and used 
in other ways in other research (TN 5, TN 11). Regardless, it 
suggests that a person perceives the amount of supply based 
on how far it is from him and how attractive it is. The 
issue of concern here is: "Is the function that was chosen 
correct?" An approach that may ultimately lead to an answer 
is not to start with a function but to start with 
assumptions and proceed to derive values which reflect 
people's response to supply. Having such values or supply 
measures for each of a number of geographic locations allows 
one to work backwards to find the function that explains the 
measures in terms of what is on the ground (see TN 4 for 
Cesario's approach to explaining attractiveness values). 
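
For concreteness, the ad hoc supply measure of Equation 1 can be written as a short function. This is only a sketch of the formula as stated; the function name, the representation of sites as (attractiveness, distance) pairs and the example numbers are assumptions made for illustration, and the exponents A and B are exactly the arbitrary choices the text cautions about.

```python
def supply_measure(sites, exponent_a, exponent_b):
    """Equation 1: sum over sites of attractiveness**A / distance**B.

    `sites` is an iterable of (attractiveness, distance) pairs for one
    origin and one activity. Distances must be positive.
    """
    total = 0.0
    for attractiveness, distance in sites:
        if distance <= 0:
            raise ValueError("distance from origin to site must be positive")
        total += attractiveness ** exponent_a / distance ** exponent_b
    return total


if __name__ == "__main__":
    # Two hypothetical sites: (attractiveness, distance from the origin).
    example_sites = [(4.0, 2.0), (9.0, 3.0)]
    print(supply_measure(example_sites, exponent_a=0.5, exponent_b=1.0))
```

Changing A or B changes the ranking of origins by "supply", which is why the paper prefers to derive supply values from behaviour first and look for an explaining function afterwards.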

The conceptual strategy adopted was to measure the 
effects of supply in several steps: 


(1) Assume that supply was homogeneously distributed in all 
regions in Canada; 

(2) Use an existing model (such as the one reported on in TN 
12) to make estimates of the amount of participation in 
some activity in each of these regions; 

(3) Obtain actual participation levels and compare these to 
the predictions; 

(4) Examine the differences between actual and predicted 
participation figures to see if this difference shows 
that there is a statistically significant difference 
between observation and predictions that may reasonably 
be accounted for by the level of supply in the different 
geographical areas considered. 


The kind of supply factor being described can be 
defined as follows: 

SF(i) = Supply factor for area i 
      = SUM (observation - prediction)/N(i) 


WHERE the sum is over all persons from i for whom there are 
      data in the survey used and N(i) is the total number 
      of such people in i. 
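
The definition of SF(i) amounts to averaging residuals within each area. A minimal sketch follows; the record layout (area, observed, predicted) and the function name are assumptions for illustration, with `observed` coded 1 for a participant and 0 for a non-participant and `predicted` being the model's probability of participation.

```python
from collections import defaultdict


def supply_factors(records):
    """Return {area: mean of (observed - predicted)} over the persons in each area."""
    residual_sums = defaultdict(float)
    counts = defaultdict(int)
    for area, observed, predicted in records:
        residual_sums[area] += observed - predicted
        counts[area] += 1
    return {area: residual_sums[area] / counts[area] for area in residual_sums}


if __name__ == "__main__":
    # Hypothetical survey records: (area, observed participation, predicted probability).
    survey = [
        ("A", 1, 0.2), ("A", 0, 0.2),
        ("B", 0, 0.3), ("B", 0, 0.1),
    ]
    print(supply_factors(survey))
```

A positive value means people in the area participate more than their socio-economic characteristics alone would predict, and a negative value means they participate less.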


Given the assumptions just cited, a negative sum of 
residuals, and thus a negative average value of the 
residuals, could be expected for a region when observed use 
levels are less than predicted. Because this average is 
negative even taking into account the socio-economic 
factors, it is reasonable to refer to this average value of 
residuals as a supply factor for the given region for the 
activity considered, subject to some qualifications 
introduced subsequently. If an area has abundant facilities 
for the activity considered (that is, if it exceeds a 
national average) and these are being used, it may be 
expected that observed participation will exceed the 
national average, and in this case the average value of a 
residual will be positive. So, the supply factor defined by 
the average value of residuals for the geographic area will 
be positive. 


There are some points of criticism that can be raised 
about the rather naive formulation just introduced, but 
these are only raised in the discussion section of the 
paper. Further background is now given on the actual 
estimation attempts that were carried out in an effort to 
obtain supply measures and to verify that these were 
statistically significant. This is because there has been a 
history of problems in developing what may already sound to 
the reader like a very simple measure of how people respond 
to supply. 

Initially, when predictions were made about people's 
behaviour in various outdoor activities, both using the 
model indicated in TN 12 and using a model generated by the 
Michigan AID program (see TN 20), the supply effects 
determined were highly variable. This was true even though 
they were based on interview information from 4,000 people, 
and was true both in trying to explain participation and 
non-participation and in trying to explain the amount of 
participation. The amount of participation was finally 
disregarded in the analysis for three reasons: (1) in 
Canadian Outdoor Recreation Demand Study National Survey 
data, people's reported volume of participation may reflect 
enthusiasm for an activity, which destroys the relationship 
between the total volume of supply in an area and 
participation; (2) the amount of participation reported is 
known to be very unreliable; and (3) it seemed more 
plausible when this research was being carried out to assess 
the amount of supply perceived to be available by assessing 
general perceptions of this supply than to let the analysis 
be dominated by a fairly high level of participation by some 
people in some geographic areas whereas in other geographic 
areas there may be a quite general but lower level of 
participation by the people. 

Regardless, the initial analysis attempt involved 
developing both frequency of participation based supply 
measures and participation or non-participation based supply 
measures. The results produced were disappointing. After 
much computer programming and sorting out of residuals for a 
large number of activities, it was concluded that 
significant supply factors had been estimated. However, 
these supply factors had been estimated on the assumption 
that the CORD Study National Survey information resulted 
from interviews in 4,000 different households to get the 
information on 4,000 different individuals. Subsequent 
checking of this matter showed that the commercial company 
that carried out the interviews interviewed more than one 
person in a large number of residences. This introduced a 
correlation between participation figures that resulted in a 
significant relation between supply and participation 
becoming an insignificant relationship. Many hours of work 
carrying out rather sophisticated statistical tests to show 
a result significant at the 5% level were simply lost! 

The only option that offered any promise for measuring 
the effect of supply on participation was to use a data set 
in which the kind of intercorrelation just cited did not 
occur, ideally a data set which was much larger so that 
there would be a much better chance of observing a supply 
effect. One problem with 4,000 observations and 75 
geographic areas was that there is not much information 
about each area and consequently the variation of each 
supply factor tends to be large in comparison to anything 
that is to be explained (see the Appendix). 

Fortunately, in 1972 data were collected by Secretary 
of State, Citizenship Branch, on participation in leisure 
activities. These data included information on 
participation in some outdoor activities and thus 
offered good potential for analysis. Unfortunately, the 
52,000 interviews carried out in this study were not carried 
out in different residences, but it was possible to select 
about 25,000 interviews from the Secretary of State data set 
on the basis of a random selection of single interviews out 
of households. This gave a reduced tape on which all of the 
interviews could be considered to be independent. So, after 
supply factor research work had continued for over two years, 
a new data set was adopted. 
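
The selection step described above, one randomly chosen interview per household, can be sketched as follows. The original selection program is not reproduced in this paper, so the function name, the (household_id, respondent) record layout and the fixed random seed are all assumptions for illustration. The sketch also keeps a count of the interviews found in each household, since a variable of that kind appears among the data actually used.

```python
import random
from collections import defaultdict


def select_one_per_household(interviews, seed=0):
    """Pick one interview at random from each household.

    `interviews` is an iterable of (household_id, respondent) pairs.
    Returns a list of (household_id, respondent, n_interviewed_in_household).
    """
    rng = random.Random(seed)  # fixed seed so the selection is repeatable
    by_household = defaultdict(list)
    for household_id, respondent in interviews:
        by_household[household_id].append(respondent)
    return [
        (household_id, rng.choice(respondents), len(respondents))
        for household_id, respondents in by_household.items()
    ]


if __name__ == "__main__":
    tape = [(1, "a"), (1, "b"), (2, "c"), (3, "d"), (3, "e"), (3, "f")]
    for row in select_one_per_household(tape):
        print(row)
```

Because each retained respondent comes from a different household, the within-household correlation that undermined the earlier significance tests does not arise in the reduced data set.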

The data from the Secretary of State tape which were 
actually used in the analysis are indicated in Figures 1 and 
2. The sixty-two geographic areas used are relatively 
homogeneous economic areas of the different Canadian 
provinces as defined by Statistics Canada for the 1972 
Labour Force Survey. 

Now, one final point before proceeding to the presentation 
of results. As research work on this project went ahead, it 
was recognized that estimation need not always be carried 
out by producing residuals and using a special program to 
process these in a number of steps. The reader who is 
familiar with the theory behind least squares estimation 
will know that if one sets up a model such as the one 
indicated by the equation following, then he can estimate 
the coefficients in the model by simply using a regression 
analysis program: 


Probability of person in    = M + socio-economic effects 
region r with given             + region r supply effects 
socio-economic                  + error 
characteristics 
participating 

WHERE (M + ... + e) is as defined in TN 12. 


The regional supply effect is the regression coefficient 
that "explains" how, on the average, the region differs from 
an average region after correcting for socio-economic 
effects. 

In actually carrying out computations to determine if 
the supply factors indicated in the equation shown above 
were significant, first a regression was carried out in 
which no supply factors were included. This allows one to 


FIGURE 2 


SECRETARY OF STATE DATA VARIABLES AND THEIR LEVELS 
AS USED IN DERIVING SUPPLY FACTORS 


         VARIABLES                            LEVELS OF VARIABLES 
IDENTIFICATION   DESCRIPTION                  IDENTIFICATION  DESCRIPTION 

VAR005           Population of                1               Under 20,000 
                 Sampling Area                2               20,000-49,999 
                                              3               50,000-99,999 
                                              4               100,000 and Over 

VAR006           Gender of                    1               Female 
                 Respondent                   2               Male 

VAR007           Age of                       1               Under 16 
                 Respondent                   2               17-24 
                                              3               25-44 
                                              4               45 and Over 

VAR059           Marital Status               1               Single 
                                              2               Married 
                                              3               Other 

VAR060           Relation to the              1               Spouse 
                 Head of Household            2               Son or Daughter 
                                              3               Relative, Boarder, 
                                                              Employee, Other 
                                              4               Head 

Household*       Number of Individuals        1               1 
                 in Household                 2               2 
                 Interviewed                  3               3 or 4 
                                              4               5 and Over 

ORIGIN           Origin of Respondent         1-62            For description 
                                                              see Figure 1 

FISHING/HUNTING  Participation in Fishing     0               Non participant 
                 and/or Hunting               1               Participant 

SNOWSKIING       Participation in             0               Non participant 
                 Snowskiing                   1               Participant 


*Note: This variable was not on the original tape. It was generated 
       by a program that selected one individual from each household 
       and at the same time counted the number of individuals 
       interviewed in the household. 


determine the variance explained when supply is considered 
not to be important. It also allows one to know how much 
variance remains to be explained. Then, when 62 additional 
parameters were introduced, the 62 geographic areas 
(indicated in Figure 1) were considered and supply factors 
computed for them. To assess the statistical significance of 
these supply factors, one need only ask whether the 
introduction of these parameters has resulted in more 
variance being explained than would be expected to be 
explained by chance (for a similar test see TN 27). In this 
particular case the introduction of the supply factors, as 
indicated subsequently, should have explained less than 2.5 
thousandths of the variance that remained to be explained 
after socio-economic factors were considered. Yet far more 
variance than this was explained. 
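
The comparison with chance can be made concrete. When q extra parameters with no real effect are added to a least-squares model fitted to n observations, they are expected to absorb roughly q/(n - p) of the remaining sum of squares, where p is the number of parameters already in the model; the usual F ratio compares the reduction actually obtained with that expectation. The sketch below is illustrative only: the function names are hypothetical, and the figure of 20 socio-economic parameters is an assumption, not a number taken from the study.

```python
def chance_share(n_obs, n_base_params, n_added):
    """Approximate share of the remaining variance that n_added
    irrelevant parameters would explain by chance alone."""
    return n_added / (n_obs - n_base_params)


def f_statistic(sse_reduced, sse_full, n_obs, n_base_params, n_added):
    """F ratio for testing the added parameters.

    sse_reduced: residual sum of squares without the added parameters.
    sse_full:    residual sum of squares with them.
    """
    df_residual = n_obs - n_base_params - n_added
    explained_per_param = (sse_reduced - sse_full) / n_added
    unexplained_per_df = sse_full / df_residual
    return explained_per_param / unexplained_per_df


if __name__ == "__main__":
    # 62 regional parameters, 25,506 respondents, and an assumed
    # 20 socio-economic parameters already in the model.
    print(round(chance_share(25506, 20, 62), 5))
```

With 62 added parameters and about 25,500 respondents the chance share is a little over two thousandths, so any substantially larger reduction in the residual sum of squares points to real regional effects.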

To be very specific, the regression analyses were 
carried out as described because it was more convenient to 
carry out regressions, with only socio-economic variables 
and with both socio-economic variables and supply effects, 
than to follow the more complicated analysis procedure 
described subsequently for the AID analysis. This is 
because results were generated with one run of the 
regression program made with a few control cards. 

However, one caution is appropriate at this point. The 
procedure just outlined is perfectly adequate for 
determining if there are significant supply effects. But 
there is the remote possibility that the supply factors 
computed may have variance in common with the socio-economic 
effects which are included in the model. If one wanted to 
get supply factors that were absolutely not "polluted" by 
socio-economic effects, it would be necessary to make a 
first regression run and produce the residuals from this 
regression run. Then the supply factor would be computed 
directly from the residuals (by a regression or some less 
costly procedure). The point is that when a regression is 
carried out and then the residuals are subjected to further 
analysis, the variance that is found in the residuals is 
variance which is not in common with the variance that has 
previously been explained by the socio-economic variables 
used in the original regression. This occurs because, as is 
well known for least-squares estimation, the residuals are 
uncorrelated with, or in other words orthogonal to, the 
parameter space. Regardless, examination of Table 2 shows 
clearly that whether or not supply factor effects were 
computed the socio-economic effects had roughly the same 
value; e.g., in row 11, one sees that the effects for snow 
skiing of being at level 1 of variable 59 are .0178 and 
.0160, which differ by less than .003, which is the standard 
deviation for both coefficients. 
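
The orthogonality property invoked above is easy to verify on a small case. The sketch below fits a straight line by least squares to a handful of made-up points (the data and function names are purely illustrative, and a single regressor stands in for the full set of socio-economic variables) and shows that the residuals sum to zero and are uncorrelated with the regressor.

```python
def fit_line(x, y):
    """Ordinary least squares for y = b0 + b1 * x; returns (b0, b1)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def residuals(x, y):
    """Observed minus fitted values from the least-squares line."""
    intercept, slope = fit_line(x, y)
    return [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]


if __name__ == "__main__":
    x = [1.0, 2.0, 3.0, 4.0, 5.0]
    y = [2.0, 1.0, 4.0, 3.0, 6.0]
    res = residuals(x, y)
    print(sum(res))                                # zero up to rounding error
    print(sum(xi * ri for xi, ri in zip(x, res)))  # zero up to rounding error
```

Any further analysis of such residuals can therefore only pick up variation that the first-stage regressors did not already explain.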

If one carries out an analysis of the relation between 
socio-economic variables and participation using the 
Michigan AID program, then, once the residuals from the AID 
analysis have been written out on magnetic tape, one need 
only carry out an analysis of the residuals based on the 
following equation to see if significant supply factors can 
be estimated: 


Residual for a person   = (regional supply effect for r) 
in a region r               + error 

                        = SF(i) + e 


In actual fact, when AID was used in estimating supply 
effects and the residual differences between observations 
and predictions were written on magnetic tape so that these 
could be analysed using the regression program, it was 
realized that the regression program need not be used. 
Because only one supply effect applied to each individual 
and the residuals already had a mean of zero overall, a 
special purpose "regression program" was written which 
calculated the supply factors presented in Table 3. This 
program took advantage of the fact that each supply factor 
was only a weighted sum of residuals. Once the "residual 
tape", which had region as an identifier, was sorted by 
region, observations were read one after the other, with 
supply factors being produced as subtotals and total sums of 
squares explained being accumulated throughout the run. The 
supply factors program was also used to compute sums of 
squares explained and sums of squares that could not be 
explained by supply factors (see Tables 1, 2, and [?]). It 
then became possible, as described before, to compare the 
sums of squares explained with those that could be expected 
to be explained by chance. 
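
The single-pass procedure just described can be sketched in a few lines. The original special-purpose program is not reproduced in this paper, so the function name and the (region, residual) record layout are assumptions for illustration. The sketch uses unweighted residuals, requires the input to be sorted by region as the tape was, and relies on the stated condition that the residuals have an overall mean of zero, in which case the sum of squares explained by the regional means is the sum over regions of (subtotal squared) / (number of persons).

```python
from itertools import groupby
from operator import itemgetter


def supply_factors_from_sorted(residual_tape):
    """One pass over (region, residual) records sorted by region.

    Returns (factors, ss_explained, ss_unexplained), where factors maps
    each region to its mean residual.
    """
    factors = {}
    ss_explained = 0.0
    ss_total = 0.0
    for region, group in groupby(residual_tape, key=itemgetter(0)):
        count = 0
        subtotal = 0.0
        for _, residual in group:
            count += 1
            subtotal += residual
            ss_total += residual * residual
        factors[region] = subtotal / count
        ss_explained += subtotal * subtotal / count
    return factors, ss_explained, ss_total - ss_explained


if __name__ == "__main__":
    tape = [("A", 0.5), ("A", 0.1), ("B", -0.2), ("B", -0.4)]
    print(supply_factors_from_sorted(tape))
```

Because each record is read once and only running totals are kept, the whole computation needs no more storage than one region's subtotals, which suits sequential processing of a sorted tape.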
TABLE 2 


SUMMARY RESULTS ON AID ANALYSES 


          SNOW SKIING                      HUNTING AND FISHING 

No. of People  Mean for          No. of People  Mean for 
in the Group   the Group  Group  in the Group   the Group  Group 

      411       0.241      16        2[?]7        .556      [?] 
     1019       0.184      28         [?]         .545      [?] 
      178       0.174      20         335         .540      [?] 
      513       0.140      [?]       2023         .453      [?] 
      355       0.130      15        2008         .388      [?] 
     1077       [?]        18        2167         [?]       [?] 
     54[?]      0.096      29        2455         .285      [?] 
     1976       0.088      19         826         .276      [?] 
     2253       0.073      [?]       1565         .235      [?] 
      680       0.063      23         400         .213      [?] 
      558       0.045      [?]        367         .166      [?] 
      [?]       0.043      26        3740         .149      [?] 
     1259       [?]        24         [?]         .088      [?] 
    10190       0.014      27        1088         .070      [?] 
      162       0.012      25        2289         .033      [?] 



What is being alluded to becomes clearer when 
the actual results of analysis are presented; for now it 
should suffice to indicate that the actual regressions 
performed produce a weighted sum of residuals. Whether 
socio-economic effects are first estimated and eliminated 
and supply effects then computed, or the socio-economic 
variables are included along with the supply factors, the 
supply factors are determined by a formula which indicates 
that a supply effect is a weighted sum of residuals: 


Supply effect for   = (SUM W(i) residual(i))/(SUM W(i)) 
region r for an 
activity 


The weighting of residuals to reflect the variance in 
observations is mentioned in TN 6 and 20 and discussed in 
detail in the Cicchetti and Smith article which appears as 
an appendix to this volume. 
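
The weighted form of the supply effect is a weighted mean of the residuals for the persons in a region. A minimal sketch follows; the function name is hypothetical and the weights in the example are arbitrary, since the weighting actually used is described in the technical notes cited above rather than here.

```python
def weighted_supply_effect(residuals, weights):
    """(SUM W(i) * residual(i)) / (SUM W(i)) for the persons in one region."""
    if len(residuals) != len(weights):
        raise ValueError("residuals and weights must have the same length")
    total_weight = sum(weights)
    if total_weight == 0:
        raise ValueError("weights must not sum to zero")
    weighted_sum = sum(w * r for w, r in zip(weights, residuals))
    return weighted_sum / total_weight


if __name__ == "__main__":
    print(weighted_supply_effect([0.2, -0.1, 0.4], [1.0, 2.0, 1.0]))
```

With equal weights the expression reduces to the simple average of residuals used to define SF(i).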


RESULTS 


The results of this analysis are presented in Tables 1 
through 5 and Figure 1. Tables 3 and 4 permit the reader to 
see in certain respects how different ANOVA and AID analyses 
compare. In particular Table 2 presents regression 
coefficients which are laid out in such a way that one can 
see that, whether or not supply factors are incorporated 
into the snow skiing model or the hunting and/or fishing 
participation model, the effects of socio-economic variables 
remain the same. In particular if one looks at the first 
and third columns of Table 1 one finds the effect for level 
1 of VAR006 (being female) indicated by a coefficient of 
-.00766 with a relatively small standard deviation of 
.0026. This value occurred when only socio-economic effects 
were considered in developing a model. This coefficient 
only shifted in value to -.00759 when origin effects 
were introduced. This shift is well within one standard 
deviation. The similarity of coefficients is even more 
striking when one looks at effects for different levels of 
variable 007. These show the effects of age on 
participation. For example, the coefficient for the 
youngest age group is .0288 whether or not origin effects 
have been incorporated into the snow skiing model. 

In Tables 1 and 5, it will be noticed that the F-tests 
for the regressions are given. All of the models developed 
have F-test values which are very highly significant. At 
the same time all of the models have R² values which are low 
enough that many people would consider the models quite 
unacceptable (on this matter see particularly TN 35 and 
comments in TN 6). In reality the low R² values only 
reflect the problem involved in predicting the actions of an 
individual. As shown in TN 6, when predictions are made for 
many people the accuracy of results on what percent will 
participate in an activity can be plus or minus a few 
percent using CORD Study data. 

Those readers who are familiar with the AID program
will understand why the coefficients from the AID program
that explain behavior in terms of socio-economic
characteristics are not included in Table 2. Those who are
not familiar with the AID program may wish to refer to CORD
Study TN 4 or 27 or to a reference on the AID program (see
Reference 20). When the AID program is used, people are
grouped into relatively homogeneous clusters based on
complicated collections of socio-economic characteristics.
These clusters serve to identify people for whom an
appropriate estimate of their probability of participating
in an activity is the group mean for the terminal group they
are assigned to on the basis of the socio-economic
characteristics they have. The important information on AID
analysis which can be displayed in a summary form is the
size of the groups which are identified by the AID program
and the various means for these groups. Actually these means
are the prediction that would be made if one wished to
estimate the probability of participation for the people in
a particular group.
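
The summary just described (group sizes and group means) can
be sketched as follows. This is a minimal illustration under
stated assumptions: it does not perform the AID splitting
itself, the group labels and the zero-one participation
values are invented for the example, and the function name
is hypothetical.

```python
from collections import defaultdict


def terminal_group_summary(group_ids, participated):
    """Return {group: (size, mean participation)}.

    group_ids    -- terminal group assigned to each person
    participated -- 1 if the person took part, else 0
    """
    totals = defaultdict(lambda: [0, 0])  # [count, participants]
    for group, y in zip(group_ids, participated):
        totals[group][0] += 1
        totals[group][1] += y
    return {g: (n, s / n) for g, (n, s) in totals.items()}


# Hypothetical data: two terminal groups of 4 and 6 people.
groups = [1, 1, 1, 1, 2, 2, 2, 2, 2, 2]
took_part = [1, 1, 0, 0, 0, 0, 0, 0, 1, 0]

summary = terminal_group_summary(groups, took_part)
for g in sorted(summary):
    size, mean = summary[g]
    # The group mean is the predicted probability for members.
    print(g, size, round(mean, 3))
```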

From Table 2 it can be seen that there was a mean 
participation for the whole universe of .05344 (the average 
value of the unweighted dependent variable) for snow skiing.

From Table 3, one can see that there was one group of
411 people identified for which a most appropriate
prediction of their probability of participation was .24.
Also, around 2,500 people out of the 25,000 for whom the
model was developed are in groups 20, 21, 6, 2 and 16, for
which the mean for the group is more than twice the mean of
the dependent variable. One notes a similar pattern for the
AID results on hunting and fishing. However, for hunting it
is much more striking because, based on the mean of the
dependent variable of about .2, one suspects that a person
has one chance in five of being a hunter, yet in the AID
analysis certain socio-economic characteristics are
sufficient to identify people to the point that one can
reasonably assign these people one chance in two of
participating in hunting. At the other extreme, with respect
to hunting and snow skiing one can see that there are large
numbers of people in groups with group means that are less
than half of the average participation rate for the
population, below .10 and below .027 respectively.

The results on the AID analysis that have been
presented are obviously unsatisfactory to one who is
interested in why a certain group has a certain group mean.
However, to present the detailed results on how different
groups were defined in terms of socio-economic variables and
to comment on this takes one into matters which are quite
irrelevant to the general theme of this paper. For that
reason further AID results are not included here. The data
used in this analysis are available from the Waterloo
Research Institute, Leisure Studies Data Bank, should someone
wish to pursue the matter of why certain AID clusters occur.

Table 4 is where one for the first time sees the actual
supply factors which were calculated in this analysis. One
can manually compare the supply factors to see that the
supply factors computed when AID was used to explain
participation in terms of socio-economic variables agree
with the supply factors computed when ANOVA was used.
However, Table 5 provides weighted correlation coefficients
which will probably give one a better idea, at least a
quantitative idea, of the degree of agreement between the
different supply factors. Standard deviations in the supply
factors were used as weights in computing the correlation
coefficients. The correlation coefficients between the 3
sets of snow skiing supply factors and the four sets of
factors for hunting and fishing, being in the general order
of magnitude of .9, indicate that any one of the supply
factors explains around 80% of the variance in the other
supply factors for the same activity. And, given the
standard deviations in the supply factors, this level of
explanation is all that can be expected. So, the results of
the analysis appear very good in one respect. That is that
the supply factors computed from the AID and the ANOVA
analyses agree very well. One could say that this agreement
shows something about the reliability of the supply factors
when they are determined in a similar way.
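
The step from a correlation of about .9 to "around 80% of
the variance" is simply the square of the correlation. A
minimal sketch follows. The weighted correlation shown is a
generic one; the text says only that standard deviations in
the supply factors were used as weights, so the exact
weighting scheme here is an assumption, as are the function
names and the sample values.

```python
import math


def weighted_correlation(x, y, w):
    """Weighted Pearson correlation of x and y with weights w."""
    sw = sum(w)
    mx = sum(wi * xi for wi, xi in zip(w, x)) / sw
    my = sum(wi * yi for wi, yi in zip(w, y)) / sw
    cov = sum(wi * (xi - mx) * (yi - my)
              for wi, xi, yi in zip(w, x, y)) / sw
    vx = sum(wi * (xi - mx) ** 2 for wi, xi in zip(w, x)) / sw
    vy = sum(wi * (yi - my) ** 2 for wi, yi in zip(w, y)) / sw
    return cov / math.sqrt(vx * vy)


def share_of_variance_explained(r):
    """Fraction of variance in one set explained by the other."""
    return r * r


# Hypothetical supply factors from two analyses of one activity.
anova_factors = [0.08, -0.02, -0.05, 0.03, -0.04]
aid_factors = [0.07, -0.01, -0.06, 0.04, -0.04]
weights = [1.0, 2.0, 1.0, 2.0, 1.0]

r = weighted_correlation(anova_factors, aid_factors, weights)
print(round(r, 3), round(share_of_variance_explained(r), 3))
print(round(share_of_variance_explained(0.9), 2))
```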

But one may be concerned about the validity of these
supply factors. Does the magnitude of the different supply
factors show something that would be expected in terms of
what is known about the distribution of supply for snow
skiing and for hunting and fishing in Canada? Now, rather
than meticulously examining the numbers in Table 4 in trying
to answer this question, one can look at Figure 1 in which
origin information and supply factors are displayed
together. There one sees that people hunt and fish and also
snow ski in the mountains of Western Canada. This is no
surprise. One sees that skiing is below the national
average in the Canadian prairies where supply conditions are
poor and it is well above the national average in the areas
around Quebec City and Montreal where there is an abundance
of ski slopes. Again one sees large negative supply factors
in Southern Ontario and the Maritimes, which are areas which
offer few opportunities for snow skiing because of the
weather. It is left for the reader to examine the
differentials in detail to decide if certain details
correspond with what he knows to be the case by experience.
However, in this examination the reader should keep in mind
that some of the regression coefficients (supply factors) do
have rather large variances. For example, the difference in
the supply factors for Regina and Saskatoon, Saskatchewan
could be taken as reflecting the existence of the Blackstrap
"Mountain" ski development near Saskatoon while Regina has
no comparable supply, but this is a tenuous conclusion
because it is based on rather weak evidence. Still, Thunder
Bay and other special skiing areas do regularly appear to
have the supply factor which one might expect.


Given the statistical significance of many of the
supply factors and given the way that the associated
relationship makes sense in terms of what most Canadians know
about supply for the activities considered, there may appear
to be little reason to give statistical tests for the
significance of supply factors. However, such significance
tests are presented in Table 6. There one sees that all of
the supply factors computed are highly significant in terms
of the sums of squares which they explain above and beyond
the sums of squares that are explained by socio-economic
variables. However, the reader may find Table 6
particularly distressing in terms of the R² values presented
there. The feeling may be that it was bad enough to see R²
values of .05 or .10 for the socio-economic effect models
but the values for supply factors are absolutely too low to
be meaningful. Actually, they are not too low to be
meaningful. The fact that supply factors explain another
1-4% of the variance in participation after socio-economic
variables have explained 5% really shows that supply factors
are too important to ignore if models are to be developed
that are to be at all accurate in explaining people's
participation in activities in particular areas of Canada.

One can see the importance of the supply factors, even
though they only increase R² by 20 to 30%, in terms of the
following argument. Consider that R² is increased from V by
an amount rV where r is, say, .25 (as it was when supply
factors were introduced into the unweighted model to explain
participation in hunting and/or fishing). It may be noted
that for the relation defined above to hold, the
contributions of the supply factor to R² are not defined by
r but by the square root of r. This is the case because the
sums of the squares of these effects, which are orthogonal
to the socio-economic effects, must add up to rV, and if the
socio-economic factors have an amplitude X and the supply
factor an amplitude SQRT(r)X the sum of squares is X² +
(SQRT(r)X)². So, in practical terms, if r is .2 or .3 this
implies that supply factors are on the average 40 to 60% of
the magnitude of socio-economic factors. If effects of this
size are ignored in making predictions for particular
geographic areas one can see that substantial errors could
result. However, one must recognize that the mean
participation level is not considered when R² is computed.
Thus, though the preceding shows that the small
contributions of supply to R² should not result in one
considering them to be unimportant compared to socio-economic
variables, other matters should be considered in
determining their overall importance in making correct
predictions. Consequently, this matter is returned to in
the discussion section of this paper.
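
The square-root argument can be checked with a few lines of
arithmetic. This is a minimal sketch; the function names are
hypothetical, and the amplitude X is set to 1 purely for
convenience.

```python
import math


def relative_amplitude(r):
    """Supply amplitude as a fraction of socio-economic amplitude.

    r is the fractional increase in R² when supply factors
    are added (rV on top of V).
    """
    return math.sqrt(r)


def total_sum_of_squares(x, r):
    """X² + (SQRT(r)X)², the sum of squares of both effects."""
    return x ** 2 + (math.sqrt(r) * x) ** 2


for r in (0.2, 0.25, 0.3):
    # r, the implied relative amplitude, and the total relative to X² = 1
    print(r, round(relative_amplitude(r), 3),
          round(total_sum_of_squares(1.0, r), 3))
```

For r between .2 and .3 the relative amplitude runs from
about .45 to about .55, which falls within the 40 to 60%
range stated above.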


DISCUSSION


The results presented in the last section are quite
definitive. They leave no doubt that there are significant
supply effects that can be calculated for different regions
of Canada that help explain the amount of participation in
various activities. However, as is made clear in the
appendix of this note, extremely large sample sizes are
required to measure these supply effects accurately enough
so that they can be subjected to secondary analysis focusing
on how actual supply "on the ground" is related to perceived
supply. Relating the regional supply effects to actual "on
the ground" supplies or measures of perceived supply can,
and should, be the goal of future work.

This future work, however, should proceed with caution.
As was pointed out earlier, original supply factor research
involved information on the incidence of participation (yes
or no). Some measure of supply based on the frequency of
participation might also be developed. But if it is, it is
plausible that two different analyses, one based on
incidence and one based on frequency, will result in
different supply factors. If so, the implication is that
different models should be used by planners according to the
measures they are trying to predict. This is what is
implied in the Cicchetti, Seneca and Davidson work (see TN
34).

Another matter of importance regarding the models is
that many of the socio-economic variables considered as
causing or shaping participation are not truly causal. More
precisely, some are antecedents of other causal variables.
It is only later in this volume, in the review of Chapter VII
and in Chapter IX, that any comment is made
about the need for a better and more precise understanding
of causal variables in modelling. For example, age is a
partial antecedent to education, which is in turn a partial
antecedent to income. The result is that the socio-economic
variables included in a regression model are not truly
independent of each other. Participation is a dynamic
phenomenon and the type of analysis presented in this paper
is static or cross-sectional. At best, the solutions
presented in this paper are approximations of an equilibrium
condition, and approximations are where research must start.
For the purposes of this paper and from an empirical
viewpoint, the measurement of supply effects does not depend
on the internal structural validity of the relation between
socio-economic variables implied by the models used. The
critical point for the work here is simply to include
certain variables. In other words, there is a need for more
work on identifying and measuring the causal variables of
participation, but this is not a crucial issue in this
paper.

A related issue concerning the development and
refinement of a model is that the supply factors developed
here were achieved under the assumption that all people have
knowledge of the supply of facilities for given activities.
Future computations could be based on interviews with only
those people who really do know something about the supply
of facilities. Or, more generally, a model could be
formulated in such a way that the response of a given person
about his participation is weighted according to the amount
of knowledge he has about the supply of facilities for the
activity under consideration. A model based on this type of
formulation might, from an aggregate view, adequately combine
considerations both of the incidence of and the frequency of
participation.

The supply-knowledge component should probably be
entered into a projection model as a multiplicative
component rather than as an additive one. The basis for
this is that if the user perception of supply is zero, there
will be no participation. This effect is missed if the
supply-knowledge component is additive. If the reader is
going to get into the kind of considerations just raised, he
should also be aware of an issue discussed by Cicchetti,
Seneca and Davidson. These authors suggested that people
who participate at different rates should actually be
studied by using separate models. A similar conclusion is
implied in the cluster analysis presented at length in CORD
TN 10 and commented on in TNs 3, 32 and 37. In these Notes
the issue of participation is not simply related to a single
activity, but rather the question is raised (as it was in
Reference 00) whether or not groups of activities should

As indicated above, the basic computational strategy
used here utilizes actual participation data to get a
measure of how human behaviour deviates from a model which
assumes supply is uniformly distributed. It is necessary
that the other assumptions behind the supply factor
computation be made explicit since they concern the
interpretation of the supply factor derived here. First, it
is assumed that there are no regional effects resulting from
a unique cultural milieu. Specifically, it is assumed that
cultural factors within any of the 62 regions considered in
this paper do not result in a change in participation (by
modifying the expression of causal socio-economic variables)
that might be confused with supply variations. Similarly,
it is assumed that any socio-economic variable not included
in the analysis either has no effect on recreation
participation or averages out within each of the regions.
Also it is assumed that supply factors do not explicitly
measure the effect of weather in a given year. For example,
a cold summer in the Maritimes and good weather in the
Prairie Provinces in the year data were collected could
result in a supply factor showing that the supply of
opportunities was much larger in the Prairies than it was in
the Maritimes because of the effect of weather in modifying
average participation patterns. Broadly interpreted, this
would be true. The opportunities for a satisfactory
recreation experience would be greater in those areas with
good weather, but this interpretation of supply should not
be confused with "what is on the ground". This problem of
interpretation must be kept in mind continually, and either
considerations of the effects of weather should be taken
into account or evidence should be presented which suggests
that weather had negligible distorting effects with respect
to a given activity.

Finally, and probably of most practical importance, is
that though supply factors do not add much to R² by their
inclusion in a model they are absolutely crucial parameters.
Consider for example that for snow skiing there was an
average participation rate of about .05. Now, as one can
see from Figure 3, for much of the Prairies and Atlantic
Canada the supply factor had a value of around -.025. If
an ANOVA model were used to make predictions based on
national parameters they would imply about 1 person in 20
participated when actually only about 1 in 40 did. The
magnitude of error which would occur as just indicated is
certainly not acceptable in many cases (possibly most) if
planning is to be based on estimates.
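
The arithmetic of this example can be laid out explicitly.
This is a minimal sketch using the rounded values quoted
above (a national rate of about .05 and a supply factor of
about -.025); the variable and function names are
hypothetical.

```python
def one_in(rate):
    """Express a participation rate as '1 person in N'."""
    return 1.0 / rate


national_rate = 0.05    # average snow skiing participation
supply_factor = -0.025  # approximate value for the region

# Prediction that ignores supply versus one that includes it.
predicted_without_supply = national_rate
predicted_with_supply = national_rate + supply_factor

print(round(one_in(predicted_without_supply)))  # about 1 in 20
print(round(one_in(predicted_with_supply)))     # about 1 in 40

# Factor by which the national model overstates participation.
overstatement = predicted_without_supply / predicted_with_supply
print(round(overstatement, 2))
```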

The preceding example does not in fact show how poor
estimates will be in an exceptional case but rather, since
many supply effects are larger in absolute value than .15
and some larger than .35 in absolute value, large errors in
predictions due to failure to consider supply factors can be
expected to be usual rather than unusual. Certainly the
results just cited show that the accuracy estimated using
the procedure given in TN 6 is illusory if a supply factor
is not included in the model used to make predictions. The
supply factor must be included to remove model bias. If
supply factors are independently determined (either from
residuals of a regression where socio-economic effects were
determined or from a separate data set), then the kind of
variance estimates obtained in TN 6 need only be modified by
adding on variance related to error in the supply factors to
determine accuracy estimates:


Added variance = SUM [ (Number of people in area r) x
                       (Variance in supply factor for r) ]


WHERE the sum is over all areas being considered. 


Obviously, when supply factors are not accurately known the 
variance from this source can be larger than error from 
other coefficients. 
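
The added-variance expression can be evaluated as in the
following sketch. It transcribes the formula exactly as it
is printed above and makes no claim about how the result is
combined with the TN 6 estimates. The area names, population
counts and variances are invented for the example, and the
function name is hypothetical.

```python
def added_variance(people_by_area, supply_variance_by_area):
    """Sum over areas of (people in area) x (variance of r)."""
    return sum(
        people_by_area[area] * supply_variance_by_area[area]
        for area in people_by_area
    )


# Hypothetical areas: A is well measured, C is poorly measured.
people = {"A": 1000, "B": 2000, "C": 500}
variance_in_supply_factor = {"A": 0.0001, "B": 0.0001, "C": 0.0016}

total = added_variance(people, variance_in_supply_factor)
print(round(total, 4))

# Contribution of each area, to see which one dominates.
for area in sorted(people):
    print(area, round(people[area] * variance_in_supply_factor[area], 4))
```

In this invented case the small but poorly measured area C
contributes most of the added variance.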


CONCLUSIONS 


This TN has shown that, under a set of reasonable
assumptions, highly significant supply factors can be
derived to show how participation varies among regions
within Canada. Unfortunately this variation can be the
result of variations in physical supply or, possibly, of
cultural differences or other variations. The authors feel,
however, that supply differences are the most important
cause of variation for the activities considered. So a next
step is to try to explain the effects measured on the basis
of physical supply data modified by variables reflecting
such factors as advertising. Such work is quite possible
because some of the coefficients measuring the effects of
supply are accurate enough that they can be compared to
regional inventory study data.


In conclusion, this paper has achieved its primary
purpose of deriving supply factors. Some other interesting
conclusions have been reached about the adequacy of the
model, data requirements and so on which should be of help
to researchers who may wish to further this work. In
particular this paper has given some limited answers about
how to proceed in defining a supply parameter for demand
analysis, what expected levels of explained variance will be
and so on. Possibly of most importance are the guidelines
in the Appendix which show the number of observations needed
in any region before a researcher can expect to have much
chance of deriving adequately accurate supply factors to
carry out further analysis. With these guidelines, studies
which go beyond this one can be designed to succeed rather
than to fail.


APPENDIX


THE PROBLEM OF HAVING ENOUGH OBSERVATIONS 
ON A DESTINATION AREA 
THAT SUPPLY EFFECTS WILL BE STATISTICALLY SIGNIFICANT 
AND/OR ACCURATE FOR FURTHER ANALYSIS 


Consider that a set of supply effects r(i)'s are to be
computed for a number, N, of subareas of a country or
province. Let an estimate of the participation that would
occur if supply were at the same level in all regions (and
there were no cultural factors, etc.) be PE(i,x) for an
individual x in region i. Then, one can argue that an
appropriate estimate of the real probability of person x
participating is:


P(i,x) = PE(i,x) + r(i)


Given the equation above, it is possible to derive some
equations that can be used:


(1) to determine if the variances in actual estimated r(i)'s
are what they should be expected to be and

(2) to determine the approximate number of observations
necessary in a region i to have a certain probability
that a supply effect will be estimated with, say, an S.D.
R(i) about the value of r(i) (e.g. S.D. R(i) approx =
.1 or less with a certain probability).


Still it must be recognized that one must be interested
in having a general accuracy level for a collection of
geographic areas because supply effects are calculated
around a certain mean and are determined in such a way that
they add up to zero. This means that some supply effects
are going to be small and others are going to be large
simply by virtue of the fact that one area has an "amount
of" supply which is close to a national average and another
area deviates quite substantially from this. So, it is
reasonable to consider that in research design one should
have about the same number of interviews in the different
geographic areas in which there are roughly the same number
of people, so that no matter which supply factor turns out
to be large and which turns out to be small the larger ones
will meet some accuracy criteria in terms of S.D. R(i)/r(i).

In expressing error, the acceptable amount of error can
most conveniently be expressed with respect to some measure
of the average deviation of the supply factors from their
overall mean value of zero. One convenient measure of this
type is R = (1/N) times the sum of the absolute values of
the supply factors: in other words a convenient measure
against which to assess error is the average absolute value
of the supply factors. This number is in some way a measure
of how large, on the average, the negative and positive
deviations about the mean supply factor value of zero tend
to be, so let:

R = (1/N) SUM |r(i)|


Now consider that in most regions the average value of
PE(i,x) over all persons in i is near to PBAR (e.g. ± 20%)
and that by definition R/PBAR approx = PRO1. To be
specific, if the average participation rate for hunting for
males is .15 and R = .05 then PRO1 = .33, which means that
the mean supply factor is of such a size that the effect of
its average value is a ± 33% change in participation.
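
The two quantities just defined can be computed as in the
sketch below. The list of supply factors is invented for the
example (it is chosen to sum to zero, as supply factors do
by construction); only the values PBAR = .15 and R = .05 come
from the text. The function names are hypothetical.

```python
def mean_absolute_supply_factor(supply_factors):
    """R = (1/N) x sum of absolute values of the r(i)'s."""
    return sum(abs(r) for r in supply_factors) / len(supply_factors)


def pro1(r_bar, pbar):
    """PRO1 = R / PBAR, the typical proportional supply effect."""
    return r_bar / pbar


# Hypothetical supply factors for five regions (sum is zero).
r_values = [0.08, -0.02, -0.05, 0.03, -0.04]
r_bar = mean_absolute_supply_factor(r_values)
print(round(r_bar, 3))

# Values quoted in the text: PBAR = .15 and R = .05.
print(round(pro1(0.05, 0.15), 2))
```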

Further regarding notation, if one is concerned with
the accuracy of the r(i)'s the concern can be expressed by
indicating that the variance in an r(i) with an estimated
value of about R should be:


PRO2 =
(standard deviation in r(i) with an estimated value of R)/R


and, in the context of making estimates, the standard
deviations in r(i)'s are influenced by:


(1) The accuracy with which PBAR is estimated (which can be
estimated by methods described in TN 6 which can be
approximated as indicated subsequently) and

(2) The variance in the given r(i) about the mean referred
to in point (1) immediately preceding.


Now, with N regions and M observations in each,
participation PBAR may be considered to be estimated based
on MxN observations of a zero-one variable: in other words
PBAR = (# of participants)/MxN. It is well known that this
type of estimate has a variance approx = PBAR(1-PBAR)/MxN.
As for the variance in r(i) with a value of R, for the
region r(i) there are M zero-one observations which by
assumption are from a distribution with p = PBAR ± R. So,
again quoting the well known formula, the variance in r(i)
given PBAR approx = P(1-P)/M. One must consider PBAR
because the results of a particular analysis are being
considered and the value of it in this case depends on the
statistical deviation to be expected in r(i) measured from
the true PBAR, E(PBAR), and on the additional variance
introduced into the estimate of r(i) because PBAR is
estimated. Adding the two independent variance elements:


V = variance in r(i) = PBAR(1-PBAR)/MxN + P(1-P)/M


If N is 20 or 30 and P(1-P) is not much larger than
PBAR(1-PBAR) (which is most unlikely), then the first term
on the right can be ignored so:


V approx = (PBAR + R)(1 - (PBAR + R))/M


And if V**1/2 ( = SDR, the standard deviation expected
in an r(i) of size R) is to be (PRO2)R and R is to be
(PBAR)PRO1 for the reason introduced earlier:


[(PRO2)(PBAR)PRO1]² = [(PBAR + (PBAR)PRO1) -
(PBAR + (PBAR)PRO1)²]/M


Multiplying through by M and dropping the (PBAR + R)²
(1) because this will usually be 1/4 or less and thus will
be negligible compared to the other term on the right and
also (2) because the results have to do with a statistical
approximation:


M(PRO2)²(PRO1)²PBAR² approx = PBAR(1 + PRO1)


M(PBAR) = (1 + PRO1) / ((PRO1)²(PRO2)²)
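
The effect of the two simplifications made in the derivation
above (ignoring the PBAR term and dropping the squared term)
can be examined numerically. The sketch below is only an
illustration: the values PBAR = .15, PRO1 = 1/4, M = 400 and
N = 60 are chosen for the example, and the function names
are hypothetical.

```python
def variance_full(pbar, r, m, n):
    """V = PBAR(1-PBAR)/(M x N) + P(1-P)/M with P = PBAR + R."""
    p = pbar + r
    return pbar * (1.0 - pbar) / (m * n) + p * (1.0 - p) / m


def variance_simplified(pbar, r, m):
    """V after dropping the PBAR term and the (PBAR + R)² term."""
    return (pbar + r) / m


pbar = 0.15
r = pbar * 0.25  # R = (PBAR)PRO1 with PRO1 = 1/4
m, n = 400, 60

full = variance_full(pbar, r, m, n)
simple = variance_simplified(pbar, r, m)

print(round(full, 7), round(simple, 7))
# Ratio of the simplified value to the full value.
print(round(simple / full, 3))
```

With these values the simplified expression overstates the
variance by roughly a fifth, so it errs on the side of
requiring somewhat more observations.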


The formula given above allows one to show why some of
the r(i)'s given in the paper have the level of accuracy
that they do and why it was a relatively hopeless matter to
estimate supply effects for 75 areas based on 4,000
observations (the attempt described at the beginning of the
paper). These examples allow one to see how the formula may
be used to estimate the size of sample necessary to estimate
r(i)'s accurately enough that secondary analysis to
determine how r(i)'s are related to what is on the ground
can be carried out successfully.

One should notice in what follows that it is critical
that one know rough values of PBAR and of the r(i)'s or be
willing to make the success of potentially very expensive
research dependent on the assumption that PBAR and r(i)'s
will have a certain range of values. Furthermore, it should
be noted that the appropriate value of the constant PRO1 is
related to the choice of PBAR and r(i)'s. The value of PRO2
is independent of the other values in the sense that it is
hard to visualize a successful analysis of say 50 or so
residuals to relate them to what is on the ground unless
PRO2 is .1, and it should probably be more in the range of
.05 or .02. Its value must depend on what "accuracy in the
r(i)'s" is necessary to achieve the analysis objectives.

To begin with the matter of the accuracy of supply
factors, even the possibility of detecting them with 4,000
observations, consider that with 4,000 observations and
roughly 70 parameters being estimated, if the r(i)'s are
random:


X²(4000-70) = SUM ((r(i) for the person)²) /
              (variance expected for the person)


WHERE X² is Chi-squared with m degrees of freedom;
variance expected in prediction for a person may be
taken to be approximately the variance in PBAR.




For X² with this number of degrees of freedom to be
significant at the .05 level, the well known approximation
for a X² with over 30 degrees of freedom gives:


1.65 < (2(X²))**1/2 - (2(4000-70) - 1)**1/2


and solving the above, recognizing that (4000-70) is the
degrees of freedom of the X², one obtains a lower bound on
the ratio:


X² / (degrees of freedom for the X²)


So from the above one concludes that on the average
r(i)'s must exceed their variance by 10% if they are to
produce a X² that can be accepted with 95% certainty as
significant. For this to be true on the average, (1/PRO2)²
should be greater than 1.2 or in other words PRO2 should be
.9 or less. By the formula derived earlier, for 4000
observations in 75 areas M = 4000/75 = 53. A typical PBAR
for the CORD Study activities, of which we must choose an
activity, for males is .15. Finally, the earlier results
indicate that R, as defined earlier, is only about 1/4 PBAR.
Thus:


53(.15) = (1 - 1/4) / ((1/4)²(PRO2)²)

PRO2 = (12/8)**1/2 = 1.22


Now even if R/PBAR = 1/3 the chances of results being
accepted as significant are questionable because:


PRO2 = ((9 x 2/3)/8)**1/2 = .86


However, in either case if the sample size had been three
times as large, M = 150, then there is no question that the
results would have been statistically significant.
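
These PRO2 values can be reproduced by solving the sample
size formula for PRO2. The sketch below is an illustration
under stated assumptions: the function name is hypothetical,
and the `sign` argument is introduced here because the
worked examples use (1 - PRO1) in the numerator while the
derived formula is written with (1 + PRO1), the two branches
of p = PBAR ± R. The threshold of .9 is the one quoted
above.

```python
import math


def pro2_from_sample(m, pbar, pro1, sign=-1):
    """Solve M(PBAR) = (1 ± PRO1)/((PRO1)²(PRO2)²) for PRO2.

    sign=-1 follows the worked examples in the text;
    sign=+1 follows the formula as derived.
    """
    return math.sqrt((1.0 + sign * pro1) / (m * pbar * pro1 ** 2))


THRESHOLD = 0.9  # PRO2 must be about .9 or less to be detectable
PBAR = 0.15

# 4,000 observations in 75 areas, then three times as many.
for m in (53, 150):
    for pro1 in (1.0 / 4.0, 1.0 / 3.0):
        value = pro2_from_sample(m, PBAR, pro1)
        # m, PRO1, the resulting PRO2, and whether it passes
        print(m, round(pro1, 2), round(value, 2), value <= THRESHOLD)
```

The text rounds 53(.15) = 7.95 to 8, which is why it reports
1.22 and .86 where the unrounded values are about 1.23 and
.87.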

For another example, by looking at the weighted
regression results for activity one, given in Table 1, one
sees that for r(i)'s between .1 and .2 the standard
deviations are about .02 or .03, which is about 20% of their
expected value. Now, for these data there were on the
average 400 observations in each of 60 origin areas:


PBAR = .23 and thus PRO1 approx = .1/.23 approx = 1/2 so that


PRO2 = ((1 + PRO1) / (M(PBAR)(PRO1)²))**1/2 approx = .14


The value of .14 is lower than the .2 or so observed, but
since results presented in TN 20 show there are structural
problems with the model and because the parameter estimates
are not efficient weighted estimates (see Reference 00) the
difference is not distressing. Also it should be noted that
there are not exactly 400 observations in each origin area
and this complicates matters.

For future analyses of the r(i)'s to determine how they
relate to the supply that is on the ground, simple
statistical significance of the r(i)'s is not enough. For
example the goal might be to study a relation like:


Supply measure = SUM [ (attractiveness)**a / (distance)**b ]


As indicated earlier, for such analyses PRO2 should be .1 and
probably less. So, in the context of the last two examples,
one finds that M should be:


M = (1/.15) x (1 - 1/4) / ((1/4)²(.1)²) = 8000


So, for 75 areas 600,000 interviews would be needed. With
larger r(i)'s:


M = (1/.15) x (1 - 1/3) / ((1/3)²(.1)²) = 4000

Obviously, research to define the relevant relation
between what is on the ground and people's behaviour is
going to have to be well planned so that data need be
collected in only 5 to 10 areas in which there are the
necessary distinct supply differences to allow the adequacy
of parameters to be estimated and a relation to be tested.


REVIEW


G. Ewing


TN 16, 3 and 29 provide examples of three distinct
approaches to the measurement of the supply of spatially
distributed facilities. The first paper describes an ad hoc
measurement technique for the most part based on unstated
assumptions about user behaviour, while the latter two
papers explicitly or implicitly assume that evidence from
recreation behaviour is necessary to estimate a supply
measure. Recognizing that the attractiveness and distance
exponent parameters in TN 3 can be empirically estimated,
the major distinction between it and the last paper (TN 29)
is that in the latter estimates of a place's supply are made
without the analytical problems and the expensive, detailed
behavioural data associated with estimating a distance
exponent and separate attraction values for each
activity-site combination. It may be argued that this saving
is made with a possible sacrifice of validity. How can such
supply factors be changed to reflect changes made in the
supply of an area? In a similar vein, TN 16, which requires
the least specialized (and therefore least costly) data and
produces the most stable results in terms of test-retest
reliability, makes a still greater sacrifice of validity.
Thus, the three papers in the order 16, 29, 3 reflect a
progression in the trade-off between ease of data collection
and supply estimation on the one hand and the ultimate
validity on the other.

One thread which emerges from these papers is that
total participation increases as total supply increases,
although greater supply leads to greater competition for
customers being faced by any single recreation site (in
competition with the others which go to make up the total
supply). Inevitably there is a level of total supply above
which diminishing returns set in for individual sites. But
at lower levels of supply it is quite conceivable that the
competitive effect of additional supply is more than offset
for the individual site by the increase in overall
participation generated by the supply increase. In the
private sector it is conceivable that the supply level above
which diminishing returns set in for individual sites might
be the optimal supply level from the entrepreneur's point of
view. (For other approaches to this problem, see Reference
00.) However, in the public provision of recreation sites, a
higher level of supply might be deemed proper since it would
produce a lower density of use at individual sites and
perhaps, therefore, a higher quality of recreation
experience.

TN 16 illustrates a conventional "accounting" method of
measuring supply. In order to make a variety of decisions,
the "accountant" requires operational definitions of
variables (such as the amount of camping facilities at a
site) and other factors used to compute total supply (such
as the average length of stay and the number of usable days
at a site per year). The basic question about the validity
of this approach relates to whether users of these
facilities perceive supply in terms of the same variables
and factors as does the planner "accountant". This is a
valid consideration if it is argued that site use ultimately
depends on the (perceived) quantity and quality of supply.

In general, the supply measure is highly dependent on a
host of often untested behavioural assumptions about such
things as the acceptable density of sunbathers on a beach,
the average length of stay at a site for a particular
activity, and the average friction of distance for an
activity. Without precise behavioural information to replace
these assumptions the available measures of recreation
supply are at best tentative, if not misleading. Only if
there happened to be a high correlation between the
perceptual measures and the "accountant's" measures would
the difference in the users' and planners' accounting
procedures be of minor importance. In that case, planning a
proper regional distribution of recreation facilities could
be properly based on the "accountant's" supply measures. The
main advantage of the method is the relative ease of supply
measurement and the test-retest reliability of such
measures.

The fact that no variable measure of distance between
users and sites is incorporated in the supply calculation,
in the manner (say) of a market potential model, might be
thought of as a serious weakness of TN 16, which simply
defines sites as equally accessible provided they can be
reached "by car and/or after a short walk". The degree of
accessibility of recreation sites, which range from a few
miles to hundreds of miles from population concentrations in
some cases, is clearly a critical determinant of the
effective supply of recreation sites. However, the attendant
problems of adding a distance component to a supply measure
are well illustrated in TN 3, which shows that the measure
of an origin's supply of recreation facilities varies
considerably with changes in the size of the distance term's
exponent. This variation would be of little concern to the
recreation planner if the supply measures of all places
changed by the same proportion as the value of the exponent
was changed.

It can be indirectly inferred from Table 1 in TN 3 that
the foregoing is not the case. For example, the larger the
exponent, the less important are more distant recreation
facilities in contributing to an origin's overall recreation
supply. Consider two origins, one with major recreation
sites nearby, the other with its major sites at a greater
distance. If a large distance exponent is assumed, the first
origin has much greater effective supply than the other, but
if a very small exponent is assumed, the difference between
the two origins' supplies is slight. Inevitably, planning
decisions as to whether an origin is relatively
undersupplied or not, particularly in situations more
complex than the above example, depend very much on the
distance exponent assumed. Though TN 3 does not discuss
empirical estimation of the exponent, it does clearly
illustrate the effect that variations in the distance
exponent have on location-dependent supply measures, and the
importance of reliable empirical estimates of the distance
exponent for recreation supply measurement.
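
The two-origin example can be made concrete with a small numerical sketch. It assumes, purely for illustration, a gravity-type supply measure in which each site contributes its attractiveness divided by distance raised to the exponent. That functional form, the function names and all site values are hypothetical and are not taken from TN 3.

```python
def effective_supply(sites, exponent):
    """Sum attractiveness / distance**exponent over (attractiveness, miles) pairs."""
    return sum(attraction / distance ** exponent for attraction, distance in sites)


# Hypothetical origins: the same two sites, but the major one is near or far.
origin_near = [(100.0, 10.0), (20.0, 100.0)]   # major site 10 miles away
origin_far = [(20.0, 10.0), (100.0, 100.0)]    # major site 100 miles away


def supply_ratio(exponent):
    """Effective supply of the 'near' origin relative to the 'far' origin."""
    return effective_supply(origin_near, exponent) / effective_supply(origin_far, exponent)


for exponent in (0.1, 1.0, 2.0):
    print(f"exponent {exponent}: near/far supply ratio = {supply_ratio(exponent):.2f}")
```

With a very small exponent the two origins look almost equally supplied, while with a large exponent the first origin appears several times better supplied, so any judgement of relative undersupply depends on the exponent chosen.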

In the same paper it should be noted that in defining
the alternative factor as a measure of overall competition
for the users from one origin, the implication is that
t(ij), the number of trips from i to site j, is partly a
function of the level of competition from other sites. This
in turn implies a choice model of trip making, in contrast
to the widely-used unconstrained gravity model which assumes
t(ij) to be a function of only the characteristics of i and
j and their distance apart, but not related to the nature of
alternatives. Surprisingly, no direct empirical comparisons
have been made of the predictive power of these two types of
model of interaction behaviour. Since the gravity model can
be seen as a special and simplified case of the choice model
incorporating an alternative factor, an obvious question is
whether the more complicated, general model enhances
prediction of trip distributions sufficiently to merit the
extra calibrating effort that its use involves.

One form of indirect answer to this question is to
acknowledge that high R² values have often been achieved in
calibrating gravity models against trip distribution data.
However, in their concluding statement, Beaman and Smith
caution us that high R² values are no guarantee of the
appropriateness of a model. An illustration of this point,
in the form of a high R² value obtained using a misspecified
gravity model calibrated against data that were in fact
consistent with the more general choice model, is provided
in Reference 00.

Recognising the problems associated with
empirically estimating distance exponents as well as site
attractiveness scores, TN 29 seeks to avoid these parameter
estimation problems entirely by adopting a different, though
still behavioural, approach to the measurement of supply. It
attempts to measure the local supply of facilities for a
recreation activity by using as a surrogate of supply the
participation rate in that activity suitably discounted in
terms of local socio-economic population characteristics.
However, certain interpretation problems do arise. The
reasons why over- or under-participation in any region is
inferred to indicate an over- or under-supply of facilities
in that region, rather than to reflect some other possible
causes, are discussed in the paper itself.

Two associated issues deserve comment. Firstly, the
inclusion of city-size levels as predictors of participation
along with socio-economic variables implies that some of the
regional variation in participation is related to the size
of the city participants live in. However, unlike the
effects on participation of education or income, over which
the recreation planner has no control, the effect of city
size level may reflect locational constraints on
participation similar in effect to inaccessible supply.
Specifically, the larger the city, the less readily
accessible are surrounding recreation facilities on account
of the time involved in travel within larger cities. Thus, a
lower participation by large-city dwellers may reflect a
less accessible supply of recreation facilities, rather than
some innately lower propensity to participate. It could be
argued, therefore, that in Equation 3 in TN 29, it is
improper to include a city-size effect as that will decrease
the value of the expected level of participation of
individual i, in region g, in activity a. Without that term
in Equation 3, the resultant larger values of expected
participation in some areas would decrease the supply factor
estimate, and if the above assumption is accepted, more
accurately reflect the extent of (accessible) supply in
areas with large cities. Clearly the recreation planner
cannot influence personal attributes such as education or
income. Yet he can modify the effect large cities have on
their inhabitants, for whom rural recreation facilities are
made more distant by the intervening speed-restricting urban
space. If supply factors were low for these areas, he could
locate recreation facilities close to the edges of, or
within the boundaries of, the larger urban areas. Certainly
the question of whether to exclude that factor from
Equation 3 deserves further consideration.

A second issue relates to the assumption implicit in
the paper that there is a relationship between participation
levels and supply levels.

From a positive view it is possible to consider supply
as measured in TN 3 by Beaman and Smith and as measured in
this paper as complementary. In other words supply factors
can be viewed as intrinsically a perceptual measure of
supply as reflected by people's participation, whilst the
gravity model measure:

$$ \sum_{j} \frac{A(j)}{F(D(g,j))} $$

can be considered as a more direct measure of supply "on the
ground". This being so, a useful exercise would be to
consider supply factors as the dependent variable and
alternative factors as the independent variable in a
possibly non-linear regression model. Clearly, if the
latter were a good predictor of the former, knowledge of the
regressed relationship would simplify the estimation of
future participation increases in any area resulting from
increases in the supply of particular recreation facilities.
And, of course, the shape of the function relating the two
variables would provide valuable evidence about the varying
marginal utility of extra supply for any given existing
level of supply as measured by the independent variable.
CHAPTER VI


THEORY ARTICLES 


INTRODUCTION 


Even though it is possible to give dates at which the
different theory articles in this chapter were prepared, and
to relate the theory articles to CORD Study work that was
going on, this would not give a true picture of why these
theory articles were written. For example, it was quite by
accident that Roy Wolfe provided a copy of his "inertia
model for travel" to the author of the "Distance of the
Function of Distance" paper. Wolfe's paper (see Reference
32), which eventually appeared in the Journal of Leisure
Research, raised questions that had not been brought into as
clear a focus by the CORD Study research underway in 1972.
In the CORD Study, the distance functions that were being
used were either the traditional distance to some exponent
type of function or the kind used by Cheung in TN 1 (which
was selected because it gave a better fit to available data
than traditional distance functions).

Neither the choice of a function because it gives a
good R² in a particular regression, nor the choice of a
function because other people have used it, is particularly
satisfying to some researchers. In his paper, Wolfe was
making the clear suggestion that behaviour should play a
more important role in the choice of the functions used in
gravity models. He presented certain plausible ideas, but it
is these very ideas and disagreement with the way they were
followed through logically that prompted the preparation of
TN 14.

TN 32 and 37 are closely related to each other and
should be considered in conjunction with TN 10, which
presents the applied work that shows the practical relevance
of pursuing the clustering ideas discussed in the other two
papers. It was informal discussion of the ideas presented in
TN 32 (carried out long before the note was prepared) that
prompted the quantitative work reported on in TN 10. When
quantitative results became available on how the Canadian
population would be classified into clusters based on the
activities they participate in, it was very clear that the
original ideas that had prompted the empirical work should
be documented. So, this was done in order that the broad
implication of the empirical work would become clear from a
theoretical perspective.

A meeting of CORD Study researchers and a meeting of
the Canadian Parks and Recreation Association provided the
final stimulus necessary to prepare TN 32 when it was
recognized that there was a need to clarify to recreators
and professionals in the field of recreation the fact that
people participate in one activity independently of another.

Actually, the first version of TN 32 contained much of
what was ultimately included in TN 37. The author of the
latter had long been involved in considering how one should
model people's behaviour when participation in a number of
activities at a number of facilities was involved. So, when
the clustering approach was being described in early drafts
of TN 32 a number of the issues were woven into his own
paper. However, the appearance of the Hendee and Burdge
article on substitutability and its practical applications
in the Journal of Leisure Research prompted a sorting out of
ideas. A number of considerations that would stand on their
own were placed in a review of the Hendee and Burdge
article. This left a fairly pure discussion of clustering
and its practical implications (from a theoretical
perspective) in TN 32.
CORD STUDY TECHNICAL NOTE 14


DISTANCE AND THE "REACTION" TO DISTANCE
AS A FUNCTION OF DISTANCE


J. Beaman


ABSTRACT


"Distance and the 'reaction' to distance as a function
of distance" is a phrase designed to emphasize the need for
more analysis of the behavioural significance of gravity
functions. This paper, in pursuing the behavioural
significance theme, concentrates on research related to the
"inertia models" suggested by Wolfe. The most important
concept presented is that if an inertia model of travel
behaviour of the type Wolfe describes is accepted, then the
decision to visit a given location must involve a reaction
to distance in marginal rather than absolute terms.

To quantify this reaction, the paper focuses on the effect
of going one more unit of distance beyond a given point. This
quantification leads to the conclusion
that analysis of reaction to distance in marginal terms
should be based on the properties of an "impedance due to
distance function", IDF(d). A discussion of five gravity
functions provides a basis for illustrating the points and
methods introduced. IDF(d)'s are derived for the functions
considered. Conjectures on the response to distance
suggested by the functions are checked by reference to the
impedance of distance functions derived for the gravity
functions introduced.

Conclusions relate to the fact that it is easy to
misinterpret what gravity functions imply about how
decisions are made if one looks only at the shape
of such functions. It is shown that in cases where one may
think each new mile to be travelled offers more resistance
than the last, each new mile in fact offers less resistance
or constant resistance.


PURPOSE 


This paper presents a theoretical discussion and
findings on "distance travelled as a function of distance".
The purpose is not to fit functions to data but to clarify
the behavioural interpretations that should be applied to
functions which appear to fit available data.




INTRODUCTION AND THEORY 


Wolfe (see Reference 32) has discussed what is usually
considered to be the "distance part of a gravity function".
General discussion on gravity and potential models is
adequately covered in such sources as Olsson and Wilson
(Reference 20, 31). Ginsberg has formulated a Markov Renewal
Model that embodies the inertia model concept, but only
mathematically oriented readers would find Ginsberg's
formulation (Reference 12) easy to follow. A paper in the
recreation area employing a Markov Renewal Model is Gorman,
Peterson and Lime (Reference 13). Of course, Wolfe is not
the only one (or even the first) to conceptualize the effect
of distance on length of trips, although his explicit
discussion of distance travelled as a function of distance
is a seminal contribution. In fact, inertia effects are
implicit in a variety of formulations. Stouffer's
formulation of "intervening" opportunities (Reference 26)
introduces "inertia" effects resulting from the alternative
opportunities encountered. Research on geographic and social
mobility has resulted in discussion of the "principle of
cumulative inertia" as explaining decreasing mobility rates
with increasing length of stay in a community, profession or
some other state (see Reference 17, 18).

Discussions of distance travelled as a function of
distance consider a function that is related to the
probability of moving from being a traveller to being a
visitor at a distance d from an origin. If one accepts the
notion that such a function exists, the claim can be made
that an impedance of distance function is of fundamental
structural importance in understanding travel behaviour. For
example, picture inertia in Wolfe's terms by viewing the
decision making process as involving the thought: "If I have
gone as far as d, what is my desire to go further?" And
given the question, consider the expression:


$$ P(d,\Delta) = \text{Probability of not going } \Delta \text{ units further than } d = \frac{\#\ \text{going at least to } d \text{ but not to } d+\Delta}{\#\ \text{going to } d \text{ or further than } d} \tag{1} $$


Since the area under the gravity function between d and
d+Δ "measures" the number of persons who stop, while the
area from d to infinity measures the number that go to d or
beyond,


$$ P(d,\Delta) \approx \frac{\Delta\, g(d)}{\int_{d}^{\infty} g(x)\,dx} \tag{2} $$


WHERE the integration is from x=d to infinity and Δ is some
relatively short distance (e.g. a mile or less for
visits to major non-urban parks) and g(d) is the
distance part of a gravity function that is defined so
that the integral in the denominator exists and
defines the number of persons (parties, vehicles or
whatever units) travelling beyond a distance d.


By allowing Δ to approach zero, and if the limit
indicated in Equation 3 exists, the limit as Δ approaches
zero of the probability of not going at least Δ further
divided by Δ is a function that is an instantaneous measure
of the impedance of distance. For this reason the "impedance
of distance function" is defined by:


$$ IDF(d) = \lim_{\Delta \to 0} \frac{P(d,\Delta)}{\Delta} = \frac{g(d)}{\int_{d}^{\infty} g(x)\,dx} \tag{3} $$


WHERE the integration is from x=d to infinity.
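
The limiting step in Equation 3 can be illustrated numerically. The sketch below assumes, purely for illustration, an exponential distance function g(x) = exp(-A_RATE * x) with a hypothetical rate, because its tail integral has a simple closed form. It computes the exact stopping probability of Equation 1 and shows that dividing by ever smaller increments approaches the ratio that defines IDF(d). The names `tail_count`, `stop_probability` and `impedance` are introduced here and do not appear in the paper.

```python
import math

A_RATE = 0.02  # hypothetical decay rate per mile, chosen only for illustration


def tail_count(d):
    """Travellers going to d or beyond when g(x) = exp(-A_RATE * x)."""
    return math.exp(-A_RATE * d) / A_RATE


def stop_probability(d, delta):
    """Equation 1: share of those reaching d who stop before d + delta."""
    return (tail_count(d) - tail_count(d + delta)) / tail_count(d)


def impedance(d):
    """Equation 3: g(d) divided by the integral of g from d to infinity."""
    return math.exp(-A_RATE * d) / tail_count(d)


d = 50.0
for delta in (10.0, 1.0, 0.1, 0.001):
    # The middle column approaches the right-hand column as delta shrinks.
    print(delta, stop_probability(d, delta) / delta, impedance(d))
```

For a ten-mile increment the ratio is visibly below the limit, which is why Equation 2 is stated only as an approximation for a relatively short distance.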


From the way IDF(d) is defined, it is clearly
reasonable to argue that IDF(d) is related to the hesitance
felt by a traveller toward going an additional time-distance
unit, i.e. it is related to making a decision in marginal
terms. It is not unreasonable to propose that IDF(d) is
directly related to the marginal utility of continuing on
from a given point. So, a decision in absolute terms is a
decision in terms of the d's to certain destinations. For
example, assume a person sees two physically identical sites
at d1 and d2 as available. In absolute terms one visualizes
an "economically rational" person as choosing the closer
site, or possibly suspects the two sites get visitors from
the given origin in the ratio g(d1)/g(d2).

However, given the concepts of "marginal" and
"absolute" decision making, it is reasonable to argue that
travel decisions involve both a marginal and an absolute
component. Single purpose visits, particularly involving
routine household functions (e.g. getting groceries) may
involve a great deal of absolute "economic rationality" and
little of the element, "if I go as far as X, I might as well
go to Y". On the other hand sightseeing or vacation travel
may be heavily weighted toward decisions based on marginal
utility considerations. Wolfe's inertia concept applies
primarily to this latter kind of travel. It is fair to say
that this paper investigates a "polar" type of travel
decision, namely decisions made in marginal terms.


THE IMPEDANCE OF DISTANCE FUNCTIONS 
FOR SEVERAL g(d) FUNCTIONS 


Although it is readily acknowledged that little is
known on how decisions are made by various people for
various types of trips, the model just proposed is used to
see how a homogeneous group of people might react to
distance if their travel decisions were made in marginal
terms and if their g(d) were known. And, since there exists
a variety of g(d) functions that could be the correct g(d)
function for a group, there are g(d) functions available
that may reasonably be considered.

Figure 1 The Distance Component of "Gravity Functions" *

* The parameters of the functions were intentionally selected so that the functions would
intersect at 70 miles and 130 miles except for (1) which for a=2 can be forced only through
one point.

Figure 2 Impedance of Distance Functions for the
Functions Given in Figure 1

[Plot not reproduced: impedance (vertical axis) against
distance in miles (horizontal axis, 60 to 260 miles), with
d=MIND, d=dmax and d=MAXD marked on the distance axis.]

Five functions that have been considered in modelling
travel behaviour are defined by Equations 4 through 8 and
these functions are shown in Figure 1. The "standard"
gravity model is based on:


$$ g(d) = C\,d^{-a} \tag{4} $$


Unpublished research carried on as part of the Canadian
Outdoor Recreation Demand (CORD) Study suggested:


$$ g(d) = C\,e^{-ad} \tag{5} $$


which is subsumed under the U.S. Corps of Engineers'
equations for reservoirs which is defined by the form:


$$ g(d) = C \exp(-a\,d^{b}) \tag{6} $$


Beaman and Leicester (see Reference 1) presented a variety
of examples using linear functions to give potentials to
visit. The following expression, while not the general case,
is representative of the linear class of functions.


$$ g(d) = \begin{cases} b - ad, & b > 0,\ a > 0 \text{ and } 0 < d < b/a = d_{\max} \\ 0, & \text{otherwise} \end{cases} \tag{7} $$


Finally, the Wolfe "inertia" function (see Reference 32) is
considered. It may be written as follows:


$$ g(d) = C\,d^{\,-a + (1/n)\log(d/m)} \tag{8} $$


WHERE a is a "usual" gravitational distance exponent
and n and m are other constants.


CONJECTURES 


The five forms of g(d) introduced above embody a wide
variety of implications about the effect of distance on the
decision to travel beyond a point which has been reached. So
before explicitly examining the IDF(d) functions for the
equations introduced, some conjectures on the nature of the
"impedance functions" expressing the "marginal utility" of
additional travel are valuable because they illustrate how
confused a researcher's impressions of a person's
reaction to distance may be when based on knowledge of g(d)
rather than IDF(d).


Conjecture 1. The linear form given by Equation 7
              arises because of a constant force
              (impedance) acting in opposition to
              further travel.


Conjecture 2. An exponential form of g(d) implies
              an impedance related to a force to
              stop that increases with distance.
              (See Equation 5.)


Conjecture 3. The C/(d**a) function implies that
              the further one has gone, the less
              likely one is to go another mile.


INTERPRETATION OF THE IDF(d)'s 


The relation presented in Equation 3 may be used to
determine the actual form of IDF(d) for the five g(d)'s
introduced above. It is recognized that the way that IDF(d)
is defined means it is similar to the force of mortality
function considered in demography but this fact is not
considered or exploited in the paper. So, using Equation 3
the IDF(d)'s were derived and are shown in Figure 2. The
"traditional" gravity function has an IDF(d):


$$ IDF(d) = \frac{C\,d^{-a}}{C\,d^{\,1-a}/(a-1)} = \frac{a-1}{d} \tag{9} $$


where it must be recognized that a must be greater than one.
In fact, ad hoc regressions that suggest that a is less than
or equal to one implicitly suggest that an (infinite)
number of visitors go further than any distance one may
cite - a condition researchers should consider if
regression gives an a less than or equal to one. In the
second case:


$$ IDF(d) = \frac{C\,e^{-ad}}{C\,e^{-ad}/a} = a \tag{10} $$


which is Equation 11 with b equal to 1. The IDF(d) of the
generalization of Equation 10 is different from the
preceding since the integral in its denominator cannot be
written in terms of a "simple" familiar function.


$$ IDF(d) = \frac{C \exp(-a\,d^{b})}{\int_{d}^{\infty} C \exp(-a\,x^{b})\,dx} \tag{11} $$


WHERE the integration is from x=d to infinity; and if b = 2,
the integral can be evaluated using tables for the normal
distribution and where, if b is greater than zero, tables
for the incomplete Gamma function allow evaluation of the
function. The IDF(d)'s were calculated using tables prepared
for use in statistical work with the Pearson Type 3
distribution (see Reference 21). Specifically the integral:


$$ \int \exp(-a\,x^{b})\,dx $$


may be transformed to a usual Gamma function form using Y =
ax**b. In retrospect, it seems clear that IDF(d)'s could
have been determined most easily if available Gamma Function
values (integrals 0 to infinity) had been used with
numerical routines to decrease them appropriately to get the
values of the integrals of g(d) for increasingly larger
values of d.
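
As a check on Equations 9 to 11, the closed forms can be compared with a direct numerical evaluation of Equation 3. The sketch below is not the tabular procedure described above: it swaps in Simpson's rule (applied in the logarithm of distance) with a large finite upper limit standing in for infinity. The parameter values, the upper limits and the function names are all assumptions made for illustration.

```python
import math


def tail_integral(g, d, upper, steps=20000):
    """Simpson's rule estimate of the integral of g from d to `upper`.

    The integration is carried out in u = ln(x), which spreads the points
    evenly over long, slowly decaying tails. `steps` must be even.
    """
    lo, hi = math.log(d), math.log(upper)
    h = (hi - lo) / steps

    def f(u):
        x = math.exp(u)
        return g(x) * x  # change of variable: dx = x du

    total = f(lo) + f(hi)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * f(lo + k * h)
    return total * h / 3.0


def idf_numeric(g, d, upper):
    """IDF(d) with the infinite upper limit replaced by a large finite one."""
    return g(d) / tail_integral(g, d, upper)


d = 70.0
# Equation 9: power function with a = 2, closed form (a - 1) / d.
power = idf_numeric(lambda x: x ** -2.0, d, upper=1e9)
# Equation 10: exponential with a = 0.02, closed form a.
expo = idf_numeric(lambda x: math.exp(-0.02 * x), d, upper=5000.0)
# Equation 11 with b = 2 and a = 1e-4: normal-tail closed form via erfc.
a = 1e-4
gauss = idf_numeric(lambda x: math.exp(-a * x * x), d, upper=2000.0)
gauss_closed = math.exp(-a * d * d) / (
    0.5 * math.sqrt(math.pi / a) * math.erfc(math.sqrt(a) * d)
)

print(power, 1.0 / d)
print(expo, 0.02)
print(gauss, gauss_closed)
```

The b = 2 case uses the complementary error function, which plays the role of the normal distribution tables mentioned in the text.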

The IDF(d) for the "linear" g(x) takes a simple form:


$$ IDF(d) = \frac{b - ad}{0.5\,a\,(d - d_{\max})^{2}} \tag{12} $$


where the denominator is readily recognized as the area of
the triangle defined by a vertical line at some distance d
on the x-axis, the x-axis and the line defined by Equation
7.
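
Because the maximum distance equals b/a, the numerator of Equation 12 can be written as a times the remaining distance to that maximum, and the expression collapses to a form that makes the behaviour near the maximum explicit. The following is only an algebraic restatement of Equation 12, with d_max written for d(max):

```latex
\mathrm{IDF}(d)
  = \frac{b - ad}{\tfrac{1}{2}\,a\,(d_{\max} - d)^{2}}
  = \frac{a\,(d_{\max} - d)}{\tfrac{1}{2}\,a\,(d_{\max} - d)^{2}}
  = \frac{2}{d_{\max} - d},
  \qquad 0 < d < d_{\max},\quad d_{\max} = b/a .
```

Written this way, the impedance depends only on the distance remaining to the maximum and grows without bound as d approaches it.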

Strangely enough Equation 8 is of such a form that for
m greater than one and n greater than zero (as Wolfe
suggests they should be, so he gets his inertia effect) the
area under the curve from d to infinity is "infinite". So,
to have a usable function, a maximum distance to be
travelled must be stated (e.g. 250 miles as in Figure 1) and
then it is reasonable to write the following:


$$ IDF(d) = \frac{C\,d^{\,-a + (1/n)\log(d/m)}}{\int_{d}^{250} g(x)\,dx} \tag{13} $$


WHERE the integration is from x=d to 250 miles.
This must be evaluated by numerical methods for
different values of a, n, and m.


The reader should note here that a variety of problems
related to the existence of certain integrals have been
ignored because it is not the object of this paper to
present a comprehensive treatment of mathematical problems.
However, it is possible to use Lebesgue-Stieltjes
integration to associate a measure with the point d equals
zero so it becomes plausible to write of (as Wolfe does)
start-up inertia. In such a formulation an IDF(d) can still
be considered to apply when d is greater than zero.

One particular point should be noted regarding Wolfe's
function because Equation 8 may be written as below:


$$ g(d) = C\,d^{-A}\,d^{\,b\log(d)} \tag{14} $$

where b = 1/n and where

$$ A = a + (1/n)\log(m) \tag{15} $$


Wolfe should compare his inertia function with a gravity
function with d**(-A) rather than d**(-a) as he does in his
paper. When viewed this way, one sees that the inertia
component of his function is expressed by the b(log(d)) part
of the exponent of the distance part of his gravity
function. A number of other points may also be noted:


1. Wolfe's function is not the only function
   considered for which impedance decreases as
   distance increases.

2. The b(log(d)) component of the Wolfe function
   does initially serve to introduce a larger
   decrease in impedance than for a corresponding
   d**(-a) function.

3. At some arbitrary point, impedance begins to
   increase and then subsequently becomes infinite
   in a discontinuous and arbitrary way (e.g. as
   opposed to the linear form in which impedance
   becomes infinite but in a continuous and
   "understandable" way).

4. The Wolfe inertia function might as well be
   written as Cd**(-A+b(log(d))) since the m is
   only a parameter that should be absorbed in
   the gravity function, because not both m and a
   can be estimated in a regression (since when A
   and b are determined, a and m are not uniquely
   determined by Equation 15).
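
The identification problem in the last point can be shown with a few lines of arithmetic. The sketch below assumes natural logarithms (the paper does not state the base) and uses hypothetical parameter values. It builds two different (a, m) pairs that give the same A under Equation 15 and confirms that Equation 8 then produces the same curve for both, which also equals the reduced form of Equation 14.

```python
import math


def wolfe_g(d, C, a, n, m):
    """Equation 8, assuming natural logarithms."""
    return C * d ** (-a + (1.0 / n) * math.log(d / m))


def reduced_g(d, C, A, b):
    """Equation 14: the same curve written with A and b only."""
    return C * d ** (-A) * d ** (b * math.log(d))


C, n = 1.0, 5.0
a1, m1 = 2.0, 1.3                      # hypothetical first parameter pair
A = a1 + (1.0 / n) * math.log(m1)      # Equation 15
a2 = 1.9                               # a different, equally arbitrary exponent
m2 = math.exp(n * (A - a2))            # the m that reproduces the same A

for d in (20.0, 70.0, 130.0, 250.0):
    # All three columns agree to rounding error.
    print(d, wolfe_g(d, C, a1, n, m1), wolfe_g(d, C, a2, n, m2),
          reduced_g(d, C, A, 1.0 / n))
```

Since the two parameter pairs are indistinguishable from the data side, a regression can recover A and b but not a and m separately.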


Given the above points, it is fair to suggest that
Wolfe's function involves an arbitrary and ad hoc correction
to the usual distance part of a gravity function (d**(-a));
as Wolfe suggests, it is a first correction.

As may have been suspected, conjectures on the kind of
impedance function that would be associated with a given
gravity function were constructed so that all conjectures
present an incorrect view. The linear "gravity function"
involves a situation where the force to stop increases with
distance. In fact, as one approaches the point dmax equals
b/a beyond which people do not go, the force function
approaches infinity (see Curve 8, Figures 1 and 2). The
exponential function considered does not suggest that each
new mile one faces is viewed as offering a greater (or
lesser) impedance than the last mile. Because of g(d) being
the exponential "gravity function", IDF(d) equals a
constant; it implies that every new mile is "like" the last
one (see Curve 2 of Figures 1 and 2). Regarding the third
conjecture, the distance part of the traditional gravity
function C/d**(a) does not imply that each new mile presents
a greater impedance than the last. The IDF(d) function for
1/d**(a), as shown by Curve 1 in Figure 2, is a function
decreasing from very large values near zero to approach zero
as d becomes large: each new mile looks easier than the last
and at large distances, each new mile presents almost no
resistance to further travel compared to the early miles
travelled.

As noted earlier, the equation used by the Corps of
Engineers is a generalization of the simple exponential form
in the way shown below:


$$ C\,e^{-ad} = C \exp(-a\,d^{b}) \quad \text{if } b = 1 $$


The Corps' function is particularly interesting because its
IDF(d) function changes to have three types of shapes:


1. For b less than one it has the form shown by
   Curves 3 and 4 of Figure 1.

2. For b equals one it has the form given by Curve
   2 of Figure 1.

3. For b greater than 1 it has the form given by
   Curves 5 and 6 of Figure 1.

Thus, when the IDF(d) function for the Corps of Engineers'
function (see Equations 11 and 14) is considered, three
forms of the curve must be recognized. Firstly, if b is less
than one, it is seen from Figure 2 that IDF(d)'s similar in
shape to the one for d**(-a) are obtained. If the b in the
Corps' equation is one, the equation becomes the simple
exponential C e**(-ad), for which the impedance function is
a constant, as already noted. Finally, if b is greater than
one, the IDF(d) function is similar in shape to the
impedance function of the linear form of the distance part
of a gravity function. As mentioned above, the impedance
function for the linear form goes to infinity as distance
approaches a maximum distance, dmax, that people travel
according to the "linear gravity function". In this regard,
the Corps' IDF(d) function, for b greater than one, does not
go to infinity for a finite d, but Figure 2 does show that
for b equals 4, IDF(d) "takes off for infinity" very
rapidly.
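
The three cases can be checked numerically. What follows is
a minimal sketch and not the computation behind Figure 2. It
assumes, purely for illustration, that impedance is measured
by the negative logarithmic derivative -g'(d)/g(d), which
for g(d) = C exp(-ad**b) equals a*b*d**(b-1); the definition
of IDF(d) given earlier in this note governs and may differ
in detail. The function names and parameter values are
hypothetical.

```python
import math


def corps_impedance(d, a, b):
    """Assumed impedance -g'(d)/g(d) for g(d) = C*exp(-a*d**b).

    Analytically this is a*b*d**(b-1); the constant C cancels.
    """
    return a * b * d ** (b - 1)


def numeric_impedance(g, d, h=1e-4):
    """Central-difference estimate of -g'(d)/g(d) for any g."""
    return -(g(d + h) - g(d - h)) / (2 * h) / g(d)


def shape(a, b, distances):
    """Classify the assumed impedance curve over the given distances."""
    values = [corps_impedance(d, a, b) for d in distances]
    pairs = list(zip(values, values[1:]))
    if all(math.isclose(x, y) for x, y in pairs):
        return "constant"
    if all(x > y for x, y in pairs):
        return "decreasing"
    if all(x < y for x, y in pairs):
        return "increasing"
    return "mixed"


distances = [10, 50, 100, 200]  # miles, illustrative only
for b in (0.5, 1.0, 4.0):
    print(b, shape(a=0.01, b=b, distances=distances))
```

Under this assumed measure the curve falls toward zero for b
less than one, is flat for b equal to one, and rises without
reaching infinity at any finite d for b greater than one,
which matches the three shapes described above.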

The IDF(d) for Wolfe's "inertia" gravity function is
unique among the functions considered here because of its
shape. When MAXD is defined as 250, as it is for the Wolfe
function shown in Figure 1, the IDF(d) function, Curve 7
shown in Figure 2, drops initially, showing a decreasing
perceived impedance of distance as distance increases: in
Wolfe's terms, an inertia to continue to travel is built up.
But, as suggested by the IDF(d) function, ultimately the
impedance begins to increase and then rapidly goes to
infinity (e.g., in the example at MAXD = 250 miles).

If something in Wolfe's "inertia" gravity function is not
plausible, it is the form of the g(d) function. If the
parameters of Wolfe's function are such that a minimum of
g(d) occurs (in Figure 1 the minimum is at MIND = 170
miles), there is a point, the minimum of g(d), beyond which
increasing numbers of people stop at each mile up to an
abrupt boundary beyond which there are no trips. But, unless
there is really some physical boundary to be considered in a
particular problem, it is reasonable to suggest that no such
abrupt discontinuity in g(d) should occur. In this regard,
Wolfe's function might better be defined by:


g(d) = C d**(-a + b e**{(-.5)((d-u)/s)**2})


SUMMARY 


From the curves presented in Figure 2, one notes that
there are IDF(d)'s that (1) increase, (2) decrease, (3) are
constant, and (4) both increase and decrease. The variety of
shapes of IDF(d)'s shown in Figure 2 was certainly not
expected by the author. Possibly the most unexpected
result, however, is that not all IDF(d)'s become infinite,
which would show that, at least beyond some distance,
further travel becomes less desirable in the sense that the
next mile offers more resistance than the last. In fact, as
already noted twice, functions commonly used in modelling
were seen to have not only an IDF(d) that is constant but
also impedance functions that approach zero as distance
travelled increases. One may note that decreasing impedance
suggests that the fact that g(d) and the integral of g(d)
from d to infinity both approach zero does not imply the
increasing disutility of travel. Rather, the fact that some
IDF(d)'s approach zero as d approaches infinity may be taken
to reflect the gradual filtering off of people from a finite
universe, even though the impedance to further travel
continues to decrease.


CONCLUSION 


In eliciting an intuitive feeling for his inertia
model, Wolfe uses such phrases as: 'In the present note a
suggestion is offered for one of these deficiencies - the
unresponsiveness of the gravity model to the effect that
distance itself has upon the perception of distance.'
'...When trips are very short, the friction of distance is
negligible. The number of short trips, however, may be
smaller than expected because a great many people may not
wish to make a trip of any length, however short; their
starting-up inertia, as it were, is too great to overcome.'
'At the opposite extreme, one might hypothesize that, among
the minority of people who indulge in lengthy trips, a still
smaller minority find travel itself so stimulating that the
further they go, the further they want to go.'
These statements imply that certain travel decisions
are made in marginal terms; in terms of one's reaction to
each additional time-distance unit to be travelled (in a
manner implied by the IDF(d) function).

The point in recognizing marginal or some other
decision making is recognizing that the most prominent lack
in discussions employing gravity models is an inadequate
behavioural basis for the gravity function introduced.
However, this is not to say literature in which behavioural
considerations play a role is nonexistent. But, for example,
one should not consider Wilson's "Interpretation of Terms
Section" of his book (see Reference 31) as the point at
which behaviour is introduced into the entropy-transport
model. Human behaviour, in so far as it is relevant, was
introduced as implicit in the entropy framework. It is
interesting to note Simon's discussion of learning theory
models (see Reference 25) where he shows that a
theoretically derived equation, which is the case for
Wilson's gravity model equation, may be consistent with a
number of drastically different behavioural theories. One
may also see Cesario (see Reference 7) where he discusses
why one of Wilson's more unrealistic assumptions may be
relaxed to be more realistic without affecting the model one
would use to explain travel behaviour.

From a theoretical perspective the significance of this
note is clearly considered to lie in its focus on the
behavioural significance of the IDF(d) function. The author
frankly was surprised by the variety of IDF(d) functions
resulting from the consideration of g(d)'s which are so
similar over the range of d's often considered. (See Note at
end of text.) However, when the significance of IDF(d) is
recognized, research may be designed to allow more adequate
definition of IDF(d) and thus, g(d).

From a broader perspective it is hoped that this
discussion of marginal decision making will prompt research
on the "mix" of marginal and absolute elements in various
travel decisions. Discussion and research are needed
because, although formulas resulting in good aggregate
predictions of trip distribution are valuable for certain
applications, manipulation of these formulas without asking
why they work is not social science research. Social science
research must ultimately relate accepted formulas that
predict behaviour to fundamental social, psychological and,
possibly, biological processes.

Note: The similarity of different g(d)'s implies that
the choice of a given g(d) on the basis of R² or another
criterion would involve a high probability of an incorrect
choice (a Type 2 error). This should be recognized in
evaluating the significance of results by Wilkinson (see
Reference 30). Actually, any of the gravity functions
considered here deviates from d**(-a) in such a way that, in
Wilkinson's terms, an inertia effect may be found. Also, one
should recognize that Wilkinson is dealing with Wolfe's
function, which suffers from the other problem already
noted.

The author believes that Wilkinson was estimating
parameters that could not be uniquely estimated without
making unwarranted assumptions. In fact, Wilkinson's
introduction of m = 1/8 into his model was not based on
objective observation of visitor behaviour. Only when m is
determined experimentally should it be regarded as anything
but an arbitrary and ad hoc correction to a. On this matter,
one may also see that points made about Equation 15 imply
that there are an infinite number of parameter sets, one for
each m (e.g., m equals 1/8, 1/4, 1/2, 1.0, 2.0, etc.), that
give the same explained sum of squares in the kind of
regression that Wilkinson carried out - thus his
parameterized model is not really based on scientific
observation.


CORD STUDY TECHNICAL NOTE 32


PRACTICAL APPLICATIONS OF CLUSTER ANALYSIS 


J. Beaman and S. Lindsay


ABSTRACT 


This paper presents different ways cluster analysis
results may be used by planners and decision makers. It
gives hypothetical results for cluster analysis of park user
surveys and for national household surveys. These examples
are discussed in terms of: 1) how clustering results can be
used in considering the equity with which opportunities to
participate in activities are provided; 2) how cluster
analysis results should be important in the prediction of
behaviour; and 3) how cluster analysis results are relevant
to futures research in ways not treated by other
researchers.

The conclusion is reached that though practical
applications of cluster analysis have not yet taken place,
the technique holds great promise for the future.


PURPOSE 


The purpose of these notes is to present to planners
and decision makers some practical implications of results
that can be obtained using cluster analysis.


INTRODUCTION 


This paper concentrates on presenting different ways
cluster analysis results may be used by planners and
decision makers. Burdge and Field (see Reference 4) have
cited the need for a cluster analysis approach to the
processing of participation data. Tatham and Darnoff's work
on cluster analysis (see Reference 27) deals with some of
the issues presented here. Their work is of particular
relevance for planning implications. Notions on equity,
however, are not developed in quite the same way as in the
present paper. Their discussion does not deal with
prediction of behaviour and futures research in a way that
is similar to the perspective endorsed here. A paper on the
methodology of cluster analysis by Romsa et al. has already
been received by Parks Canada and some results of Romsa's
methodological investigation have been published.
Specifically, it is in the Journal of Leisure Research (see
Reference 22) that Romsa and his colleagues at the
University of Windsor have reported on the use of cluster
analysis for deriving activity packages from the Canadian
Outdoor Recreation Demand (CORD) Study National Survey
Information. Given Romsa's methodological effort and given
the Burdge and Field and Tatham and Darnoff general
discussions, a very practical discussion is now required
concerning the ways in which Romsa's results, or the results
that could be arrived at by others, can be used by planners
and decision makers.

Since there is a difference between the kind of cluster
analysis carried out by Romsa and his colleagues, and the
kind of work that has sometimes been called cluster analysis
by other researchers, the reader should look carefully at
the example presented in the next section in order to
understand the kind of cluster analysis referred to.
Briefly, the cluster analysis approach discussed here does
not look at intercorrelations between activities. Rather,
clustering refers to a relation defined by individuals
participating in similar collections of activities:
clustering is of individuals on the basis of activities in
which they participate, rather than on any intercorrelation
between activities over some collection of people (see
Figure 1).


Concern is not with people grouping themselves in the
sociological sense of forming social groups in which there
is interaction; at least this is not necessarily the case.
Here, grouping refers to people being in collectivities;
people are indicated to be members of particular
collectivities defined by participation in a certain set of
activities. The set of activities that is related to a
collectivity is said to be or define the activity package
for the collectivity. In the following discussion the
existence of collectivities defined on the basis of
particular people participating in a particular collection
of activities defined by their "activity packages" is
accepted.


THE USE OF CLUSTER ANALYSIS 
WITH PARK USERS SURVEY INFORMATION 
TO OBTAIN RESULTS THAT MAY BE OF USE TO PLANNERS 


One of the problems that arises in trying to understand
the results prepared from park user survey data on
activities participated in is that (1) the results presented
rarely reflect the total loading of facilities and (2) the
results, e.g. tables prepared from raw data, do not give a
clear picture of the multi-faceted participation
characteristics of the users of facilities. (Assume here
that the user information is collected as part of an
entrance or exit survey.) In reality one of the major
dimensions of analysis concerns may be the mix of activities
in which people participate. Information on the collection
of activities in which an individual participates and on the
characteristics of participants simply loses the dimension
of activity mix when presented on an activity by activity
basis. Even cross-tabulations of activity against activity
only give a very limited insight into individuals' activity
mixes.

Participation analysis that goes beyond a simple
analysis of facility loading requires a technique that
reflects the interrelationships between the activities in
which individuals participate. The use of factor analysis
to examine intercorrelations defined by people's
participation in a number of activities violates the basic
assumption of defining "activity packages" (see the
Appendix). Rather, one must adopt a true cluster analysis
technique that will break the user population into
relatively homogeneous groups on the basis of those
activities in which individuals participate.
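
The idea of grouping individuals, rather than correlating
activities, can be sketched in a few lines. The example
below is an illustration only and is not the algorithm used
by Romsa and his colleagues: it assumes a simple greedy
"leader" rule with a Jaccard similarity threshold, and the
respondent records, activity names and threshold value are
all hypothetical.

```python
def jaccard(a, b):
    """Share of activities that two respondents have in common."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cluster_respondents(profiles, threshold=0.5):
    """Greedy 'leader' clustering of people by their activity sets.

    Each respondent joins the first cluster whose leader (its first
    member) is at least `threshold` similar; otherwise the respondent
    starts a new cluster.
    """
    clusters = []  # list of lists of respondent ids
    for person, activities in profiles.items():
        for members in clusters:
            leader = members[0]
            if jaccard(activities, profiles[leader]) >= threshold:
                members.append(person)
                break
        else:
            clusters.append([person])
    return clusters


# Hypothetical survey records: respondent -> activities participated in.
profiles = {
    "r1": {"tent camping", "fishing", "hiking"},
    "r2": {"tent camping", "fishing"},
    "r3": {"picnicking", "driving for pleasure"},
    "r4": {"picnicking", "driving for pleasure", "sightseeing"},
    "r5": {"fishing", "hiking", "tent camping"},
}
clusters = cluster_respondents(profiles)
for members in clusters:
    share = 100 * len(members) / len(profiles)
    print(members, f"{share:.0f}% of respondents")
```

Each resulting group is a collectivity in the sense used
here, and the activities its members share approximate its
activity package. A production analysis would use a tested
clustering routine rather than this greedy rule, whose
result can depend on the order of the records.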

For some people the preceding discussion may be rather
abstract and even appear obtuse. So, assume that three types
of activities are provided in a park: (1) activities which
are not compatible with the theme or general purpose of the
park; (2) other activities that are not offensive to the
purpose of the park but not exactly in line with its theme;
and (3) activities that are consistent with the purpose of
the park. If a cluster analysis of park user survey data on
the basis of the activities in which visitors participate
were carried out for a given park, then one could see how
people's participation is divided among the sets of
activities just noted.

Figure 1 presents hypothetical results of a cluster
analysis of summer activities in a given park, suggesting
that there are six clusters and thus six types of park
visitors. The figure shows that ten percent of visitors show
an interest both in incompatible activities and in
activities that do not tie strongly to the theme of the
park. The third group indicated is not involved, to any
large extent, either in incompatible or in highly compatible
activities. The fourth cluster of people noted are those
that present an extremely problematic pattern of clustering
by participating in both incompatible and highly compatible
activities. The planner might wish to serve these visitors
by moving incompatible activities out of the park (to be
supplied by the private sector) while leaving the highly
compatible activities in the park.

Finally, the last two clusters indicated in the figure
involve respectively 20 and 10 percent of the park visitors.
These are the clusters made up of people whose participation
focuses on highly compatible activities. However, Cluster 5
involves a high level of participation both in activities
that are not highly related to the park theme (but not
incompatible), and in incompatible activities.

One should note that while Figure 1 may clarify how the
results of a cluster analysis of park visitor survey data
might be of use to a planner, the figure oversimplifies the
kind of clustering pattern that might be expected in a
cluster analysis. This oversimplified clustering pattern
results in an "obvious" interpretation of the results, which
would rarely occur. Certain sets of activities containing
some highly compatible activities and some activities of the
other two types may turn out to define activity packages for
people. The activity package just alluded to may have few
activities in common with another activity package that also
involves activities that run from incompatible to highly
compatible. The existence of two clusters that both involve
incompatible and highly compatible activities may reflect
an orientation of some people toward physically demanding
activities and others toward passive activities when both
incompatible activities and compatible activities are
considered.

In terms of the example just cited, one can see that a
dimension of clustering not suggested by the figure may be
relevant for consideration in planning. However, one may
wish to ignore the active-passive split and, for planning
analysis, combine clusters to match the pattern shown in
Figure 1 by combining clusters that are identified by a
computer analysis of park user survey data.


CLUSTER ANALYSIS OF HOUSEHOLD SURVEY INFORMATION -
AREAS OF CONCERN TO PLANNERS 


The planning considerations to which the development of
activity packages from data collected from people at their
homes is claimed to be important are (a) the equity factor;
(b) planning for specific areas or populations; (c) futures
research; and (d) predictions of behaviour. These data can
relate to participation in activities at a particular park
or to participation in all recreation activities. This paper
is working in the latter context - data collected on
participation patterns at any park, private facility or at
home.

The reader is asked to refer to Figure 2, which is very
similar to Figure 1, except that the focus of attention is
not on activities that take place in a given park but
rather, on a set of activities which for some reason was
chosen to be considered in a National or Regional Survey.
Thus, in contrast to the way activities are divided in
Figure 1, the list of activities dealt with may reasonably
be broken into active or passive; into social, cultural or
athletic; or into whatever classification is appropriate for
the policy related research and/or planning of the
Administration Unit involved in a given analysis.


Equity 


Cluster analysis has value in considering equity
because by breaking up the population according to the
activity packages in which they participate, one can see the
equity with which activities are provided to various
collectivities of people. It is not suggested that all the
activities listed in a survey should be provided in such a
way that each collectivity associated with an activity
package receives the same dollar input from a given agency's
Parks and Recreation Expenditures (receiving the same dollar
input is not even necessarily possible); there is no
suggestion that equity is reflected if planning involves
allocating money to activities so that some kind of balance
is kept in the money benefits that people with various
activity packages "receive". However, if one is placing
money into a number of activities, then one can look at
activity packages for the collectivities into which a
population is divided and see in which packages the
activities just noted occur and in which packages they do
not: one can see what proportion of the people are being
served by a given set of activities and, in some sense, how.

*** In the text reference is made to tent camping, trailer
camping, fishing, hunting and driving for pleasure.
Participation in these activities is indicated by the
dark grey areas. Tent camping, fishing and hunting are
assumed to be ATHLETIC-ACTIVE, so to distinguish
participation in these activities the dark grey area
appears under ATHLETIC-ACTIVE for Clusters 1 and 2.


TN 32 CHAPTER VI page 404


If the activities noted in the last paragraph occur in
only two activity packages involving 15 percent of the
population (see Figure 2, Clusters 1 and 2, marked by
cross-hatching) and these are the only activities into which
government puts money, then there is obviously some
inequity. But if one government agency puts money into
supporting tent camping, trailer camping, fishing, hunting,
and driving for pleasure while there are other activities
into which other government agencies put money or if, for
example, provincial responsibility is related to athletic
activities and the federal agency is examining the
activities into which it is putting money, then
inequities that may be reflected by supplying facilities for
activities in which only 15 percent of the population
participate may well be irrelevant to the national level
planner responsible for planning provision of specific
facilities.

The point is that, by defining groups of people
according to the activities in which they participate, one
gets a clear and multi-dimensional idea of who is receiving
what support when money is put into several activities or
into a large number of activities. Thus, when cluster
analysis results that define activity packages are
available, one avoids the error of thinking that because 15
percent of the people in a population participate in one
activity (e.g. an activity common to both Clusters 1 and 2)
and 22.5 percent participate in another (an activity common
to both Clusters 2 and 3), then monetary input in the two
activities serves 37.5 percent (22.5 percent plus 15
percent) of the population. (Actually, in the example cited,
30 percent, that is 7.5 percent plus 7.5 percent plus 15
percent, of the population is served.) The author maintains
that this kind of fallacy permeates the thinking of many
officials who believe that planning for activities should be
done on the basis of the total man-days of participation in
each activity or on the basis of the proportion of the
population participating in each of a number of activities.
But there is a need to distinguish between planning based on
a fallacious use of information, and allocation of resources
where equity considerations may be relevant from a
geographical distribution of expenditure but not from some
other perspective.
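
The arithmetic of the fallacy can be made explicit. The
sketch below uses the cluster shares implied by the figures
quoted above (7.5, 7.5 and 15 percent for Clusters 1, 2 and
3); the activity labels and variable names are hypothetical.

```python
# Cluster shares in percent of the population, chosen to reproduce
# the figures quoted in the text.
cluster_share = {1: 7.5, 2: 7.5, 3: 15.0}

# Activity -> clusters whose activity package contains it
# (hypothetical labels).
activity_clusters = {"activity A": {1, 2}, "activity B": {2, 3}}


def participation(activity):
    """Percent of the population participating in one activity."""
    return sum(cluster_share[c] for c in activity_clusters[activity])


# Fallacious total: adds activity percentages, so Cluster 2 is
# counted twice.
naive_total = sum(participation(a) for a in activity_clusters)

# Correct total: each cluster that is served is counted once.
served = set().union(*activity_clusters.values())
true_total = sum(cluster_share[c] for c in served)

print(naive_total)  # 37.5
print(true_total)   # 30.0
```

The gap between the two totals is exactly the share of the
cluster whose package contains both activities.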

Given the points made above, the author claims that
when cluster analysis results are available for a given
city, it can be abundantly clear if the cluster analysis
results indicate that the recreation plan for the city
endorses or results in a physically active minority of the
people receiving drastically disproportionate amounts of
recreation dollars, in comparison to those people who may
have activity packages which contain relatively few, say,
passive activities. (Romsa, see Reference 22, presents
activity packages for Quebec that show the possibility that
most recreation dollars go to a few people.)


Planning Implications 


Planning implications that can be derived from cluster
analysis that do not relate to equity considerations (a
policy consideration) may be understood easily, given the
preceding discussion. If a population can be broken into the
kind of collectivities just described (collectivities based
on activity packages), then the population has been broken
into natural units. This is because the various activities
in an activity package may inter-relate. There may be a
trade-off between activities (see Reference 3, 5, 11, 15,
22) or some other relationship (see Reference 1). An
understanding of such relationships between activities, as
well as a knowledge of the sizes of those collectivities
that have an activity package making them a market for a
facility package, allows the planner to recognize facility
packages that are wanted by various segments of the
population. Also there is the possibility that the planner
can recognize the kind of facility or activity
"deficiencies" that may reasonably be left when one activity
is allowed to substitute for another (see Reference 5).

With the activity packages for the various "groups" in
the population in mind, the planner can begin to see why
large segments of a population will not use certain
facilities if a certain set of activities is
provided. Given such information, planners can inform
politicians of the "real" significance of a situation. When
a political ruling (policy) on what will or will not be
provided is to be examined, the planner who has cluster
analysis results relevant to a given policy can give the
politician a clearer idea of what he is really asking about
and why, with regard to a given policy, than is usually
possible without cluster analysis results. The important
point is that the planner who has cluster analysis results
can inform the politician and/or manager of the consequences
of given actions in a much more intelligent way than can be
done when the only figures that can be provided are figures
that show that some people do not participate in one
activity or another. Cluster analysis thus allows the
planner to better understand the structuring of behaviour.
By allowing a better understanding of the structuring of
behaviour, cluster analysis makes possible a realistic and
comprehensive discussion of planning options. Thus
politicians and/or managers/decision makers can make better
decisions regarding facility provision than are usually made
today (at least the potential is there).


Futures Research 


The preceding section could have started with the
statement that: to say people behave in such a way as to
define activity packages is almost a tautology. The import
of the assertion that people's participation in activities
defines activity packages really becomes clear when one
recognizes that for purposes of futures research, activities
do not have a future. Rather, activity packages have a
future. It is the people living today and having similar
patterns of behaviour who, as they move into the future, may
be expected to modify their behaviour so that activity
packages shift as new activities shift into activity
packages, or some activities shift out of some or all
activity packages.

The basic fallacy in much current recreation-leisure
oriented futures research is that it approaches making
projections into the future by projecting the future of
activities, activity-by-activity: researchers approach the
future of an activity by asking what is going to happen to
this activity and, in doing so, fail to recognize that it is
not the activity that goes into the future and behaves in
some way. It is people who, by living from day to day and
year to year, go into the future and either change or
maintain their behaviour and attitudes. Thus, cluster
analysis of participation data results in appropriate units,
the activity package and the collectivities associated with
activity packages, with which to confront Delphi panels when
they are asked to make predictions of the future.

In other words, it is claimed that the task of a Delphi
panel should be to make projections for activity packages
(1) in terms of the number of people in the population that
will be in given collectivities associated with given
activity packages and (2) in terms of what activities may
drop from a package and what new activities may be expected
in the package. They must also
suggest totally new kinds of activity packages that may
arise. Then the panel must put the results of projections
for all activity packages (new or existing) together to get
an activity-by-activity perspective for the future, if for
some reason there is concern with having an
activity-by-activity perspective.
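
The final step, turning package-level projections into an
activity-by-activity view, amounts to summing collectivity
shares over every package that contains the activity. The
sketch below illustrates this under the assumption that each
person belongs to exactly one collectivity; the package
names, shares and the "snowmobiling" activity are
hypothetical and are not panel results.

```python
def activity_participation(packages):
    """Sum collectivity shares over every package holding an activity.

    `packages` maps a package name to (share of population in
    percent, set of activities in the package). Assumes each person
    belongs to exactly one collectivity.
    """
    totals = {}
    for share, activities in packages.values():
        for activity in activities:
            totals[activity] = totals.get(activity, 0.0) + share
    return totals


# Hypothetical projections for some future year.
projected = {
    "package 1": (10.0, {"tent camping", "fishing"}),
    "package 2": (5.0, {"trailer camping", "fishing"}),
    "new package": (2.5, {"snowmobiling", "fishing"}),
}
totals = activity_participation(projected)
for activity in sorted(totals):
    print(activity, totals[activity])
```

Working in this direction keeps the package structure
visible: a change in one activity's total can be traced to
the collectivities that gained or lost it.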

The simple activity-by-activity approach to projection
confuses the issue of what is really happening over time by
removing the insight that may be gained in recognizing a
structuring of participation in activities in our society.
For this reason, in order to make reasonable projections,
one needs to recognize the fact that certain collectivities
can be defined and used in making futuristic projections.


Predicting Behaviour


Finally, a consideration related to the points made
above is that cluster analysis offers insight into behaviour
that is important in predicting behaviour. The predictions
that one needs to make in planning or policy evaluation may
not be predictions of the future. Examination of the
characteristics of a population in an area where survey
results are not available may indicate that people are such
that certain activity packages may be expected to exist if
certain facilities are created (i.e. if certain policies are
followed). Specifically, until we understand the behaviour
of people in terms of activity packages, there will be a
tendency to treat the prediction of behaviour on the basis
of individual activities regardless of whether a large array
of alternative activities is supplied.

At present it does not seem clear that there is a
methodology by which cluster analysis results can be used to
make predictions of what will happen in terms of people's
participation in certain activities in a given area under
various policy options. Nevertheless, cluster analysis is
the only methodology that defines the kind of activity
packages that are here argued to be of importance. Thus, the
results of cluster analysis offer information that is of
value in making policy-related predictions and therefore it
is presently the only methodological tool known to the
author that has any promise of improving our predictive
ability by increasing our understanding of activity
substitutability and complementarity.


CONCLUSION 


The discussion in the preceding sections has presented
considerations related to a number of practical applications
of cluster analysis. Though it would be desirable if all of
these areas of application could be explained in terms of
practical examples of what has actually been done, the
practical application of cluster analysis to define activity
packages to be used in the ways described has simply not
taken place. In fact, it is the purpose of this paper to
prompt practical planning work using cluster analysis
results by setting down the suggestions and guidelines noted
here.


APPENDIX 


To understand why factor analysis should not be used to
derive "activity clusters" one need only understand a few
basic considerations. Factor analysis should be used only
when the data for analysis are considered to have underlying
dimensions common to all people (recall the early studies on
single or multiple dimensions of intelligence) or when any
sub-group of the population selected for analysis has the
same dimensions as the population as a whole. Data should
not be considered as having an internal structure such as
the one suggested by the clustering illustrated in Figure 1.
Figure 1 suggests that the people in a community or nation
(as suggested by Romsa's research on Canada - Quebec results
reported in the Journal of Leisure Research, see Reference
22) may be broken up into collectivities in which
individuals participate.


Each one of the collectivities derived by cluster
analysis is characterized by the intercorrelations among the
activities which define the "cluster of people" that have
the activity package (defined by the activities in which
people in a given collectivity participate).


Given that a number of collectivities encompass all the
activity packages of a community, it may be asked what the
intercorrelations of people's participation in activities,
usually processed in factor analysis, reveal. Actually, the
intercorrelations tell one in large part about the relative
sizes of various collectivities in a population rather than
giving any information about the individual collectivities
per se. If certain sub-groups of the population of a city
are selected for analysis, one does not get the same factor
structure as for the population as a whole unless dealing
with a very particular kind of sample designed to be
representative of the city as a whole: when a population is
sub-divided on the basis of age, sex, education, income and
other variables, the relative balance between collectivities
changes in various sub-groups (this is confirmed by cluster
profiles derived by Romsa, see Reference 22; and results
presented in Currie, see Reference 9).
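
The claim that aggregate intercorrelations largely reflect
the relative sizes of collectivities can be illustrated with
a small simulation. The sketch below is an illustration
under stated assumptions, not part of the CORD analyses: two
hypothetical collectivities are given participation counts in
two activities, the activities are generated independently
within each collectivity, and only the share of the first
collectivity in the population is varied. All function names
and parameter values are invented for the example.

```python
import random


def pearson(xs, ys):
    """Plain Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


def simulate_group(rng, size, p):
    """Participation counts (out of 10 occasions) in two independent activities."""
    def count():
        return sum(rng.random() < p for _ in range(10))
    return [(count(), count()) for _ in range(size)]


def mixture_correlations(share_a, n=20000, seed=0):
    """Return (aggregate, within_a, within_b) correlations between the activities.

    Hypothetical set-up: collectivity A takes part often in both activities
    (p = 0.4 per occasion), collectivity B rarely (p = 0.1).
    """
    rng = random.Random(seed)
    n_a = round(share_a * n)
    group_a = simulate_group(rng, n_a, 0.4)
    group_b = simulate_group(rng, n - n_a, 0.1)

    def corr(rows):
        return pearson([r[0] for r in rows], [r[1] for r in rows])

    return corr(group_a + group_b), corr(group_a), corr(group_b)


if __name__ == "__main__":
    for share in (0.50, 0.05):
        aggregate, within_a, within_b = mixture_correlations(share)
        print(f"share of A = {share:.2f}: aggregate r = {aggregate:.2f}, "
              f"within A r = {within_a:.2f}, within B r = {within_b:.2f}")
```

In this set-up the within-collectivity correlations stay near
zero whatever the mix, while the aggregate correlation moves
with the share of collectivity A. A factor analysis of the
aggregate correlations would therefore be describing the mix,
not the behaviour inside either collectivity.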

The point being stressed here is that factor analysis
is the most appropriate tool to use in looking for structure
in data only when the condition of invariance of factor
structure for sub-groups in the population holds (see
References 14, 16). Specifically, if the assumption of
invariance of correlations based on people's participation
in activities is to hold, the population that is being
subjected to a factor analysis must not be structured with
respect to its participation in outdoor recreation
activities. When such a structuring exists, factor analysis
is not an appropriate technique for learning something about
the structuring of participants in the population of concern.

As well as the fairly general theoretical
considerations just noted, a more detailed critique of the
factor analysis "clustering technique" could be based on
reference to a number of points. In particular, the
instability (loose definition) of structures defined by
varimax rotations or other factor rotation algorithms is one
point that should be considered. Along this line, it can be
noted that algorithms have been developed to relate factor
structures derived on one set of data with factor structures
derived on another set of data. However, it has also been
shown that because of the nature of the transformations
involved, factor structures from two sets of data can often
be related even if there is little relationship.


CORD STUDY TECHNICAL NOTE 37


COMMENTS ON THE PAPER 
"THE SUBSTITUTABILITY CONCEPT:
IMPLICATIONS FOR RECREATION RESEARCH AND MANAGEMENT" 
BY HENDEE AND BURDGE 


J. Beaman


ABSTRACT 


In this article the author accepts the importance of
the Hendee & Burdge paper but cites a number of important
ideas related to substitutability that they did not include
in their paper. In particular, a concern is expressed that
the Hendee & Burdge article could lead to factor analysis
being used to study substitutability when, or in ways that,
it should not be used. Specific reasons are given why R-mode
factor analysis should not be used to study
substitutability. It is stressed that there has not been
sufficient account of behaviour being a function of the
factors inherent in the clustering of individuals according
to their activity packages, and it is claimed that such
clustering of individuals should be studied by true cluster
analysis techniques.

One could say that "the comments" provide the
conclusion that activity groupings defined by R-mode factor
analysis do not constitute an appropriately specified unit
for use in evaluating user satisfaction or in facilities
planning. At some time in the future, true cluster analysis
results along with other information may provide information
on substitutability that will be useful to planners.


The Hendee and Burdge paper is very important because 
it contains a discussion of concepts that are beginning to 
play a significant role in recreation research. However,
certain important ideas related to substitutability that 
have been developed in other articles are not mentioned and 
several of the articles that would have been useful to cite 
are not readily available (see References 1, 3, 9, 11).

There are, nevertheless, published articles not cited
by Hendee and Burdge that deal directly with
substitutability. One such article makes the point that when
measuring substitutability it is important to distinguish
between: (1) the clustering of activities obtained by a
factor analysis of the correlations between variables giving
frequency of participation in activities and (2) the
clustering of individuals on the basis of the activities in
which they participate. In a Canadian Outdoor Recreation
Demand (CORD) Study Report (see Reference 23), Rousseau
makes the following statement:


The approach of analysing participation data to
find clusters of people who have similar activity
patterns may be contrasted with an approach
described by Burton and Noad (see Reference 6).
This method is applied to 1969 National Survey
data in a thesis prepared by Gillespie (Reference
11). In Burton's "clustering approach"
correlations between activities are examined
without consideration of whether the correlation
between activities reflects a clustering of
individuals or not.

Burton (Reference 5) has suggested that
correlations between activities reflect trade-offs
that can be made in planning. However, the "true"
clustering approach used by Romsa et al.
(Reference 22) only supports Burton's claims when
the correlations observed have the right value for
individuals within a cluster. Even when
participation in activities is correlated for
individuals with the same activity package, two
activities may be complementary, not substitutable
(see Reference 1).


The Bishop material to which Hendee and Burdge refer
under the title "An Empirical Illustration of
Substitutability" is an example of the Burton type of
substitutability analysis to which the Rousseau article
takes exception. Burton's "clusters" are essentially the
same as the packages of "interrelated" activities reported
by Bishop (Reference 2) and confirmed to be found in other
analyses (Reference 24).

An important theme of these notes is that Burton's
R-mode factor analysis should not be used to derive activity
clusters if these clusters are to reflect substitutable
activities. To understand why this thesis is supported, it
is necessary to introduce a few basic considerations.
Firstly, R-mode factor analysis can be used to recognize
linearly "related" variables. The related variables can be
recognized by examining factor loadings, but the relation
only means what it is usually taken to mean when the data
have underlying dimensions possessed by all people. One
should recall early research on single or multiple
dimensions of intelligence. The point is that R-mode factor
analysis is most appropriately used to look for the
structure in data only when the invariance of a factor
structure holds for sub-groups in the population (see
References 14, 16): any sub-group of the population selected
for analysis should have the same dimensions as the whole
population.

In practical terms, then, data should not be considered
to have an internal structure such as that suggested by
Romsa's research on Canada if R-mode factor analysis is to
be used to discover related activities. Romsa's Quebec
results (Reference 22), which were not cited by Hendee and
Burdge but which did appear in the Journal of Leisure
Research (Reference 15), show that Quebec's population may
be broken up into collectivities of people on the basis of
the activities in which individuals participate, that is,
divided into collectivities having different activity
packages. Obviously, each collectivity discovered by Romsa
is characterized by the intercorrelations between the
activities defining that "cluster of people", not by
aggregate between-activity correlations for "the
population".

Yet the wary reader may feel that R-mode factor
analysis could be used to get factor scores and that factor
scores could be used to find clusters of individuals who
have similar scores on the dimensions discovered by the
R-mode factor analysis. Many things are possible: one can
even learn the same things from R and Q-mode factor analyses.
However, one who is familiar with cluster analysis knows
that even a Q-mode factor analysis is not a particularly
good methodology to discover a structuring of behaviour like
that found by Romsa.

A more detailed critique of the use of the factor
analysis "clustering of activities technique" to study
substitutability can be based on a number of points. (In
this connection, it may be noted that algorithms have been
developed to relate factor structures.) However, because of
the nature of the transformations involved, the factor
structures from two sets of data can often be related even
if there is little true underlying structural relationship.
Possibly of more importance theoretically is the fact that
the varimax solution is only one of an infinite number of
solutions to "the" factor analysis problem. Why should this
solution be singled out as the one in which factor loadings
show a person about substitutability? Bishop's work
(Reference 2) and comparative articles such as
Schmitz-Scherzer et al. (Reference 24) should be evaluated
with this point in mind when one is tempted to use the
results to point out meaningful collections of activities to
be used in planning or policy making.
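
The remark that a rotated solution is only one of infinitely
many can be checked numerically. The following is a minimal
sketch, assuming an orthogonal two-factor model; the loading
values are hypothetical and the helper names are invented for
the example. It rotates a loading matrix by an arbitrary
angle and confirms that the product of the loadings with
their transpose, which is all that the correlations between
activities constrain, is unchanged, even though the
individual loadings differ.

```python
import math


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(m):
    return [list(row) for row in zip(*m)]


def rotate_loadings(loadings, degrees):
    """Apply an orthogonal rotation to a two-factor loading matrix."""
    t = math.radians(degrees)
    rotation = [[math.cos(t), -math.sin(t)],
                [math.sin(t), math.cos(t)]]
    return matmul(loadings, rotation)


def reproduced_correlations(loadings):
    """Common part of the correlation matrix implied by the loadings (L L')."""
    return matmul(loadings, transpose(loadings))


# Hypothetical loadings of four activities on two factors.
LOADINGS = [[0.8, 0.1], [0.7, 0.2], [0.1, 0.9], [0.2, 0.6]]

if __name__ == "__main__":
    for angle in (0, 20, 45):
        rotated = rotate_loadings(LOADINGS, angle)
        print(angle, [[round(v, 2) for v in row] for row in rotated])
```

Every angle yields a different table of loadings that fits
the same correlations equally well, so a grouping of
activities read from one particular rotation needs a
justification beyond goodness of fit.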

Given the results of Schmitz-Scherzer et al., the
statements just made should not be taken as implying that
the R-mode factor analyses do not produce some interesting
insights about some things. The concern raised by the
preceding paragraphs is that factor analysis of the
correlation between variables will almost certainly give a
number of incorrect impressions if it is used to understand
substitutability. So, it is because of Hendee and Burdge's
reference to Bishop's work and their lack of reference to
Tatham and Dornoff (Reference 27) and to Romsa's papers
(Reference 22) that the preceding discussion is considered
particularly important. Hendee and Burdge did not
specifically endorse research using R-mode factor analysis
to study substitutability, but there is the clear possibility
that their article could result in such research being
started.

Other problems involved in understanding
substitutability that were not adequately discussed may be
understood by examining specific considerations. In the
Beaman and Leicester monograph (Reference 1), a major focus
is on breaking a population into socially meaningful
collectivities for which clusterings of activities and
activity packages related to behaviour can be discussed.
They stress the need to recognize collectivities such that
when any individual from a given collectivity is in a given
circumstance defined by availability of (1) recreation
facilities, (2) time, (3) people with whom to participate
and (4) mood, the decision made regarding participation in
an activity at a given facility will be typical of the
decision made by any member of the collectivity. The O'Leary
and Field (Reference 19) and other articles cited by Hendee
and Burdge clearly indicate that the kind of collectivities
to which reference has just been made are defined by social
variables in addition to the participation variables studied
by Romsa. But Hendee and Burdge do not carry their
discussion forward to draw the conclusion that behaviour
will be a function of all the factors just noted. Nor do
they adequately note that behaviour being a function of the
factors noted has important implications for analysis
methodology (research). The article implies that substitution
depends on (1) with whom, and (2) other socio-economic
variables including age. However, this raises few
difficulties. One important implication indicating real
problems is that supply distribution results in behaviour
failing to "mirror" preference. Does substitutability depend
on behaviour? Or does it depend on preference? Regardless,
it must be made abundantly clear that until there is
recognition of the collectivities within a population that
have common activity packages and other common behavioural
and social characteristics, and until it is determined how
substitution relates to available supply, trade-offs between
activities will at best be poorly understood.

An issue raised by the Rousseau quotation included
earlier, but not mentioned by Hendee and Burdge, is that
substitutable activities must be distinguished from
complementary activities. Participation in activities by
people with a common activity package may be highly
correlated for two reasons. High correlations may reflect
the fact that people recognize certain activities as
substitutable. Or the activities may be complementary, in
which case satisfaction gained from participating in one
activity is contingent on participating in the other. This
example may be interpreted in a trivial way: vigorous
exercise may be satisfying only if the possibility of
washing afterwards exists. Are Hendee and Burdge dealing
only with substitutability, or are they implicitly concerned
also with complementarity?

In conclusion, once one recognizes the importance of
distinguishing collectivities within a population which have
activity packages defined by a method other than factor
analysis, it becomes clear that questions, for example,
about the satisfaction of facility users should be raised on
the basis of these collectivities. Activity groupings
defined by R-mode factor analysis do not constitute an
appropriately specified "unit" for use in evaluating user
satisfaction or for use in planning what should be provided.
It is with respect to collectivities defined on the basis of
similar personal participation "sets" that meaningful
planning must take place, and it is with respect to these
collectivities that planning options and ramifications of
political decisions must be evaluated (see TN 32).


Similarly, the determination of activities for which
there are no substitutes is meaningful only in the context
of collectivities of people with similar behavioural "sets".
In this connection, Romsa's work shows that there exist
large numbers of people for whom there are probably no
substitutes for the very few activities in which they
participate. In contrast, the results obtained by Gillespie
using factor analysis, on the same data analysed by Romsa,
suggested that there are large numbers of trade-offs that
apply to all Canadians. Gillespie (following the Burton
framework) confused correlations in the aggregate with those
for meaningful collectivities.


In pursuing the important research area reviewed by
Hendee and Burdge, it is essential that people not make the
kind of mistakes in interpretation just noted or carry out
inappropriate analyses. It is because of this fear that
these comments have been written, not because of any basic
disagreement with Hendee and Burdge on the need to do
substitutability studies or on why such studies should be
done.


REVIEW


J. Beaman


The most appropriate comment on this chapter appears to
be that one should not become overly involved in thinking
that TN 14 and 37 have practical implications. The TN on
"Distance as a Function of Distance", while clarifying
something about how decisions may be made, does not dwell on
the fact that all the distance functions considered are so
close in shape (as can be seen from Figure 1 in the article)
that it makes little difference which function is used in a
particular modelling effort. One function may be better than
another for predicting trips made by people who come from a
long distance, but this depends on the type of trip being
considered.

Certainly, there appears to be merit in using a
function like the exponential function which does not show
extremely high use levels when a park is very close to
people. However, just because the exponential function does
not become infinite when the distance to travel to a park
equals zero does not mean it has the appropriate shape to
reflect behaviour. It seems to this writer that, in fact,
once a park is already close, the number of recreation trips
made into it would not change very much as the distance
becomes shorter and shorter, and this is the case with
exponentials and linear "gravity" functions. But selecting
the correct distance function is of minor importance
compared to other problems in specifying the correct
structure in modelling travel flow.
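
The behaviour of candidate distance functions near zero
distance can be tabulated directly. The sketch below is an
illustration only: the exponential form follows the text,
while the inverse-power form is assumed here as a contrasting
example of a function that grows without bound as distance
approaches zero. The parameter values and function names are
hypothetical and are not taken from any of the Technical
Notes.

```python
import math


def exponential_decay(distance, b=0.05):
    """Exponential distance function exp(-b * d); equals 1 at d = 0."""
    return math.exp(-b * distance)


def inverse_power(distance, b=2.0):
    """Inverse-power distance function d ** (-b); unbounded as d approaches 0."""
    if distance == 0:
        return math.inf
    return distance ** (-b)


if __name__ == "__main__":
    print("distance  exponential  inverse power")
    for d in (0.0, 0.5, 1.0, 5.0, 20.0, 50.0):
        print(f"{d:8.1f}  {exponential_decay(d):11.4f}  {inverse_power(d):13.4f}")
```

The exponential form stays bounded at short distances, which
is the property credited to it above; whether its shape at
short distances matches observed trip-making is a separate
empirical question that the table cannot settle.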

None of the articles in this volume (with the possible
exception of TN 20) are explicit about "magnitudes of the
problem" associated with different structural difficulties
with models. TN 19 and 35 present results which are typical
of what is found when one proceeds to use a simulation
approach to test how well models are working. The fact that
they are not working well is apparent, particularly when one
tests the Cesario model (TN 4) or Cheung Day-Use model (TN
1) in this way (see TN 4). If the models were structurally
sound, an R² of .95 rather than the lower R² found should be
expected. The major reason for the structural problems is
not errors in the distance function: at least this is
indicated by the fact that R² changes little when a
variety of distance functions are considered. Problems with
the way attractivity is measured, with the way alternative
factors are considered, with the total structure of models
(addition or multiplication, etc.) and with the
disaggregation of visitors are among the factors cited in
this volume that need to be dealt with much more than the
esoterics of choosing distance functions to reflect
behaviour.

Turning to TN 37, this reviewer can look back to his
own 1970 work and there he can find statements that the
problem of substitutability is so complicated that it is not
likely that in the near future (1) sufficiently large data
sets and (2) necessary theoretical development will take
place to allow one to make effective use of this "property
behaviour" in planning. Still, if unclear about the
consideration just noted, TN 37 is definitive in pointing
out the confusion that presently exists about how
substitutability analysis should proceed. Incidentally,
cluster analysis results are not simple to interpret and
getting them often involves use of mammoth amounts of
computer time if a reasonably large number of cases are to
be dealt with so that populations can be split into a number
of meaningful groups. Regarding the matter of interpretation
of clustering results, one should note that the discussion
of TN 37 totally ignores the influence of supply on
substitutability. One cannot simply move ahead using the
methods described in TN 10 to do clustering and then examine
clusterings to find out about substitutability. The reasons
that activities are substitutes in one place and are not in
another relate to the availability of supply. Availability
of supply does not simply relate to whether supply exists or
does not exist. One must consider levels of supply, its
attractivity and programs that encourage or discourage use
among certain groups (among other things). From a practical
perspective, traditional discussions of substitutability
ignore the fact that education is one of the primary
mechanisms that can be used to change behaviour.
Substitution as it exists presently may relate very much to
what is available, but that does not mean that substitution
in the future cannot relate very much to how we teach people
to respond to different opportunities.

Moving on to TN 32, a primary criticism that may be
laid against it is that it should have included practical
examples. Since the note was prepared, a practical example
of the application of clustering to National Park User
Survey information has occurred (see Honours Project by L.
Lee, Department of Recreation, University of Waterloo,
1975). This analysis did result in some important findings
for planning. It showed, for example, that only about 5
percent of the visitors to a particular park participated in
three or more interpretive activities. These visitors came
primarily from the people who stayed two, three or more
nights in the park. But little more was found out about
these visitors, for a very important reason. The cluster
analysis study, because of how little could be said about
visitors, showed that many of the questions asked in the
study had little value in understanding the visitor to the
park. In contrast to what was asked, such important
questions as those about visits to the townsites within
Banff or Jasper, and motivational questions about the why
and wherefore of a park visit, were not asked.

Further comment on TN 32 might be appropriate if it
were not for the fact that TN 13 (which appears in Chapter
9) presents some comments on the importance of clustering in
making projections of future recreation behaviour that
elaborate on the points made in TN 32.

Regarding equity considerations and other applications
of cluster analysis raised in the article, it is fair to say
that more detailed reference could be made to TN 10
results. Nevertheless, in such a review it is probably
sufficient to suggest that some article in the CORD Study
should have made more of the fact that a very high
percentage of Canadians do not participate in more than a
few of the more than 18 outdoor activities receiving
consistent consideration in CORD Study National Surveys. The
justification for not pursuing the policy implications of
the "equity of participation" findings is the problem of
whose policy to consider. It seems to the reviewer that
descriptive findings of practical interest could have been
presented without getting too involved in policy matters.


CHAPTER VII


SURVEY METHODS - ANALYSIS METHODOLOGY


INTRODUCTION 


All the articles included in this chapter arose because
of the need to pursue certain issues. The nature of these
needs is most easily seen in the context of a number of
comments about why different papers were prepared. For
example, very brief comment gives the reader the necessary
background about the preparation of the first three Notes in
this chapter. TN 15 was prepared because a number of readers
of CORD Study documents found it difficult to understand
what the analysis methodology described in TN 12 really
accomplished. By using an example in which the meaning of
effects could be more easily analyzed, it was felt that many
readers would see the results in such a way that they could
relate them to the socio-economic effects referred to in TN
12. It was quite by accident that data on distance from the
Continental Divide and elevation were studied in relation to
the depth of snowfall. The data were provided to the CORD
Study researchers at Parks Canada by Thorsell Banff, a
researcher whose studies of wilderness use are well-known
and who is now (1976) employed in Alberta.

One might have thought that comparing the CORD Study
National Survey results with other data which are available
on people's participation in the same activities would be an
obvious step in any research where there is a concern with
reliability and validity. This is what prompted making the
comparison presented in TN 24. In fact, making this
comparison is one of the factors that motivated the
inclusion of questions about different types of hunting and
fishing in the 1972 CORD Study of people's participation in
outdoor recreation activities (see the CORD Study Data
Documentation Volume).

TN 38 was prepared when it was necessary to produce
rather quickly figures on the value of park experiences. As
those familiar with the procedure used by Knetsch and Cheung
in preparing their estimates for TN 31 realize, it is not a
trivial matter to compute a demand function for the use of a
given park. When there is the possibility of obtaining
demand functions for hundreds of parks with different
characteristics, one is prompted to look for a methodology
that has a relatively sound theoretical basis and still can
be quite simply applied. The fact that developing such a
methodology pays other dividends in terms of showing
researchers the problem in estimating the time bias is
simply one of those fortunate coincidences that characterize
much of the progress that has been made in research.

TN 21 has an interesting history. The first design work
on CORD Study surveys of park use was the work done for a
1968 pilot survey. As indicated in the Data Documentation
Volume, when plans actually went ahead for major park user
surveys in Canada in 1969 and 1970, this pilot survey was
scrapped in favour of using an entrance survey methodology
proposed by Chubb and Crapo. The design work for the 1969
survey actually had much to recommend it, given what was not
known about park use (volume of park use on different days,
etc.). However, the way the days on which to survey were
defined and the way the sampling rates, recording procedures
for keeping track of forms handed out, etc., were specified
did not lead to good results. Large volumes of information
were lost, surveyors did not adhere to schedules, there were
no records of hours of the day during which questionnaires
were handed out, etc.

However, the experience did prompt the statistician who
was responsible for the earlier design to come up with a
much more sophisticated system in 1971. This was no longer
for the CORD Study but for a National Park User Survey. The
Survey was intended to meet the objectives of Parks Canada.
Users of the data collected were to be able to make seasonal
use estimates and to give profiles of entry over different
types of days. In developing the design, there was also the
practical concern of seeing that the survey staff stayed as
busy as possible so that accuracy was maximized. The 1971
Design evolved in 1972 and 1973 so that in TN 21 it is
possible to report on a survey system that is improved but
recognized to be still far from optimal. In the paper one
only sees the pinnacle of development in 1973 and not the
agonizing effort that preceded it. The reader should be
careful to recognize that the problems that can be seen at
the 1973 stage do not reflect initial poor work but rather
the cumulative improvement that can take place as experience
is gained.

TN 8 is much more important in the history of the way
the Canadian Outdoor Recreation Demand Study evolved than
the results indicate. Thinking of the issues which prompted
the development of the note resulted in conceptual
clarification both about model development and problems of
data collection. Explicitly confronting the issue of how to
use information about the occupancy of campsites on
different days during a season (given the nature of the use
pattern involved) reinforces one's concern with the
difference between week-end use and week-day use. It also
stresses the fact that there are not only week-end and
week-day users of parks but any number of kinds of users to
whom different models (different reactions to distance,
different park attractiveness, etc.) must apply. So this
note should be read in the context just described. However,
it should also be noted that the analysis procedure
described has been used to extract information about an
expected use pattern for a park from limited survey
information, and this use pattern has been applied to make
week-end and week-day use estimates for planners.
Incidentally, Scottish researchers Dufield and Archer have
recognized the merits of carrying out a similar procedure in
studying tourism in Scotland.

In Chapter VI, reference was made to the methodological
work of Romsa et al. on deriving recreation activity packages
using CORD Study National Survey data. Another methodology
has also been used to analyze these data: this is a
methodology proposed by Burton for deriving clusters of
activities. Thus, it is possible to present in one paper the
consequence of carrying out an analysis to define clusters
of activities and to define activity packages. Such results
are presented because early in the CORD Study it was deemed
extremely important to deviate from an activity by activity
analysis perspective and examine people's behaviour in terms
of the broad range of activities in which any one individual
may participate. Romsa was engaged by Parks Canada on
contract to carry out a cluster analysis in such a way as to
group people on the basis of the activities in which they
participate. This was at the same time that Gillespie was a
student of Burton. Gillespie took advantage of the
availability of the 1969 CORD Study Participation in Outdoor
Activities data to prepare a thesis in which he showed the
results of applying the factor analysis approach of Burton
to derive clusters of activities.

It was only in 1975 that the decision was made to
produce a paper that included the results of both analyses.
This was done so that the analyses could be seen together
and thus could be used (1) in getting a better view of the
issues raised in TN 32 and 37 and (2) by managers and
researchers to get a perspective on what types of people the
Canadian population encompasses in terms of their
participation in outdoor activities.

TN 18 arose in a very interesting way. Knetsch, after
being closely involved with the CORD Study for many years
(1966 - 1972), accepted an appointment in Malaysia. However,
while there he continued to work on CORD Study data because
of his interest in the study. He was concerned, for one
thing, because in developing destination models it was very
often the case that the models did not appear to be as good
as they should be. At the same time the CORD Study research
group with Parks Canada was investigating this same problem.
So when a paper by Knetsch arrived in Ottawa, the question
that Parks Canada researchers raised was not whether the
method that Knetsch proposed for weighting observations in
carrying out a regression was valid, but whether his weights
were the best weights to use. In trying to answer the
question it was found that a set of weights, which are
appropriate in a large number of circumstances, could be
derived theoretically. This was the beginning of TN 19 in
its present form. However, as work on it continued it became
clear that the theoretical results opened up the possibility
of making a test as to the actual acceptability of a given
model. From this sequence of events, TN 19 arose.
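
For readers unfamiliar with the idea of weighting
observations in a regression, a minimal sketch of weighted
least squares for a straight line follows. It illustrates
only the general technique: the weights proposed by Knetsch
and those derived for TN 19 are not reproduced here, so the
weights, data and function name below are hypothetical.

```python
def weighted_line_fit(xs, ys, weights):
    """Fit y = intercept + slope * x by weighted least squares.

    Each squared residual is multiplied by its weight before summing, so
    observations with larger weights pull the fitted line more strongly.
    """
    total = sum(weights)
    mean_x = sum(w * x for w, x in zip(weights, xs)) / total
    mean_y = sum(w * y for w, y in zip(weights, ys)) / total
    sxy = sum(w * (x - mean_x) * (y - mean_y)
              for w, x, y in zip(weights, xs, ys))
    sxx = sum(w * (x - mean_x) ** 2 for w, x in zip(weights, xs))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


if __name__ == "__main__":
    xs = [1.0, 2.0, 3.0, 4.0]
    ys = [5.0, 8.0, 11.0, 100.0]          # last observation is far off the line
    print(weighted_line_fit(xs, ys, [1.0, 1.0, 1.0, 1.0]))   # unweighted fit
    print(weighted_line_fit(xs, ys, [1.0, 1.0, 1.0, 0.0]))   # last point ignored
```

Setting all weights equal gives the ordinary unweighted fit,
so the choice of weights is the whole of the matter, which is
why the question of which weights are best carried so much
importance.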

In 1973, while Knetsch and Parks Canada researchers
were working on one version of TN 19, Goodchild was engaged
on contract by Parks Canada to do work testing models to see
if they were structurally sound and gave accurate
descriptions of origin destination flows. The research on
using weighted regression was already far enough developed
that Goodchild made use of the material and was able to
comment on using a simulation approach and various weighted
and unweighted regression procedures to test the goodness of
models. He was able to comment on how a comparison between
simulated results and real data indicated whether a model
was valid or not. Ultimately, TN 35 came to be (in many
respects) similar to TN 19, which it was originally to
complement. Nevertheless, TN 35 makes unique contributions
by presenting results on the effect of structural error on
estimating model parameters accurately, and by illustrating
a number of matters in relation to accuracy achieved, using
weighted or unweighted regression, with or without
non-linear estimation methods. Goodchild's TN 35 is
complementary to, not redundant with, TN 19.

Some of the issues discussed in Goodchild's TN 36 have
been recognized for some time as people such as Cicchetti,
Ferard and Davidson worked on origin models (see TN 34). But
the concerns discussed have not been brought together. It
was the recognition of this fact that prompted Parks Canada
to have work done on the value R² should be expected to have
when the kind of models used in TN 12 (also TN 6, 20 and
29) are developed.

The final note in this chapter (TN 42) was prepared
because many systems for handling geographic information are
being used (perhaps misused is a better word) by recreation
researchers. Little information is available that allows the
practitioner to know what such systems are capable of doing.
A simple overview of the issues involved in processing
simple geographic information has not been available. The
paper presented meets some of these needs by presenting
general concerns and specifics about a system that was
developed using a knowledge of problems that were relevant
to CORD Study and other recreation researchers.
CORD STUDY TECHNICAL NOTE 15


DEPTH OF SNOWFALL
AND ITS RELATION TO DISTANCE
FROM THE CONTINENTAL DIVIDE
AND ELEVATION ABOVE SEA LEVEL


H.K. Cheung and J. Beaman


ABSTRACT 


The objective of this paper is to present the results
of determining the parameters of an equation to be used to
predict snowfall in terms of the two variables, elevation
and distance from the continental divide.

The note is not provided as seminal or definitive
research. (It is often difficult to obtain simple examples
of the application of a technique that can be used in a wide
class of situations.) It does offer a simple and easily
understood application of the analysis of variance to a
practical problem.


OBJECTIVE 


This paper presents the results of determining the
parameters of an equation to be used to predict snowfall in
terms of the two variables, elevation and distance from the
continental divide. It will prove useful to the reader who
is not familiar with the analysis of variance by providing a
simple example that will enable him to gain a firm,
intuitive grasp of the kind of analysis that can be carried
out when some independent variables are nominal (for
example, occupation and marital status).


APPROACH TO ANALYSIS 


The data given in Table 1 were available for analysis.
These data provide information that may suggest a distance
or elevation relation of some significance in searching for
potential ski areas on the basis of available information on
elevation and location of the divide. Elevation and
distance from the divide could also be treated as continuous
(interval) variables.

There was no reason to expect a linear relationship (or
any particular relationship, for that matter) between the
variables considered. Given the limited data available, it
was decided to use an analysis of variance model (see
Reference 50) to describe the effects that are described
later. Use of an analysis of variance model requires that
TABLE 1


SNOWFALL DATA FOR THE CANADIAN ROCKIES
(IN THE AREAS INDICATED)


                                           West Distance   Average Annual
                               Elevation   from Divide     Snowfall
    Snow Station               (feet)      (mi.)           (19 yr ave.)

01  Canmore                    4350        24               62
02  Kananaskis Cabin           5500         8              134
03  Kananaskis Lookout         6800        10              151
04  Pocaterra Creek            6720        12              123
05  Highwood Pass              7250         6              173
06  Mud Lake                   6220         8              180
07  Evans-Thomas Creek         4950        20              103
08  Whiteman Pass              5600        12               85
09  Elbow Lake                 7150        12              175
10  Exshaw                     4200        21               76
11  Kananaskis Station         4560                        102
12  Anthracite                 4550        20               64
13  Banff Town                 4582        17               73
14  L. Louise*                 5032         5              193
15  Sunshine (3 yr. ave.)      7200         0              308
16  Norquay (3 yr. ave.)       6120        17              100


* Note discrepancy in Plan for snowfall at Louise.

<><><><><>


data be grouped to give categories but, because data were
limited, this grouping was not expected to cause a great
loss of information. It also made it possible to avoid the
assumption that snowfall was a function of distance from the
divide and elevation of the form:


Snowfall = U + C(1) (elevation) + C(2) (distance from divide)


where C(1) and C(2) are constants.


The categories used in grouping data were selected to
correspond with "natural" groupings in the data and so that
the parameters to be estimated could be estimated. The
design matrix was of full rank and did not involve too much
multicollinearity between components being considered. The
categories used are given in Table 2.
TABLE 2


CATEGORIES USED IN THE ANALYSIS OF VARIANCE


ELEVATION CATEGORIES                DIVIDE DISTANCE CATEGORIES

Category   Elevation in feet        Category   Distance from
           above sea level                     divide in miles

   1       Up to 4999                  1       0 - 6
   2       5000 - 5999                 2       7 - 12
   3       6000 - 6499                 3       13 - 19
   4       6500+                       4       20+


<><><><><>
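
The grouping in Table 2 can be sketched in a few lines of Python. This is only an illustration of the category boundaries, not the procedure the authors used; the function names are hypothetical, and the treatment of fractional distances (for example 6.5 miles falling in category 1) is an assumption, since the table lists whole miles only.

```python
from bisect import bisect_right

# Lower bounds of categories 2, 3 and 4, read from Table 2.
ELEVATION_BOUNDS_FT = [5000, 6000, 6500]
DISTANCE_BOUNDS_MI = [7, 13, 20]


def elevation_category(feet):
    """Return the Table 2 elevation category (1-4) for an elevation in feet."""
    return bisect_right(ELEVATION_BOUNDS_FT, feet) + 1


def distance_category(miles):
    """Return the Table 2 distance category (1-4) for miles from the divide."""
    return bisect_right(DISTANCE_BOUNDS_MI, miles) + 1


# Canmore (4350 ft, 24 mi) and Sunshine (7200 ft, 0 mi) from Table 1.
print(elevation_category(4350), distance_category(24))
print(elevation_category(7200), distance_category(0))
```

Each station is thereby reduced to a pair of category numbers, which is all the analysis of variance model uses.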


The model for which estimation was carried out can be
interpreted as follows:


Total snowfall for an elevation in Category X
and the distance from the divide in Category Y

     =  General snowfall

     +  Elevation effect for elevation Category X

     +  Distance from the divide effect
        for distance Category Y.


RESULTS OF ESTIMATION 


Many ways exist to estimate the "effects" that the
equation above indicates must be estimated. They are outside
the scope of this paper. (For two methods, see Reference
00; for discussion of methods using standard regression
programs, the dummy variables method, see Reference 00.)
When analysis of variance of snowfall data was carried out,
a fairly good explanation was achieved. A usual criterion
for goodness of explanation is R², which is the ratio of the
explained sum of squares to the total sum of squares. An R²
of .729 was achieved. Estimated values of coefficients for
which results were obtained are given in Table 3. To show
how well the predictions correspond with observations, Table
4 presents both observed and predicted results.


TABLE 3

ESTIMATED PARAMETER VALUES


General Snowfall Level = 130.95 inches


ELEVATION EFFECTS

Category   Category meaning          Effect
           (Elevation in feet)

   1       Up to 4999                  9.33
   2       5000 - 5999               -41.63
   3       6000 - 6499                30.33
   4       6500+                        .97


DISTANCE EFFECTS

Category   Category meaning          Effect
           (Miles from divide)

   1       0 - 6                     106.61
   2       7 - 12                     18.72
   3       13 - 19                   -61.28
   4       20+                       -64.03


<><><><><>


Table 4 actually has several functions. By looking at
it, one can see the effects that were added together to get
the predicted value for a given observation. Given a
hypothetical situation of the snowfall at 7,500 feet at 32
miles from the divide, the estimate of snowfall is obtained
just as it is in Table 4 for a known site:


Snowfall = general snowfall + (6,500+ ft elevation effect)
                            + (20+ miles distance effect)


Also, from the estimated parameter values in Table 3 it is
seen that the combination of 6000-6499 feet of elevation and
0-6 miles of distance from the divide will have the largest
positive effect on snowfall. Whether this is reasonable
requires a thorough knowledge of the geography of the study
area.

There is the possibility that the residuals shown in
Table 4 show some kind of systematic error. There could be a
distance-elevation interaction, meaning that combinations of
distance from the divide and elevation may be conducive to
high or low snowfall in a way that is not expressed by
simply adding an elevation effect and a distance effect.
Patterns such as these do not seem apparent in the data.

It is also possible that large differences between
observations and predictions that are seen in Table 4 may be
related to a third variable, which was not considered in
this analysis. Obvious variables to consider are orientation
and steepness of the slope on which the measurement was
taken. Or the residuals may indicate something about the
nature of the divide near the area being considered. Large
negative values of residuals may indicate one kind of divide
terrain while positive residuals may indicate quite
different geographic characteristics of the nearby divide.
Such factors are likely to be picked out by somebody with a
"feel" for the area but are almost never detected in
a purely ad hoc statistical approach to a problem.


TABLE 4


OBSERVED AND PREDICTED VALUES
FOR THE OBSERVATION UNITS


Unit   Obs.       Predicted Snowfall          Residual   Elev.      Dist.
       Snowfall   (inches)                               Variable   Variable
                                                         Value      Value

 01      62       131 +  9 -  64 =  76          - 14        1          4
 02     134       131 - 42 +  19 = 108            26        2          2
 03     151       131 +  1 +  19 = 151             0        4          2
 04     123       131 +  1 +  19 = 151          - 28        4          2
 05     173       131 +  1 + 107 = 239          - 66        4          1
 06     180       131 + 30 +  19 = 180             0        3          2
 07     103       131 +  9 -  64 =  76            27        1          4
 08      85       131 - 42 +  19 = 108          - 23        2          2
 09     175       131 +  1 +  19 = 151            24        4          2
 10      76       131 +  9 -  64 =  76             0        1          4
 11      64       131 +  9 -  64 =  76          - 12        1          4
 12      79       131 +  9 -  61 =  79             0        1          3
 13     193       131 - 42 + 106 = 195          -  2        2          1
 14     308       131 +  1 + 106 = 238            70        4          1
 15     100       131 + 30 -  61 = 100             0        3          3


<><><><><>
DISCUSSION OF THE RESULTS


When examining Table 4 one sees that the residuals
range from -66 to 70, with an average error of
14.2 percent. It is interesting to note that among the
residuals, the two largest (-66 and 70) are associated with
observation units that are at short distances from the
divide and have high elevations. Reasons for this large
discrepancy are not immediately apparent from the data. Some
variables that are of importance in explaining snowfall (and
that were excluded from the analysis) may account for these
large residuals. Nevertheless, this model as it is may yield
poor predictions of snowfall for an observation unit at a
high elevation that is also at a short distance from the
divide. On the other hand, it seems clear that it may give
relatively good predictions for those observation units for
which other combinations of elevation and distance from the
divide exist.
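
The range of residuals and the average error quoted above can be checked against Table 4 with a short calculation. The sketch below assumes that "average error" means the mean of the absolute residuals expressed as a percentage of the observed values, and it uses the observed and predicted columns of Table 4 as read here (a few entries are hard to read in the source, so the lists are an assumption to that extent).

```python
# Observed and predicted snowfall (inches) for units 01-15 of Table 4.
OBSERVED = [62, 134, 151, 123, 173, 180, 103, 85, 175, 76, 64, 79, 193, 308, 100]
PREDICTED = [76, 108, 151, 151, 239, 180, 76, 108, 151, 76, 76, 79, 195, 238, 100]

# Residual = observed minus predicted, as in Table 4.
residuals = [obs - pred for obs, pred in zip(OBSERVED, PREDICTED)]

# Mean absolute residual as a percentage of the observed value.
mean_abs_pct_error = 100 * sum(
    abs(res) / obs for res, obs in zip(residuals, OBSERVED)
) / len(OBSERVED)

print(min(residuals), max(residuals))
print(round(mean_abs_pct_error, 1))
```

With these inputs the residuals run from -66 to 70 and the average comes to about 14.2 percent, in line with the figures in the text.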

The use of average annual snowfall as the dependent
variable may present some problems, since large extreme
values of snowfall were observed. Since the mean snowfall is
affected appreciably by large extreme values, the median
might have been a better choice. Another dependent variable
that could be used in the analysis is the annual (or even
monthly) snowfall for each observation unit. The use of time
series data offers the advantage that the situations being
modelled are more truly reflected, in that variations that
occur over time are open to explanation. On the other hand,
the use of time series data requires the specification of
variables and related parameters to be used to account for
over-time (annual) variations in snowfall. In a model
developed for time series data it might occur that a
multiplier to correct for level of snowfall could be applied
so that:


Observation for year t = (Level multiplier for year t) *
     (general mean + elevation effect + distance effect).


When analysis is carried out on more data, it will be
possible to correct for different variances on observations
and intercorrelations between observations. The present
analysis simply ignores the fact that observations at
different stations are almost certainly correlated. Also it
is reasonable to think that there will be more variation
(absolute) in amount of snow in high snowfall areas than in
low snowfall areas. Though the lack of knowledge about these
factors does not cause a bias in the results presented, it
lowers the efficiency of the estimates, meaning that better
predictions could be made.

It may have occurred to some readers that the parameter
values in Table 3 are difficult to accept in a practical
sense. For instance, the difference between elevations 5999
feet and 6000 feet is only one foot and yet the difference
in the effect on snowfall is almost 72 inches. The
difficulty clearly is one of interpretation. The reader is
reminded that the explanatory variables used are not
continuous but are categorical or classificatory. As such,
the parameters presented must be interpreted accordingly:
the difference in effect on snowfall between elevation
category 2 and category 3 is almost 72 inches. One could
associate parameter values with a category mean and then
draw a curve through the parameter values. The "appropriate"
parameter for an elevation could be read from the curve and
this estimate would not suffer from the problem just noted.
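
One simple way to "draw a curve through the parameter values" is straight-line interpolation between category means. The sketch below is an illustration only: the mean elevations are computed here from the Table 1 stations falling in each category (leaving out the station whose distance is not given), the category 4 effect is taken as 0.97, and straight-line segments are just one possible choice of curve. None of these choices comes from the note itself.

```python
# (mean elevation of the stations in the category, estimated effect in inches)
# Means are derived from Table 1 elevations; effects are from Table 3.
POINTS = [
    (4526.4, 9.33),    # category 1
    (5377.3, -41.63),  # category 2
    (6170.0, 30.33),   # category 3
    (7024.0, 0.97),    # category 4 (effect value assumed)
]


def interpolated_elevation_effect(feet):
    """Piecewise-linear elevation effect; held constant outside the range of means."""
    if feet <= POINTS[0][0]:
        return POINTS[0][1]
    if feet >= POINTS[-1][0]:
        return POINTS[-1][1]
    for (x0, y0), (x1, y1) in zip(POINTS, POINTS[1:]):
        if x0 <= feet <= x1:
            return y0 + (y1 - y0) * (feet - x0) / (x1 - x0)


# The one-foot step from 5999 to 6000 feet no longer produces a 72-inch jump.
step = interpolated_elevation_effect(6000) - interpolated_elevation_effect(5999)
print(round(step, 2))
```

Read this way, neighbouring elevations receive nearly equal effects, while the fitted category values are still reproduced at the category means.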


CONCLUSION 


The model derived suggests a reasonable means of
predicting snowfall for a weather station in terms of its
elevation measured in feet and its distance from the divide
measured in miles. There is no assumption that depth of
snowfall varies linearly with the two variables considered;
and the results suggest that it does not. It is only assumed
that the effects on snowfall due to the explanatory
variables can be separated so that the overall effect is the
sum of the individual effects. Interaction between the
explanatory variables used was not hypothesized, since such
a relationship was not apparent from the data provided.
Exclusion of explanatory variables that are significant in
predicting snowfall (such as the nature of the terrain) may
have caused the two large residuals mentioned before. Of
course, the adequacy of the model to predict can only be
determined after it has been applied to additional sets of
data.


TN 15 CHAPTER VII page 431 


CORD STUDY TECHNICAL NOTE 24 


THE COMPARABILITY OF NATIONAL SURVEY DATA 
ON PARTICIPATION IN HUNTING AND FISHING 
WITH OTHER DATA 
ON THE SAME OUTDOOR RECREATION ACTIVITIES 


S. Rousseau


ABSTRACT 


Research scientists can have little faith in results
obtained and conclusions drawn from a study if the
reliability and validity of the data are not known or cannot
be established.

This paper takes a step toward measuring the
reliability and validity of data from certain national
surveys on participation of Canadians in outdoor recreation
activities. The procedure followed is to (1) test
reliability by determining if the measurements are
consistent, and (2) test validity by comparing the results
of national surveys with the findings of independent
studies.

In the comparisons reviewed, a variation of 10 percent
between the data of the national surveys and that of the
control sources is sometimes used as the critical threshold
value. When the data permit, significance tests are applied
to discover if the differences noticed between percentages
of the national surveys and the percentages of the "control"
sources were chance differences due to sampling or not.

In the first part of the paper, the statistical data
used are presented and some explanations about the national
surveys are given. The significance tests used are
explained in a second part. Data on hunting and fishing are
compared in the two remaining sections.


OBJECTIVE 


Criticisms concerning the reliability and validity of
data have been raised about some national surveys on
participation of Canadians in outdoor recreation activities
because no reliability and validity analyses had been
presented. The objective of this paper is to take a step
towards measuring the reliability and validity of some CORD
Study National Survey results.
INTRODUCTION


Concern for reliability comes from the necessity for
dependability in measurement and, in its simplest form,
refers to the prospects for obtaining consistent and similar
measurements when a data collection procedure is replicated.
Kerlinger (see Reference 36) approaches the concept of
reliability with three questions.


1. If we measure the same set of objects again and
   again with the same or comparable measuring
   instrument, will we get the same or similar
   results?

2. Are the measures obtained from a measuring
   instrument the "true" measures of the property
   measured?

3. How much error of measurement is there in a
   measuring instrument?


In a similar vein Kendall and Buckland (see Reference 35)
propose a functional definition of reliability which takes
into consideration the concepts of stability and random
error. Reliability is conceived as "that part which is due
to permanent systematic effects, and therefore persists from
sample to sample, as distinct from error-effects which vary
from one sample to another."

The merits of having reliability figures as a measure
of accuracy are obvious. However, even if the statistical
results are proven to be accurate, statistically stable and
precise, this alone cannot convince the reader that the
results obtained are valid. Reliability does not take into
consideration the congruence of the object measured with
that of the measuring instrument. In practical terms,
techniques for assessing reliability determine the
statistical accuracy and reproducible precision of the
measuring instruments but neglect to show whether or not the
measuring instruments truly measure what was intended to be
measured in the first place. It is therefore quite possible
to have reliable statistical results that are not accurate
in the sense of being valid.

Kendall and Buckland (Reference 35, p. 309) propose a
definition of validity which takes into consideration the
representativity of a sample and is otherwise known as
content validation by Kerlinger (see Reference 36).
Validation is "a procedure which provides, by reference to
independent sources, evidence that an inquiry is free from
bias or otherwise conforms to its declared purpose. In
statistics it is usually applied to a sample investigation
with the object of showing that the sample is reasonably
representative of the population..." In this context a good
test for validity is to compare survey results with the
findings of independent non-survey studies. If the results
are similar, one can assume that the survey is probably
valid.

The procedure that is followed in this paper is (a) to
test the reliability of the national surveys by determining
if the measurements are consistent from survey to survey,
and (b) to test the validity of the national surveys by
comparing their results with the findings of independent
studies.

Although it is not particularly desirable, in the
comparisons presented in this paper a variation of 10
percent between the data of the national surveys and that of
the control sources is sometimes used as the critical
threshold value for questioning the reliability and the
validity of the results of the national surveys. However,
when data permit, significance tests are applied to discover
if the differences noticed between percentages of the
national surveys and the percentages of the "control"
sources were chance differences due to sampling or not.


THE STATISTICAL DATA COMPARED 


The CORD Study National Survey results to be tested for
reliability and validity are described and data
documentation provided elsewhere (Volume III). Briefly, in
these surveys, questions were asked to determine the
participation and frequency of participation of respondents
in outdoor recreation activities. In 1967, 22 activities
were covered; in 1969, 26 activities were covered and in
1972, there were 28 activities. (For more detail see TN 22.)
Participation in hunting was obtained in 1967, 1969 and
1972; however, information on specific types of hunting
(small game, large game, waterfowl) was obtained only in the
1972 survey. Information on fishing was obtained in 1967 and
1972 while participation in specific types of fishing (salt
water, fresh water) was procured only in 1972.

The other data sources used for the comparisons in this
research paper are shown in Table 1. The National Survey
results (Sources 1-4) are compared with statistical data
from other sources (5-10) described in the Appendix. The
variations in the methods of data collection, years of data
collection and age groups for which the data were collected
are noted in describing the respective studies.


COMPARISON OF HUNTING DATA 


The 1967, 1969 and 1972 surveys may be compared to
provide some check on survey reliability. From Table 2, it
can be seen that as far as hunting in general is concerned,
the percentage of Canadians who hunted was fairly constant
from 1967 to 1972. The maximum variation is 3%, i.e. when
1967 and 1972 percentages are compared. This difference is
small enough that it probably reflects trends over time, so
that the data on hunting seem reliable even if the 3% maximum
difference is highly significant (a > .002). When the data
from the three national surveys are compared on a regional
basis (see Table 2), the maximum differences between two
TABLE 1


DATA SOURCES AND THEIR CHARACTERISTICS


1) Source:       1967 CORD Study National Participation Survey
   Type of Data: National Survey by personal interviews
   Year of Data: Fall of 1967
   Age Groups:   18+

2) Source:       1969 CORD Study National Participation Survey
   Type of Data: National Survey by personal interviews
   Year of Data: Fall of 1969
   Age Groups:   18+

3) Source:       1972 CORD Study National Participation Survey
   Type of Data: National Survey by personal interviews
   Year of Data: Fall of 1972
   Age Groups:   10 and 18+

4) Source:       D.A. Benson, "Fishing and Hunting in Canada
                 1961: A report on an Economic Survey."
   Type of Data: Labour Force Survey sample of 30,000
   Year of Data: February 1962
   Age Groups:   14+

5) Source:       "Statistics on Sales of Sport Fishing Licences
                 in Canada 1966-1971".
   Type of Data: Figures on licence sales and fees on the basis
                 of information provided by the agency
                 responsible in each jurisdiction.
   Year of Data: April 1 to the next March 31 for 66, 67, 68,
                 69, 70, 71 and
   Age Groups:   16+ to 19+ depending on agencies' licencing
                 practices.

6) Source:       "Report on Sales of the Canada Migratory Game
                 Bird Hunting Permit, Migratory Game Bird
                 Harvest and Hunter Activity, 1971".
   Type of Data: Canadian migratory game bird hunting permits
                 sold at post offices across Canada and a
                 Harvest Survey.
   Year of Data: Sales of permits from August 1971 through
                 January 1972. 10,603 useable mailed
                 questionnaires for Harvest Survey.
   Age Groups:   Unspecified

7) Source:       "Travel, Tourism and Outdoor Recreation:
                 A Statistical Digest 1972"
   Type of Data: Information drawn from a variety of sources.
   Year of Data: Between 1965 and 1971
   Age Groups:   Unspecified

8) Source:       P.G. Whiting, "A Comparison of Two Estimates
                 of Angler Numbers in Canada."
   Type of Data: Compares 1972 National Survey data on fishing,
                 Recreation Fisheries Branch estimates of angler
                 numbers in Canada and U.S.A. 1970 Survey of
                 Hunters and Fishermen data.
   Year of Data: 1967, 1970, 1972 figures are compared.
   Age Groups:   10+, 12+, 16+ and 18+

9) Source:       "National Survey of Fishing and Hunting 1970".
   Type of Data: 8,700 sportsmen were interviewed, age 12 years
                 and older, who participated on any of three
                 different days or more, or spent $7.50 or more
                 to go fishing or hunting during 1970.
   Year of Data: 1970
   Age Groups:   12+


<><><><><>
surveys are all still less than 10%. Only two differences
(out of 8) are significant (Canada: a > .002; Ontario: a >
.001; differences have the low probability of occurring as
shown).

A comparison of the 1961 Canadian Wildlife Survey and
the 1972 national survey results presents major problems
from the point of view of evaluation of what differences
mean. The results shown in Table 3 may reflect an increase
in participation in hunting in Canada between 1961 and 1972.
It will also be noticed that the age groups of the two
surveys compared do not coincide exactly. However, all the
increases are less than 10% and so the 1972 national survey
results seem plausible even if three differences out of four
which were tested are highly significant (a < .001).
Considering that the wildlife survey was a "focused survey"
with much "tighter" sampling controls carried out and with
many basic differences from the 1972 national survey, it is
encouraging to see that differences are as small as they
were found to be.

Comparison of 1970-71 Hunting Licences Sales Data and
the 1972 National Survey data, Canada and Provinces,
potentially allows the testing of survey validity. Thus it
is fortunate that the two types of data on hunting in
general (Table 4), the 1972 National Survey and the 1970-71
Hunting Licences Sales, yield similar results except for
three Western Provinces. In the western provinces there
could be problems such as (1) unreliability of the national
survey sample as far as provinces are concerned; (2) special
circumstances surrounding licence regulations; or (3) the
failure of many people to actually hunt after purchasing
hunting licences. Actually 5 out of 8 differences tested
are significant, reflecting some basic discrepancy between
the two data sources.

The same similarity between 1972 National Survey
results and licence data is seen in Tables 5-7. Yet here
again similarity is associated with statistically
significant differences. In Table 5 are shown the
differences of percentages of participants in Small Game
Hunting. It will be noticed that all differences are less
than 10% and still three of these differences out of 8 are
highly significant. In Table 6, on Large Game Hunting, only
one difference is greater than 10% but 4 differences out of
8 are significant. In Table 7, we notice that as far as
Waterfowl Hunting is concerned the 1972 National Survey
results are not very different from Permit Sales estimates.
None of the differences are close to 10%; however 3
differences are statistically significant.

Comparison of U.S.A. 1970 National Survey data on
Hunting and 1972 National Survey is of some value. Canadian
and U.S. statistics are often quite similar. As can be seen
from Table 8, there is similarity as far as percentage of
hunters are concerned, even when the sexes are considered
separately; all differences are much lower than 5%, which is
indeed surprising. Significance tests could not be applied
to figures by sex, as data on the number of males and
TABLE 2


PERCENTAGE OF CANADIANS 18 YEARS AND OVER
WHO DID PARTICIPATE IN HUNTING
IN 1967, 1969, AND 1972, BY REGION,
ACCORDING TO NATIONAL SURVEY DATA


                         1967          1969          1972        Maximum
                                                                 Difference
                        %     #       %     #       %     #      D%

Canada                 14   5986     13   2967     11   3002      3%
Atlantic Provinces     21    625     15    288     21    295      6
Quebec                 11   1742     12    882     10    870      2
Ontario                12   1972     10   1003      8   1018      4
Manitoba                8    320     11    147     12    148      4
Saskatchewan           12    281     19    124     16    132      7
Alberta                14    433     14    217      8    281      6
British Columbia       15    603     16    306     15    288      1


                                  Standard
                                  Error of     Confidence
                                  maximum      Coefficient    Significance
                                  difference   Z(c) =         Level
                                  σD%          x/σD%          a

Canada (67-69)                    1.00%        3.00           > .002
Atlantic Provinces (72-69)        3.18         1.89           > .05
Quebec (69-72)                    1.48         1.35           > .17
Ontario (67-72)                   1.23         3.25           > .001
Manitoba (67-72)                  2.85         1.40           > .16
Saskatchewan (67-69)              3.72         1.88           > .06
Alberta (69-72)                   2.90         2.07           > .03
British Columbia (69-72)          2.93         0.34           > .73


<><><><><>
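
The relation between the last two columns of Table 2 can be illustrated with a short Python sketch. It assumes that the confidence coefficient is treated as a standard normal deviate and that the significance level is the two-tailed probability of a deviate at least that large; this reading is consistent with the tabulated pairs (for example Z = 3.00 with a > .002) but is not stated in this section. The function names are illustrative, and the way the standard error of a difference between two surveys was computed is not reproduced here.

```python
import math


def confidence_coefficient(difference_pct, std_error_pct):
    """Z(c): the difference divided by its standard error (both in percent)."""
    return difference_pct / std_error_pct


def two_tailed_significance(z):
    """Two-tailed probability of a standard normal deviate at least as large as |z|."""
    return math.erfc(abs(z) / math.sqrt(2))


# Canada row of Table 2: maximum difference 3%, standard error 1.00%.
z = confidence_coefficient(3.0, 1.00)
print(z, round(two_tailed_significance(z), 4))
```

A probability of about .0027 is just above .002, which matches the way the table reports the level as "> .002".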
TABLE 3


COMPARISON OF RESULTS
OF 1961 CANADIAN WILDLIFE SERVICE SURVEY
AND 1972 NATIONAL SURVEY OF CANADIAN HUNTERS
(SIGNIFICANCE TEST)


                         C.W.S.         National
                         1961 Survey    1972 Survey
                         Age 14+        Age 15+
                         (N=30,000)     (N=3,255)      Difference
                         (a)            (b)            x = b-a

Hunting (all types)       6.5%          11.1%           4.6%
Small Game Hunting        3.4            8.0            4.6
Large Game Hunting        3.8            5.0            1.2
Waterfowl Hunting         2.8            3.4            0.6


                         Standard
                         Error          Confidence
                         of the         Coefficient    Significance
                         Difference     Z(c) =         Level
                         σD%            x/σD%          a

Hunting (all types)       .45%          10.22          < .001
Small Game Hunting        .32           14.38          < .001
Large Game Hunting        .32            3.75          < .001
Waterfowl Hunting         .32            1.88            .06


<><><><><>
females in the U.S. sample were not available. However, as
might be suspected, superficial similarity is contrasted by
statistical difference. Three out of 4 tests show that the
differences are highly significant.


SYNOPSIS ON HUNTING COMPARISONS 


Out of 60 comparisons on participation in hunting, only
7% (4/60) of the differences computed are greater than 10%.
However, 47% of the a's computed (23/49) are significant
statistically. There are few large differences between
national survey results and results from other sources, even
though almost half of the differences are statistically
significant. In more detail, the national survey data on
Canadian hunters are consistent for the three years: 1967,
1969, and 1972. They are not very different from the data
collected in a comparable survey in 1961. The 1972 results
are also quite similar to estimates of hunters made from
Licences Sales data, and that is true for all types of
hunting: small game, big game, and waterfowl hunting. The
percentages of Canadian hunters given by the 1972 survey are
also very similar to 1970 U.S.A. data. In sum, it seems
that the hunting data are reasonably reliable and valid.
TABLE 4


COMPARISON OF PERCENTAGES OF CANADIAN HUNTERS (ALL TYPES)
ACCORDING TO 1970 - 1971 LICENCE SALES
AND 1972 NATIONAL SURVEY, CANADA AND REGIONS
(SIGNIFICANCE TESTS)


                      1970-71         1972
                      Hunting         National
                      Licences        Survey
                      Sales, 16+*     Hunters, 16+          Difference
                      (u)%            (X)%          #       X% - u%

Canada                15.8            10.8         3155      -5.0%
Atlantic Provinces    18.5            15.4          312      -3.1
Quebec                 9.3            10.3          917       1.0
Ontario               11.5             9.3         1069      -2.2
Manitoba              13.6            11.1          153      -2.5
Saskatchewan          26.2            13.7          139     -12.5
Alberta               34.5             9.6          260     -24.9
British Columbia      28.8            12.8          305     -16.0


                                      Confidence
                                      Coefficient
                      σD% = 100       Zc = (X% - u%)/       Significance
                      (pq/n)**1/2     σD%                   Level a

Canada                0.55%           -9.09                 < .001
Atlantic Provinces
Quebec                1.00             1.00                 > .31
Ontario               0.89            -2.47                 > .001
Manitoba              2.53            -0.99                 > .32
Saskatchewan          2.92            -4.28                 < .001
Alberta               1.82           -13.68                 < .001
British Columbia      1.92            -8.33                 < .001


* Licence sales % were calculated from absolute number of
  Licence Sales given in Statistics Canada, TRAVEL,
  TOURISM, AND OUTDOOR RECREATION: A STATISTICAL DIGEST,
  1972, CAT. 66-202, Table 8.7, p. 94. Population 16 years
  and over is given in 1971 Census of Canada, Statistics
  Canada, CAT. 92-715/716.
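
The column headings of Table 4 give the standard error as 100 (pq/n)**1/2 and the confidence coefficient as the difference divided by that standard error. A minimal Python sketch of that calculation follows. It assumes that p is the survey proportion X (this choice reproduces the tabulated 0.55% for Canada) and that q = 1 - p; the function names are illustrative only.

```python
import math


def std_error_pct(survey_pct, n):
    """Standard error of a sample percentage, 100 * sqrt(p * q / n), with q = 1 - p."""
    p = survey_pct / 100.0
    return 100.0 * math.sqrt(p * (1.0 - p) / n)


def confidence_coefficient(survey_pct, licence_pct, n):
    """Zc = (X% - u%) divided by the standard error of X%."""
    return (survey_pct - licence_pct) / std_error_pct(survey_pct, n)


# Canada row of Table 4: u = 15.8%, X = 10.8%, n = 3155.
se = std_error_pct(10.8, 3155)
z = confidence_coefficient(10.8, 15.8, 3155)
print(round(se, 2), round(z, 2))
```

Carried at full precision the coefficient comes out near -9.05; the table's -9.09 corresponds to dividing by the rounded standard error of 0.55.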
TABLE 5


COMPARISON OF PERCENTAGES OF CANADIAN SMALL GAME HUNTERS
ACCORDING TO 1970 - 1971 LICENCE SALES
AND TO 1972 NATIONAL SURVEY, CANADA AND PROVINCES
(SIGNIFICANCE TEST)


                      1970-71         1972
                      Small Game      National
                      Hunting         Survey
                      Licences        Small Game
                      Sales, 16+*     Hunters, 16+          Difference
                      (u)%            (X)%          #       X% - u%

Canada                 7.1             7.7         3155       0.6%
Atlantic Provinces     7.3            12.5          312       5.2
Quebec                 5.1             7.2          917       2.1
Ontario                7.1             7.4         1069       0.3
Manitoba               6.7             6.5          153      -0.2
Saskatchewan           9.9             9.4          139      -0.5
Alberta                7.8             5.8          260      -2.0
British Columbia      10.7             6.6          305      -4.1


                                      Confidence
                                      Coefficient
                      σD% = 100       Zc = (X% - u%)/       Significance
                      (pq/n)**1/2     σD%                   Level a

Canada                0.45%            1.33                 > .18
Atlantic Provinces    1.87             2.78                 > .005
Quebec                0.83             2.53                 > .01
Ontario               0.78             0.38                 > .70
Manitoba              2.00            -0.10                 > .92
Saskatchewan          2.47            -0.20                 > .84
Alberta               1.45            -1.38                 > .16
British Columbia      1.41            -2.91                 > .003


* See remarks below Table 4.


<><><><><>




TABLE 6


COMPARISON OF PERCENTAGES OF CANADIAN LARGE GAME HUNTERS
ACCORDING TO 1970 - 1971 LICENCE SALES
AND 1972 NATIONAL SURVEY, CANADA AND REGIONS
(SIGNIFICANCE TEST)


                      1970-71         1972
                      Large Game      National
                      Hunting         Survey
                      Licences        Large Game
                      Sales, 16+*     Hunters, 16+          Difference
                      (u)%            (X)%          #       X% - u%

Canada                 7.1             5.1         3155      -2.0%
Atlantic Provinces    10.6             9.0          312      -1.6
Quebec                 3.9             4.1          917       0.2
Ontario                3.1             3.7         1069       0.6
Manitoba               6.9             6.5          153      -0.4
Saskatchewan          16.3             7.9          139      -8.4
Alberta               14.3             3.8          260     -10.5
British Columbia      18.1             8.2          305      -9.9


                                      Confidence
                                      Coefficient
                      σD% = 100       Zc = (X% - u%)/       Significance
                      (pq/n)**1/2     σD%                   Level a

Canada                0.45%           -4.44                 < .001
Atlantic Provinces    1.61            -0.99                 > .32
Quebec                0.63             0.32                 > .74
Ontario               0.55             1.09                 > .27
Manitoba              2.00            -0.20                 > .84
Saskatchewan          2.28            -3.68                 < .001
Alberta               1.18            -8.90                 < .001
British Columbia      1.58            -6.27                 < .001


* See remarks below Table 4.


<><><><><>
TABLE 7


COMPARISON OF PERCENTAGES OF CANADIAN WATERFOWL HUNTERS
ACCORDING TO 1970-1971 LICENCE SALES
AND 1972 NATIONAL SURVEY, CANADA AND REGIONS
(SIGNIFICANCE TEST)


                 1971
                 Migratory     1972
                 Game Bird     National
                 Hunting       Survey
                 Permit        Waterfowl
                 Sales, 16+*   Hunters, 16+              Difference
                 (u)%          (X)%            #         X% - u%

Canada            2.6           3.5           3155        0.9%
Atlantic          3.6           3.5            312       -0.1
 Provinces
Quebec            1.2           2.5            917        1.3
Ontario           2.3           3.6           1069        1.3
Manitoba          5.4           5.2            153       -0.2
Saskatchewan      6.6           7.2            139        0.6
Alberta           5.5           5.0            260       -0.5
British           2.0           2.6            305        0.6
 Columbia

                               Confidence
                 YD% = 100     Coefficient
                 (pq/n)**1/2   Zc = (X% - u%)/YD%   Significance
                                                    Level α

Canada            0.32%           2.81              > .005
Atlantic          1.05           -0.10              > .92
 Provinces
Quebec            0.55            2.36              > .01
Ontario           0.55            2.36              > .01
Manitoba          1.79           -0.11              > .91
Saskatchewan      2.19            0.27              > .86
Alberta           1.34           -0.37              > .71
British           0.89            0.67              > .50
 Columbia

* Permit sales % were calculated from the absolute number of
  permit sales given in Canadian Wildlife Service, Progress
  Notes, No. 28, July 1972. Population 16 years and over is
  given in 1971 Census of Canada, Statistics Canada, CAT. No.
  92-715 and 92-716. These figures are given in Appendix.
TABLE 8


PERCENTAGES OF HUNTERS IN CANADA (1972 NATIONAL SURVEY)
AND IN U.S.A.
ACCORDING TO SEX
(SIGNIFICANCE TEST)


                 U.S.A.        Canada
                 1970, 12+     1972, 12+
                 (N=8,700)     (N=3681)       Difference
                 (a)           (b)            x = b-a

HUNTING
  Total           9.2%         12.0%           2.8%
  Males          18.3          21.3            3.0
  Females         1.1           3.2            2.1
SMALL GAME
  Total           7.5           9.0            1.5
  Males          15.0          16.2            1.2
  Females         0.7           2.3            1.6
LARGE GAME
  Total           5.0           4.7           -0.3
  Males           9.9           8.4           -1.5
  Females         0.6           1.3            0.7
WATERFOWL
  Total           1.9           3.7            1.8
  Males           3.8           7.1            3.3
  Females         0.1           0.4            0.3

                 Standard      Confidence
                 error of the  coefficient    Significance
                 difference    (Zc = x/YD%)   Level α
                 YD%

HUNTING           .59%          4.75          < .001
SMALL GAME        .55           2.73          > .006
LARGE GAME        .45          -0.67          > .50
WATERFOWL         .32           5.63          < .001

<><><><><>
COMPARISON OF FISHING DATA


Regarding the 1967, 1969 and 1972 national surveys, it
is unfortunate that only the 1972 survey gives data on
detailed types of fishing. However, as far as fishing in
general is concerned, the percentage of Canadians who
fished is not very different in 1967 and 1972, as can be
seen in Table 9. Since the variation between the results of
the two surveys is only 4%, it could be concluded that the
data on the percentage of Canadian sport fishers in general
are fairly reliable, even though this 4% difference is
significant at the .001 level.

However, there are greater variations between the data
of the 1967 and 1972 surveys when one compares results by
province, as seen in Table 10. Still, the national data,
even when disaggregated to the provincial level, are fairly
consistent. There is only one province (Manitoba) for which
the percentages for the two surveys differ by more than 10%;
this may be due to a change in licensing regulations or
enforcement practices. However, some differences (even if
not large) are significant (Canada: α < .001; Quebec: α <
.001; Manitoba: α < .001).

Generally the results in Table 11 indicate a poor fit
between the 1961 Canadian Wildlife Service survey data and
the 1972 national survey data. It is not impossible that the
percentage of Canadian fishers increased from 11% in
1961 to 28% in 1972, but this explanation is not likely. It
should be noticed that the differences are all highly
significant (α < .001) in Table 11. What is important is
that one may reasonably hypothesize that the differences are
partly due to differences in the definitions of "fishing". In
fact, for the 1961 C.W.S. survey, a person was considered a
fisherman if one hour or more was spent sport fishing during
1961; for the 1972 national survey a respondent was
considered as having fished in 1972 if he said that he had
fished at least once in the past year. The 1961 C.W.S.
survey definition of fisherman seems much more restrictive
than the national survey definition. There is good reason to
believe that wives who accompanied husbands on fishing trips
would often say to national survey interviewers that they
fished (they had been fishing) in the past year. Regardless,
in all likelihood one or the other of the surveys (or both)
is not valid.

Comparing the 1970-71 estimate of anglers from sport
fishing licences in Canada with the percentage of fishers
according to the 1972 national survey gives the figures
shown in Table 12. It can be seen that the two
estimates of the numbers of Canadian fishermen differ
considerably, not only when provinces are considered but
even when Canada as a whole is considered. The 1970-71
estimates of anglers from licence sales are much closer to
the 1961 C.W.S. survey figures (see Table 11) than to the
1972 national survey percentages.

It is highly tempting to conclude from the Table 12
comparisons that the figures confirm the hypothesis that the
number of people fishing is much greater, probably twice as
great, as the number of Canadians who purchase fishing
licences. (Purchasing licences should not be taken literally
because "licence purchases" for Ontario are estimates after
1970, since no fishing licence is required of residents.)

Of course another and better explanation for the large
variation is that the differing definitions of fishermen in
the two surveys mean the survey results should not agree!

As with hunting, comparison of the U.S. 1970 national survey
data on fishing with the 1972 national survey data is of some
relevance in evaluating Canadian surveys. As can be seen
from Table 13, for salt-water fishing, Canadian and American
percentages are almost identical for both sexes. But when
fishing in general and fresh-water fishing in particular are
considered, the Canadian percentages are much higher than
the American percentages. It may be tempting to conclude
that, in fact, the percentages of Canadians who fish,
particularly in fresh water, are greater than the percentages
of Americans; but these facts may well indicate nothing more
than the differences in the questions asked.


SYNOPSIS OF FISHING COMPARISONS 


Most comparisons that were presented in this section
leave the reader with the impression that the national
surveys overestimate the percentage of Canadians who go
fishing. The 1972 survey gives a percentage of Canadian
fishers about three times greater than the percentage given
by a similar survey conducted in 1961 by the Canadian
Wildlife Service. The 1972 percentages of Canadians who fish
are about twice as high as the percentages given by a 1970-71
estimate of anglers from licence sales. The 1972
percentages of Canadian fishers are much higher than the 1970
American figures. This may be explained by speculative
causes such as (1) participation in fishing has greatly
increased in the last ten years, (2) people who go fishing
without a licence are as numerous as those who get a licence,
and (3) fresh-water fishing is in fact more popular in
Canada than in the U.S.A. However, since the results of the
1967 and 1972 surveys are very similar, it appears that the
"overestimate" is obtained consistently by national surveys.
So it can be concluded that the survey results on fishing by
Canadians are reliable (consistent), but their validity
should be questioned.
TABLE 9


PERCENTAGE OF CANADIANS 18 YEARS AND OVER
WHO DID PARTICIPATE IN FISHING
IN 1967, 1969, 1972
ACCORDING TO CORD NATIONAL SURVEYS


                              Fishing    Salt-Water   Fresh-Water
                                         Fishing      Fishing

1967 (N=5986)                   27%         -            -
1969 (N=2967)                    -          -            -
1972 (N=3002)                   31%         6%          29%

Maximum Difference (x)           4%
Standard Error of the
 Maximum Difference (YD%)       1.0%
Confidence Coefficient
 (Zc = x/YD%)                   4.00
Level of Significance         < .001

<><><><><>
TABLE 10


PERCENTAGE OF CANADIANS 18 YEARS AND OVER
WHO DID PARTICIPATE IN FISHING
IN 1967 AND 1972
BY REGION (ACCORDING TO NATIONAL SURVEYS DATA)


                    1967              1972          Difference
                  %       #         %       #           x

Canada           27     5986       31     3002          4%
Atlantic         33      625       31      295          2
 Provinces
Quebec           21     1742       29      870          8
Ontario          29     1972       30     1018          1
Manitoba         17      320       38      148         21
Saskatchewan     26      291       31      132          5
Alberta          24      433       29      251          5
British          31      603       33      288          2
 Columbia

                 Standard error   Confidence
                 of Difference    coefficient     Significance
                 YD%              Zc = x/YD%      Level α

Canada            1.0%             4.00           < .001
Atlantic          3.30             0.61           > .54
 Provinces
Quebec            1.76             4.55           < .001
Ontario           1.76             0.57           > .57
Manitoba          4.24             4.95           < .001
Saskatchewan      4.71             1.06           > .29
Alberta           3.48             1.44           > .14
British           3.28             0.61           > .54
 Columbia

<><><><><>
TABLE 11


COMPARISON OF PERCENT RESULTS
OF 1961 CANADIAN WILDLIFE SURVEY
AND 1972 NATIONAL SURVEY ON CANADIAN FISHERS


                                         Fresh-Water   Salt-Water
                             Fishing     Fishing       Fishing

C.W.S. 1961 Survey
 Age 14+ (N=30,000) (a)       10.8%       10.4%

National 1972 Survey
 Age 15+ (N=3255) (b)         28.3%       27.1%

Difference
 x = b-a                      17.5%       16.7%

Standard Error
 of Difference YD%             .63%        .63%          .32%

Confidence Coefficient
 Zc = x/YD%                   27.78       26.51         12.50

Significance Level α         < .001      < .001        < .001

<><><><><>
TABLE 12


COMPARISON OF PERCENTAGES OF CANADIAN FISHERS (ALL TYPES)
ACCORDING TO 1970-1971 ESTIMATE OF ANGLERS
FROM SPORT FISHING LICENCES AND 1972 NATIONAL SURVEY,
CANADA AND REGIONS


                 1970-71
                 Estimate of   1972
                 Anglers from  National
                 Licences      Survey
                 Sales, 16+*   Fishers, 16+              Difference
                 (u)%          (X)%            #         X% - u%

Canada           14.2          27.5           3155       13.3%
Atlantic         17.1          25.0            312        7.9
 Provinces
Quebec           13.2          25.6            917       12.4
Ontario          13.0          29.1           1069       16.1
Manitoba         13.8          30.7            153       16.9
Saskatchewan     17.3          25.9            139        8.6
Alberta          12.1          24.6            260       12.5
British          14.7          32.5            305       17.8
 Columbia

                 YD% = 100     Zc = (X% -        Significance
                 (pq/n)**1/2   u%)/YD%           Level α

Canada            0.78%         17.05            < .001
Atlantic          2.45           3.22            < .001
 Provinces
Quebec            1.45           8.55            < .001
Ontario           1.38          11.67            < .001
Manitoba          3.73           4.53            < .001
Saskatchewan      3.72           2.31            > .02
Alberta           2.67           4.68            < .001
British           2.68           6.64            < .001
 Columbia


* Licence sales % were calculated from the absolute number of
  licence sales given in Statistics on Sales of Sport
  Fishing Licences in Canada 1966-1971, Environment
  Canada, September 1973. Population 16 and over is given
  in 1971 Census of Canada, Statistics Canada, CAT.
  92-716. These figures are given in Appendix.
TABLE 13


PERCENTAGES OF FISHERS IN CANADA (1972 NATIONAL SURVEY)
AND IN U.S.A. (1970 NATIONAL SURVEY OF FISHING AND HUNTING)
ACCORDING TO SEX


                         U.S.A. 1970     Canada 1972
                         Age 14+         Age 12+
                         (N=8700)        (N=3681)
                         (a)             (b)

FISHING
  Total                   21.4%           31.5%
  Males                   32.7            44.5
  Females                 11.1            19.4
SALT-WATER FISHING
  Total                    6.1             5.5
  Males                    9.9             8.4
  Females                  2.6             2.7
FRESH-WATER FISHING
  Total                   18.9            30.3
  Males                   29.1            42.8
  Females                  9.7            18.6

                         Std Error of    Confidence
                         difference      Coefficient
                         YD%             Zc = x/YD%

FISHING                    .84%           12.02
SALT-WATER FISHING         .45%            1.33
FRESH-WATER FISHING        .84%           13.57

<><><><><>
GENERAL CONCLUSION


The national survey data seem quite reliable, i.e.
consistent from one survey to another, when hunting and
fishing data are compared. However, the hunting data seem
more valid than the fishing data when compared to other data
sources. As far as hunting data are concerned, only 7%
(4/60) of the comparisons show differences greater than 10%;
for fishing data, 45% (13/29) of the comparisons show
differences greater than 10%. While 47% (23/49) of the
significance levels are less than .05 for hunting, in the
case of fishing 74% (17/23) of the significance levels
are less than .05; i.e. 74% of the differences noticed
in the comparison of fishing data are highly significant.
While the hunting data seem reliable and valid, the fishing
data raise many questions which the data limitations do not
permit one to answer.
TABLE 15


ABSOLUTE NUMBERS FROM WHICH PERCENTAGES OF HUNTERS
WERE ESTIMATED FROM LICENCES SALES


                                          Hunting
                     1971 Population      Licences Sales
Regions              Age 16+*             1970-71**

Canada               14,742,225           2,322,800
Atlantic Region       1,336,335             247,700
Quebec                4,112,625             382,800
Ontario               5,344,935             617,000
Manitoba                681,560              93,000
Saskatchewan            625,380             163,800
Alberta               1,078,655             372,200
British Columbia      1,531,715             440,400

                     Small Game        Big Game          Waterfowl
                     Hunting           Hunting           Hunting
                     Licences Sales    Licences Sales    Permit Sales
Regions              1970-71**         1970-71**         1971***

Canada               1,044,300         1,048,800         395,622
Atlantic Region         98,100           141,000          49,414
Quebec                 211,700           160,300          49,001
Ontario                378,500           165,900         125,010
Manitoba                45,800            47,200          37,668
Saskatchewan            61,700           102,100          42,525
Alberta                 84,100           154,500          61,007
British Columbia       163,300           277,100          30,897

*   Source: 1971 Census of Canada, Statistics Canada, CAT.
    No. 92-715 & 92-716.
**  Source: Statistics Canada, Travel, Tourism and Outdoor
    Recreation: A Statistical Digest, 1972, CAT. No.
    66-202, Table 8.7, p. 94.
*** Source: Canadian Wildlife Service, Progress Notes, No.
    28, July 1972.

<><><><><>
TABLE 16


ABSOLUTE NUMBERS FROM WHICH PERCENTAGES OF PERSONS WHO FISH
WERE ESTIMATED FROM LICENCES SALES


                         Estimated Anglers
                         Age 16+
Regions                  1970-71 (Resident)

Canada                   2,025,000
Atlantic Provinces         228,000
Quebec                     543,000
Ontario                    696,000
Manitoba                    94,000
Saskatchewan               108,000
Alberta                    131,000
British Columbia           225,000


Source: Statistics on Sales of Sport Fishing Licences in
        Canada 1966-1971, Recreation Fisheries Branch,
        Department of the Environment, Ottawa, Ontario,
        September 1973.
APPENDIX
THE SIGNIFICANCE TESTS


To determine the level of significance two different
approaches were used, depending on whether the comparison was
made between two samples or between a sample and licence
sales data.


Comparison Between Two Samples


The level of significance α is given by the Normal
Curve Area Table once a confidence coefficient Zc = x/YD%
is obtained, where x is the difference observed between two
results, e.g. between the 1967 percentage of participants
and the 1972 percentage of participants. Incidentally, n1
and n2 are large, so one is not concerned with Student's
t distribution.

The standard error of the difference, YD%, is obtained by
the use of the following formula for proportions:


$Y_D\% = 100\,\bigl(pq\,(1/n_1 + 1/n_2)\bigr)^{1/2}$


WHERE p is the total percentage of participants in the
two surveys compared (expressed as a proportion),


q = 1 - p,


n1 = number in first sample, and


n2 = number in second sample.
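
The two-sample computation can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `two_sample_z` is hypothetical, and "total percentage of participants in the two surveys" is read here as the pooled proportion weighted by sample size, which the text does not spell out. The inputs are the Canada row of Table 10.

```python
import math

def two_sample_z(pct1, n1, pct2, n2):
    """Return (x, YD%, Zc) for two survey percentages, in percentage points.

    Assumption: p is the pooled proportion of participants across both
    samples, weighted by sample size.
    """
    p1, p2 = pct1 / 100.0, pct2 / 100.0
    p = (p1 * n1 + p2 * n2) / (n1 + n2)   # pooled proportion
    q = 1.0 - p
    se = 100.0 * math.sqrt(p * q * (1.0 / n1 + 1.0 / n2))  # YD%
    x = pct2 - pct1                       # observed difference
    return x, se, x / se

# Canada row of Table 10: 27% of 5986 (1967) against 31% of 3002 (1972).
x, se, zc = two_sample_z(27, 5986, 31, 3002)
print(f"x = {x:.1f}%  YD% = {se:.2f}%  Zc = {zc:.2f}")
```

Under this reading the standard error comes out close to the 1.0% printed in Table 10, and the confidence coefficient close to the printed 4.00 (the table appears to divide by the rounded standard error).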


Comparison Between a Sample and Licence Sales Data


In this case, the significance level is also given by
the Normal Curve Area Table once a confidence coefficient
Z(c) has been obtained by the use of the following formula:


$Z_c = (X\% - u\%)/Y_D\%$


WHERE X% is the percentage of participants in
the national survey,

u% is the percentage of licence holders, and

YD% is the standard error of the difference
between the national survey results and the
proportion of licence holders.


The standard error of the difference YD% was
calculated by the use of the following formula:


$Y_D\% = 100\,(pq/n)^{1/2}$


WHERE p is the percentage of participants in the survey
sample (expressed as a proportion),

q = 1 - p, and

n = number in survey sample.


If the level of significance of .05 was chosen, a
difference was considered significant, i.e. not due to chance,
if the probability of the difference was less than .05; i.e.
there are fewer than 5 chances in 100 that the difference is
a chance difference due to sampling. For example, in Table 1
the maximum difference of 3% which is noticed between the
1967 and 1972 national surveys is significant at > .002,
i.e. there are little more than 2 chances in 1000 that the
3% difference noticed is due to sampling.
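
As a rough check on the sample-versus-licence formula, the Saskatchewan row of Table 12 can be recomputed as below. The function names are hypothetical, and the two-tailed reading of the Normal Curve Area Table is an assumption inferred from the printed levels (for example Zc = 1.00 paired with "> .31"), not something the appendix states.

```python
import math

def sample_vs_licence_z(x_pct, u_pct, n):
    """Return (YD%, Zc) for a survey percentage X% against a licence
    percentage u%, using YD% = 100 (pq/n)**1/2 with p taken from the survey."""
    p = x_pct / 100.0
    se = 100.0 * math.sqrt(p * (1.0 - p) / n)
    return se, (x_pct - u_pct) / se

def two_tailed_level(zc):
    """Area of the standard normal curve beyond +/-|zc| (assumed two-tailed)."""
    return math.erfc(abs(zc) / math.sqrt(2.0))

# Saskatchewan, Table 12: X% = 25.9, u% = 17.3, n = 139.
se, zc = sample_vs_licence_z(25.9, 17.3, 139)
print(f"YD% = {se:.2f}  Zc = {zc:.2f}  level = {two_tailed_level(zc):.3f}")
```

The resulting level is slightly above .02, which agrees with the "> .02" entry for that row.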
CORD STUDY TECHNICAL NOTE 38
QUICK ESTIMATES OF rie poet oe BENEFITS 
J. Beaman, L. Lehtiniemi, V.K. Smith

ABSTRACT 


This paper shows that if a travel model is used in which
visitor flow equals the product of other variables and
exp(-kD(o,d)), where k is an "impedance of distance" constant
and D(o,d) is one-way travel distance from origin to
destination for main-destination trips of a given purpose,
then per capita consumer surplus values can be computed very
easily. They are computed from the cost per two-way mile of
travel, C, and k: per capita consumer surplus is simply C/k.
Thus quick estimates of primary benefits (consumer surplus)
are obtained simply by getting the site "demand function",
without going through the approach described by Knetsch,
Clawson and others to get an aggregate demand function.

Some reasons for using the quick estimation approach
endorsed are presented in relation to (1) theoretical
justification for the travel model suggested and (2) avoiding
arbitrary decisions that must be made using the usual
approach to defining a Clawson-Knetsch-Hotelling demand
function. The matters of (1) including time bias in the
model, (2) the effect of admission fees on identifying
time-bias constraints, and some other issues are pursued.


INTRODUCTION 


The purpose of this paper is to derive a simple
relationship between the primary benefits associated with
outdoor recreation and a class of demand relationships which
have been used with considerable success in
origin-destination recreation studies for Canadian Outdoor
Recreation Demand (CORD). This derivation is especially
useful because it follows directly from the assumed per mile
costs of travel (including time) and the estimated effect of
these costs on visitor use of the recreation site.

In what follows, attention is focused on the demand for
a single recreation site. Substitute facilities are assumed
non-existent and the total level of use of the area is
assumed to be less than its economic carrying capacity. (For
a definition of the meaning of economic carrying capacity
and its relationship to economic carrying capacity, see
Reference 19.) Since the economic rationale for the travel
cost model has been developed in a number of previous
studies (see Reference 53), the focus of this paper is
directed to the derivation of a consumer surplus measure for
the total population of users of the recreation site of
interest.


DEMAND AND CONSUMER SURPLUS 


An individual demand curve is the locus of maximum
prices that the individual would be willing to pay for each
corresponding level of consumption of the good or service
involved. Given a single price rationing system, the net
benefit to each individual from the provision of the
service of interest at a particular price is the sum of the
excess willingness to pay at each quantity. This benefit is
the area under the income-compensated demand curve less the
amount paid for the good or service. These net benefits are
generally designated consumer surplus and will be the
benefit measure utilized here.

Consider a class of demand functions for a particular
recreation site as given in Equation 1.


(1)  $V(j) = \exp(-\xi\,TC(j))\,f(Z(j))$

WHERE

V(j) = the number of trips to the site from origin j
during a given time period,

TC(j) = the total per unit (trip) costs associated
with engaging in recreation at the site from origin j,

Z(j) = set of additional determinants of demand for
origin j,

ξ = change in demand with a change in cost per visit,
$(\partial V/\partial TC)/V(j)$.


If total unit costs are proportional to distance and the
proportionality factor is constant for all origin zones,
then Equation 1 can be re-expressed in terms of distance.
This transformation is relevant to many CORD studies, since
the models have been estimated in terms of distance rather
than per unit total costs. Thus assume that total unit costs
are given as the sum of travel costs and time costs as in
Equation 2.


(2)  $TC(j) = C\,d(j) + t\,d(j)$

WHERE

C = per mile travel costs,

d(j) = roundtrip distance to the site from origin j, and

t = time costs per mile.
Equation 1 can be re-written in terms of distance as follows:


(3)  $V(j) = \exp(-\theta\,d(j))\,f(Z(j))$

WHERE: $\theta = \xi\,(C + t)$.


The total use of a given site with given costs can then be
derived by summing Equation 3 over all origins as in
Equation 4.


(4)  $TU = \sum_j \exp(-\theta\,d(j))\,f(Z(j))$

WHERE the sum is over j.


Consider the effects of a change in unit costs through the
introduction of an admission charge, A. The changed total
use is given in Equation 5.


(5)  $DTU = \sum_j \exp(-\theta\,d(j) - \xi A)\,f(Z(j))$


Since A is constant, by assumption, for all origin zones, DTU
can be expressed in terms of TU as follows:


(6)  $DTU = \exp(-\xi A)\sum_j \exp(-\theta\,d(j))\,f(Z(j)) = \exp(-\xi A)\,TU$
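
The factoring step in Equation 6 can be checked numerically. The sketch below is illustrative only: the origin distances, the f(Z(j)) values and the parameter values are hypothetical, chosen simply to show that a uniform admission charge scales total use by exp(-ξA).

```python
import math

def total_use(theta, origins, xi=0.0, fee=0.0):
    """Sum exp(-theta*d(j) - xi*A) * f(Z(j)) over origins (Equations 4 and 5).

    `origins` is a list of (d, f) pairs: roundtrip distance and the value
    of f(Z(j)) for each origin zone.
    """
    return sum(math.exp(-theta * d - xi * fee) * f for d, f in origins)

# Hypothetical origin zones: (roundtrip miles, f(Z(j))).
origins = [(50.0, 1200.0), (120.0, 3000.0), (300.0, 8000.0)]
theta, xi, fee = 0.01, 0.25, 2.0   # hypothetical parameter values

tu = total_use(theta, origins)              # Equation 4
dtu = total_use(theta, origins, xi, fee)    # Equation 5
print(dtu, math.exp(-xi * fee) * tu)        # the two agree (Equation 6)
```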


If we wish to measure the consumer surplus effects of
changes in the admission fees, then we need only integrate
Equation 6 over a prespecified region for A as in Equation
7.


(7)  $CS = \int_0^{MP} TU\,\exp(-\xi A)\,dA = TU\,(1 - \exp(-\xi\,MP))/\xi$

WHERE the integral is from 0 to MP (maximum price).


It should be noted that in Equation 7 it is assumed that all
other components of price remain unchanged and one considers
variations only in the admission charge. Movement from
(C+t)d(j) to (C+t)d(j)+MP represents an increase in unit
costs and therefore a decrease in use (DTU) and an associated
loss in consumer surplus (CS).

On a per trip basis (PCS), the consumer surplus for
this model is found to be a function solely of the estimated
parameter, ξ, and the maximum admission charge, MP, as given
in Equation 8.


(8)  $PCS = (1 - \exp(-\xi\,MP))/\xi$

It is therefore readily computed for any given site. Since
many of the CORD models are specified as in Equation 3,
direct estimates of ξ are not available. Rather one must
specify the unit travel and time costs (i.e. C and t).


Equation 8 can be re-written in terms of θ as follows:


(9)  $PCS = \bigl(1 - \exp(-MP\,\theta/(t+C))\bigr)/\bigl(\theta/(t+C)\bigr)$
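
Equations 7 to 9 lend themselves to a short numerical sketch. Everything named below is hypothetical (function names and parameter values alike); the midpoint-rule integral is included only as a sanity check that the closed form in Equation 7 matches the area under exp(-ξA)TU.

```python
import math

def consumer_surplus(tu, xi, mp):
    """Equation 7: CS = TU (1 - exp(-xi*MP)) / xi."""
    return tu * (1.0 - math.exp(-xi * mp)) / xi

def per_trip_surplus(xi, mp):
    """Equation 8: PCS = (1 - exp(-xi*MP)) / xi."""
    return (1.0 - math.exp(-xi * mp)) / xi

def per_trip_surplus_from_theta(theta, c, t, mp):
    """Equation 9: substitute xi = theta / (t + C) into Equation 8."""
    return per_trip_surplus(theta / (t + c), mp)

def midpoint_integral(tu, xi, mp, steps=100_000):
    """Numerical integral of TU exp(-xi*A) for A from 0 to MP."""
    h = mp / steps
    return h * sum(tu * math.exp(-xi * (i + 0.5) * h) for i in range(steps))

# Hypothetical values: 10,000 trips, xi = 0.25 per dollar, MP = $20.
tu, xi, mp = 10_000.0, 0.25, 20.0
print(consumer_surplus(tu, xi, mp), midpoint_integral(tu, xi, mp))

# Hypothetical distance-model inputs: theta per mile, C and t in $ per mile.
theta, c, t = 0.03, 0.08, 0.04
print(per_trip_surplus_from_theta(theta, c, t, mp))
```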
IMPLICATIONS


The results of this derivation suggest that for a given
class of demand functions it is possible to readily estimate
the consumer surplus losses (or gains) associated with
increases (or decreases) in the unit costs of a recreation
trip that are independent of distance. Moreover, these
estimates can readily be derived from estimated equations
based on distance and not total unit costs, provided the
travel and time costs can be assumed to be proportional to
distance. Since the models on which estimates of ξ are
based have been reasonably successful in explaining visitor
use patterns at Canadian parks, the simple computational
formula provided should offer a benchmark for ready
calculation of the potential benefits associated with
reductions to user fees.

It is nonetheless true that this derivation requires a
number of simplifying assumptions. Among the most important
of these are the absence of substitute facilities and of
on-site congestion. In addition, the measures of consumer
surplus are only relevant for the same dollar change in the
unit costs of users at all origin zones. If the price
structure changes differentially for consumers at different
origin zones, then the analysis is of limited relevance.


APPENDIX 
Jay Beaman and N. Hung Do


In the preceding paper a number of matters have not
been carried to their "logical" end because of objections that
can be raised, while other considerations were not pursued
because their pursuit would constitute a deviation from the
main theme of the paper. Rather, it was decided to have an
appendix to the paper where, in a point by point way, the
lines of consideration that were only raised in the paper
are pursued to some extent.


OBTAINING ACTUAL PER CAPITA CONSUMER SURPLUS VALUES 


In the paper, formulae were developed that depend on
some "highest price that would be paid". Defining such a
price is important if one is using distance functions such
as the inverse of distance to some power, for which integrals
may not converge. But for the exponential function, if MP,
the maximum price, is reasonably large so that most of the
area under the demand curve occurs "before" MP, then one
might as well consider the results obtained by letting MP
approach infinity.

If one simply lets MP approach infinity then one
obtains consumer surplus, CS, as:


(A1)  $CS = \int_0^{\infty} TU\,\exp(-\xi A)\,dA = TU/\xi$

WHERE the integral is from 0 to "infinity".


To obtain this on a per visit basis (PCS), one divides by
TU; thus the consumer surplus is given by:


(A2)  $PCS = 1/\xi$


The per capita consumer surplus for the model depends solely
on the estimated parameter ξ, which in reality can be
interpreted as:


(A3)  ξ = "resistance of distance"/cost per mile of travel


The interpretation given above is based on the fact that
when one goes from a price formulation to the distance
formulation which is usually used in carrying out
regressions, one has:


(A4)  $\exp(-\xi A) = \exp(-l\,A/C)$


WHERE in terms of TN 14, l is the impedance of distance and
where C is cost per two-way mile (further described on
Figure 1). So, as described by Beaman and Lehtiniemi (see
Reference 2), if one has l values such as those given in
Figure 1, one can get quick estimates of per capita consumer
surplus such as those shown by using:


(A5)  $PCS = 1/\xi = C/l$
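
A quick estimate by Equation A5 is a single division, as the sketch below shows. The cost and impedance figures are hypothetical (they are not taken from Figure 1), and the second function is included only to show that the finite-MP formula of Equation 8 approaches C/l once MP is large.

```python
import math

def quick_pcs(cost_per_two_way_mile, impedance):
    """Equation A5: per capita consumer surplus PCS = C / l."""
    return cost_per_two_way_mile / impedance

def pcs_with_max_price(cost_per_two_way_mile, impedance, mp):
    """Equation 8 with xi = l / C, for comparison with the limiting value."""
    xi = impedance / cost_per_two_way_mile
    return (1.0 - math.exp(-xi * mp)) / xi

# Hypothetical inputs: C = $0.10 per two-way mile, l = 0.02 per mile.
c, l = 0.10, 0.02
print(quick_pcs(c, l))                  # about $5 per visit
print(pcs_with_max_price(c, l, 100.0))  # nearly the same with MP = $100
```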


ARC ELASTICITY CONSIDERATIONS 


One may note that with the type of demand function
defined earlier, the arc elasticity of demand with respect
to admission fees is less than one in absolute value, or in
other words reflects inelasticity. Since the arc elasticity
is defined as:


$\varepsilon = \dfrac{\text{Variation of demand} \times \text{Average of price}}{\text{Variation of price} \times \text{Average of demand}}$


one obtains for the model:


$\varepsilon = \dfrac{\exp(-\xi A)\,TU - TU}{A} \cdot \dfrac{(A + 0)/2}{(\exp(-\xi A)\,TU + TU)/2} = \dfrac{\exp(-\xi A) - 1}{\exp(-\xi A) + 1}$


which is less than zero and greater than -1 but approaches
-1 as A becomes large.
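
The arc elasticity result can be verified directly from its definition. In the sketch below the function names and the value of ξ are hypothetical; the first function applies the definition between an admission charge of zero and A, and the second is the closed form derived above.

```python
import math

def arc_elasticity(xi, a, tu=1.0):
    """Arc elasticity between price 0 and price A for DTU = exp(-xi*A) TU."""
    d0, d1 = tu, math.exp(-xi * a) * tu
    variation_demand, variation_price = d1 - d0, a - 0.0
    average_price, average_demand = (a + 0.0) / 2.0, (d1 + d0) / 2.0
    return (variation_demand * average_price) / (variation_price * average_demand)

def arc_elasticity_closed_form(xi, a):
    """(exp(-xi*A) - 1) / (exp(-xi*A) + 1)."""
    e = math.exp(-xi * a)
    return (e - 1.0) / (e + 1.0)

xi = 0.25  # hypothetical value
for a in (0.5, 2.0, 10.0, 40.0):
    print(a, arc_elasticity(xi, a), arc_elasticity_closed_form(xi, a))
```

For every charge tried, the value lies between -1 and 0 and moves toward -1 as A grows.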


As such, |ε| < 1 means that the demand is inelastic with
respect to admission fees regardless of the value of A and
ξ. This result could be considered consistent with the
findings of McConnell and Duff (see Reference 40). They
report that the demand function is more elastic with respect
to travel costs than it is with respect to admission
charges. However, the problem is what is the elasticity
with respect to travel costs? If one says that the A
considered previously is travel costs rather than admission
charges and uses the same demand function, or introduces
an AT component, giving


$DTU = \exp(-\xi\,(A + AT))\,TU$


then obviously travel and admission cost elasticities are
the same when comparable changes in A and AT are considered.
Only if time bias is explicitly considered (as is done
subsequently) does one obtain results on which the statement
of McConnell and Duff can be checked.


POINT ELASTICITY AND THE MARGINAL EFFECT 
OF A CHANGE IN PRICE ON DEMAND 


Point elasticity for the demand function defined by
Equation 6 is:


$(\partial DTU/\partial A)\,(A/DTU) = -\xi A\,\exp(-\xi A)\,TU/DTU = -\xi A\,(DTU/DTU) = -\xi A$


However, it is considered that this negative elasticity,
which goes to infinity as price is increased, is deceptive.
The reason is that one should consider the following:


MC = the limit of proportional change in DTU/change in price

$MC = (\partial DTU/\partial A)/DTU = -\xi$


The fact that MC is equal to a constant means that
charging a dime more causes the same proportional change in
DTU whether A has been doubled from 10 to 20 cents or
increased from $100.00 to $100.10. This latter way of
expressing how demand is changing with price is crystal
clear in terms of what is meant: a certain percentage change
in demand is associated with a particular change in price.
What is really important to note is that because MC does not
change with A, behaviourally it can be considered to reflect
a person's belief that (say) 10 cents is just as important
if one spends $1.00 on a certain type of trip as if one
spends $500 on that kind of trip. Intuitively one may feel
that if one was willing to spend $500, ten cents would not
mean as much as it does when it amounts to a 10% increase in
trip costs. So there may be good behavioural grounds on
which to object to the demand function based on the
exp(-ξD) distance function.
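
The contrast between the point elasticity and MC can be seen numerically. The sketch below uses a central finite difference with hypothetical values of ξ and TU; it is meant only to illustrate that MC stays at -ξ at any price level while the point elasticity -ξA grows with A.

```python
import math

def dtu(a, xi, tu=1000.0):
    """Equation 6: use after an admission charge A."""
    return math.exp(-xi * a) * tu

def marginal_change(a, xi, h=1e-4):
    """MC = (dDTU/dA) / DTU, estimated by a central finite difference."""
    slope = (dtu(a + h, xi) - dtu(a - h, xi)) / (2.0 * h)
    return slope / dtu(a, xi)

def point_elasticity(a, xi):
    """(dDTU/dA) * (A / DTU), which equals MC * A."""
    return marginal_change(a, xi) * a

xi = 0.05  # hypothetical value, per dollar
for price in (0.10, 100.00):
    print(price, marginal_change(price, xi), point_elasticity(price, xi))
```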

How MC varies for other distance functions can be
inferred from Figure 2 of TN 14. This is because the
definition of MC here is basically equivalent to the
definition of "impedance of distance", IDF(d), in TN 14.
For example, the traditional distance part of a gravity
function, d**a, has an impedance function (1 - a)/D, which in
terms of cost would imply that for people who travel further
than D an added cost, such as an admission cost A, would be
introduced into the denominator of the impedance function as
follows:


$MC = (1 - a)/(D + A/C)$,  a > 1

or in cost terms where A1 = CD:

$MC = C\,(1 - a)/(A1 + A)$  for a > 1


For this function ten cents does have less effect on
behaviour when total trip expenditures are large rather than
small!


From a practical perspective three points, which are not
introduced in more detail later, show the lack of immediate
importance of the issues just raised. Firstly, over the
range of distance in which "trip types" can be considered
constant, all distance functions are fairly similar in shape
and can even be considered to have a "relatively constant"
average impedance of distance. One can see the similarity of
distance functions in Figure 1 of TN 14. The problem just
cited is a collinearity problem and relates to a second
concern. Because of collinearity problems it is not likely
that a structurally adequate trip distribution model can be
identified using the data usually employed to calculate such
models. Getting a correct consumer surplus depends on
getting a correct model, particularly when dealing with
non-isolated sites. Clearly in the context of TN 11, even if a
correct per capita consumer surplus can be computed, there is
a need to assess (1) to which sites it will be distributed
and (2) how total visits made in a system will be influenced
by the "supply generates demand" effect. The third problem
is that the formulation presented is for isolated sites. To
the knowledge of these authors only Knetsch and Cheung have
used a systems model (albeit a rather naive one) to get a
demand function for a proposed non-isolated site.


TIME BIAS 


It can be noted that the demand function defined in 
Equation 3 has a term which relates to value of time. This 
is because of the problem of time bias (see Reference 9). 
However, actually estimating time bias effects raises some 
interesting estimation problems. For example, the Cesario 
model (see TN 4) can be written as follows with hard time 
considered: 


(A6)  V(o,d) = A(d)E(o) exp(-£(D(o,d) + kt(o,d) + C(d)/c)) 


WHERE t(o,d) = travel time, 

      C(d)   = admission charge, and 

      c      = per two-way mile travel costs. 


But, by using an average velocity figure, v, kt(o,d) can 
be re-written as: 


(A7)  kt(o,d) = k(1)D(o,d) 
      WHERE k(1) = kv. 


Now by taking logs of both sides of the equation, as one 
would to carry out a regression, one gets: 


(A8)  ln V(o,d) = ln A(d) + ln E(o) - (£ + £ k(1)) D(o,d) 
                  - £ C(d)/c 


But because both ln A(d) and £ C(d)/c are destination 
effects, they can and must be combined: 


(A9)  ln V(o,d) = a'(d) + ln E(o) - (£ + £ k(1)) D(o,d) 
      WHERE a'(d) = ln A(d) - £ C(d)/c. 


The parameters of an equation written in this form can 
usually be estimated. However, as noted, k(1) cannot be 
determined separately from £, so money and time costs are not 
separable because of collinearity. This is because, if 
parameters are to be estimated, there cannot be a linear 
relation between time bias and attractiveness that allows 
one to be explained by the other (the design matrix must be 
of full rank, see Reference 50). However, from Equations A6 
to A9 one sees that as long as travel time is roughly 
proportional to D(o,d) and C(d) reflects costs rather than 
C(o,d), the terms in Equation A6 can be grouped as in 
Equation A9 and thus the critical time bias parameter k(1) 
cannot be identified. Thus there appears to be little 
likelihood of an empirical derivation of the time bias 
constant! In fact, experiments were carried out using a model 
based on Ontario in which t(o,d) was considered to be a 
non-linear function of D(o,d) (D(o,d) to powers of 1/2 to 
[illegible] were used). It was still found that money costs 
and time costs were too highly correlated to allow one to get 
good estimates of k(1). 
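
The identification problem can be illustrated with a short 
sketch. The Python below is an illustration only and is not 
part of the CORD Study computations: the function name 
log_visits, the name xi (standing in for the distance 
parameter written £ above) and all numerical values are 
assumptions made for the example. It shows that two different 
pairs (£, k(1)) having the same combined slope £ + £ k(1) give 
the same predictions under Equation A9, so that data of this 
kind cannot tell them apart. 

```python
import math


def log_visits(a_dest, log_e_origin, xi, k1, distance):
    """Equation A9: ln V(o,d) = a'(d) + ln E(o) - (xi + xi*k1) * D(o,d)."""
    return a_dest + log_e_origin - (xi + xi * k1) * distance


# Hypothetical distances and hypothetical destination and origin terms.
distances = [10.0, 25.0, 40.0, 80.0, 150.0]
a_dest, log_e_origin = 2.0, 6.0

# Two different parameter pairs with the same combined slope of 0.06.
pair_one = (0.05, 0.2)   # xi = 0.05, k(1) = 0.2
pair_two = (0.04, 0.5)   # xi = 0.04, k(1) = 0.5

pred_one = [log_visits(a_dest, log_e_origin, *pair_one, d) for d in distances]
pred_two = [log_visits(a_dest, log_e_origin, *pair_two, d) for d in distances]

# The predictions coincide, so the data cannot separate xi from k(1).
indistinguishable = all(
    math.isclose(p, q, rel_tol=1e-12) for p, q in zip(pred_one, pred_two)
)
print(indistinguishable)
```

Only when some observed quantity moves the money cost without 
moving the time cost in step (for example differing admission 
fees) would the two pairs give different predictions. 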

At least one gets Equation A9. Unless a person has data 
on visitor use of one or more destinations for different 
admission fees (or unless some parks charge different rates 
for people from different origins), time bias (k) cannot be 
identified using Equation A6. 


ELASTICITY WITH TIME BIAS 


Earlier discussion cited a view by Duff and McConnell on 
elasticity. However, it was indicated that their statement 
lacked meaning until time bias was introduced into the 
demand function formulation. So now consider that the 
expression derived earlier means that the arc elasticities 
which are relevant are: 


     EA = | (exp(-£ k(1)A) - 1)/(exp(-£ k(1)A) + 1) | 
     ET = | (exp(-£ A) - 1)/(exp(-£ A) + 1) | 

     EA = admission charge elasticity 

     ET = combined time-distance cost elasticity. 


Solving for the intersection of these two functions (EA = 
ET) one sees that for k(1) not equal to one they only 
intersect at zero and infinity. From the derivatives of the 
functions, one learns that the slope of EA is less than that 
of ET, and that neither slope changes sign but both go to 
zero as A becomes large. So it follows that as soon as A is 
non-zero: 


EA < ET 


In economic terms the preceding means that the demand 
function is more inelastic with respect to admission fee 
than it is with respect to total travel costs. So this 
result is not fundamentally inconsistent with that found by 
McConnell and Duff. They simply stated that the decrease in 
demand due to an increase in travel costs is stronger than 
that due to an increase in admission prices. However, 
one wonders if they were referring to short-term reactions 
to price changes rather than to the equilibrium demand 
elasticities computed here. 
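
The comparison of the two arc elasticities can be checked 
numerically. The following Python sketch is illustrative only: 
the function name arc_elasticity and the values chosen for the 
parameter written £ (here xi) and for k(1) are assumptions, 
with k(1) taken to be below one, which is the case in which 
EA < ET holds. 

```python
import math


def arc_elasticity(rate, charge):
    """|(exp(-rate*charge) - 1) / (exp(-rate*charge) + 1)|."""
    decay = math.exp(-rate * charge)
    return abs((decay - 1.0) / (decay + 1.0))


xi = 0.25   # hypothetical value of the parameter written £ in the text
k1 = 0.4    # hypothetical time bias constant, assumed to be below one

charges = [0.5, 1.0, 2.0, 5.0, 10.0]
ea = [arc_elasticity(xi * k1, a) for a in charges]  # admission charge
et = [arc_elasticity(xi, a) for a in charges]       # time-distance cost

for a, e_adm, e_trv in zip(charges, ea, et):
    print(f"A = {a:5.2f}  EA = {e_adm:.4f}  ET = {e_trv:.4f}")
```

Both functions start at zero when A is zero and rise toward 
one, and for every positive A in the list EA stays below ET. 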


AVENUES TO PURSUE IN EXAMINING 
THE DEMAND FUNCTION'S VARIATION OVER TIME 


When time bias is introduced into the demand function 
one has a demand function DTU(t) such that: 


     DTU(t) = TV exp(-£ k(1)A) 

but as earlier £ = l/c by the modelling assumptions so: 

     DTU(t) = TV exp(-k(1) l A/c) 

Given this expression one may see that: 

     Proportional change in DTU(t) = d[DTU(t)]/DTU(t) 

                                   = d ln DTU(t) 

        = SUM (d(ln DTU(t))/d(parameter))(change in parameter) 

        = P DTU(t) 

WHERE the sum is over all parameters. 


(A10)  P DTU(t) = (-W/c) d[c] + W(d[l]/l + 
                  d[k(1)]/k(1) + d[A]/A) 


       WHERE W = - k(1) l A/c 


Or where pc, pk(1), pl and pA refer to proportional 
change in the parameters: 


     P DTU(t) = W(-pc + pl + pk(1) + pA) 
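
The first-order relation for the proportional change in 
DTU(t) can be compared with the exact change computed from 
the demand function itself. The Python below is a sketch under 
stated assumptions: the function names and every numerical 
value are hypothetical, TV is held fixed, and W is taken as 
- k(1) l A/c as defined above. 

```python
import math


def dtu(tv, k1, impedance, charge, cost_per_mile):
    """DTU(t) = TV exp(-k(1) l A / c)."""
    return tv * math.exp(-k1 * impedance * charge / cost_per_mile)


def proportional_change(k1, impedance, charge, cost_per_mile,
                        pc=0.0, pl=0.0, pk1=0.0, p_a=0.0):
    """First-order change W(-pc + pl + pk(1) + pA), with W = -k(1) l A / c."""
    w = -k1 * impedance * charge / cost_per_mile
    return w * (-pc + pl + pk1 + p_a)


# Hypothetical base values: k(1), l per mile, A and c in dollars.
base = dict(k1=0.5, impedance=0.05, charge=2.00, cost_per_mile=0.28)
before = dtu(1000.0, **base)

# A 5 percent rise in the admission charge A, all else held fixed.
approx_a = proportional_change(**base, p_a=0.05)
after_a = dtu(1000.0, **{**base, "charge": base["charge"] * 1.05})
exact_a = math.log(after_a / before)

# A 5 percent rise in the travel cost c, all else held fixed.
approx_c = proportional_change(**base, pc=0.05)
after_c = dtu(1000.0, **{**base, "cost_per_mile": base["cost_per_mile"] * 1.05})
exact_c = math.log(after_c / before)

print(approx_a, exact_a)
print(approx_c, exact_c)
```

Because ln DTU(t) is linear in A, the first-order figure for a 
change in A agrees with the exact change in ln DTU(t); for a 
change in c the two differ by a second-order amount. 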


Equation A10 makes clear the relation between change in 
demand on which longitudinal information is available and 
other variables on which either longitudinal information is 
available and/or on which one may be willing to postulate 
what changes will be (or have been). Examination of this 
function for an isolated park for which use has not been 
altered by the creation of alternative supply could prove 
very interesting. In particular the equation could be 
important in predicting short-term response to changes in 
the demand function parameters because in a year or two the 
supply affecting most parks changes little. In the short-run 
circumstances just indicated it is probably even appropriate 
to use the function to analyze changes in use of 
non-isolated parks as long as there is not a change in A 
involved that alters the role of the park with respect to 
alternative supply. 


                          FIGURE 1 


     CONSUMER SURPLUS VALUES IN DOLLARS PER PARTY TRIP 
     FOR DIFFERENT TYPES OF VISITS TO CANADIAN PARKS* 


Consumer 
Surplus                              Impedance 
in $ per                             of 
Party                                Distance 
Trip       Type of Trip              (l)         Source 

           Main Destination 
           Camping Trip 
           to Camp in a 
           Publicly Provided 
39.49      Campsite**                .00708      Wang*** 

           Main Destination 
 3.97      Day-Use                   .0706       TN 7 
 4.83      Trip for Weekend          .058        TN 35 
 6.36      Holiday trips**           .044        TN 35 
 5.60                                .050        TN 35 

 9.33      Enroute Stop-Over         .003        TN 18 
           Camping Visit 


  *  All computations are based on "the average party" having 
     the belief that all relevant trip costs are such that 
     each mile of travel costs 14 cents. Thus, C, the cost 
     per 2-way mile (each mile must be covered coming and 
     going), is 28 cents. 


 **  These figures should be taken to apply to weekend and 
     holiday trips because, even though the studies did not 
     separate out trips after work, the number of these is so 
     small compared to other trips that they should only 
     cause a slight low bias in the figures presented. 


***  The report is "A Prediction Model for Camping 
     Visitation", by Darsan Wang, Report 158 of Department of 
     Tourism Recreation and Cultural Affairs, Province of 
     Manitoba. 
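

The day-use figures in Figure 1 can be related to the demand 
function DTU(t) = TV exp(-k(1) l A/c). With k(1) taken as one, 
integrating the function over A from zero to infinity and 
dividing by TV gives a surplus of c/l per party trip. That 
reading is an assumption of the following Python sketch rather 
than a statement made in Figure 1, and the function name is 
hypothetical; the sketch simply compares c/l with the printed 
values for the four day-use rows, using the 28 cent cost per 
2-way mile from the first footnote. 

```python
COST_PER_TWO_WAY_MILE = 0.28  # dollars, from the first footnote to Figure 1


def surplus_per_party_trip(impedance, cost=COST_PER_TWO_WAY_MILE):
    """Integral of exp(-l*A/c) over A from 0 to infinity, which is c/l."""
    return cost / impedance


# (impedance of distance, printed consumer surplus) for the day-use rows.
day_use_rows = [(0.0706, 3.97), (0.058, 4.83), (0.044, 6.36), (0.050, 5.60)]

for impedance, printed in day_use_rows:
    computed = round(surplus_per_party_trip(impedance), 2)
    print(f"l = {impedance:<7} computed = {computed:.2f}  printed = {printed:.2f}")
```

The camping and stop-over rows are left out of the comparison 
because their impedances as printed do not reproduce the 
printed surplus to the cent. 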




28 CHAPTER Vil page 467 


CORD STUDY TECHNICAL NOTE 21 


A METHOD OF MONITORING PARK USE 
A DISCUSSION PAPER 


N.H. Coomber and J. Beaman 


ABSTRACT 


There have been numerous surveys carried out in which 
information was obtained from users of parks. This paper 
reports on how experience beginning with the Canadian 
Outdoor Recreation Demand Study park user surveys in 1968 
and ending with National Park park user surveys in 1974 has 
prompted the development and modification of various 
information gathering procedures. 

The report presents specifics on why certain procedures 
were carried out in collecting data and in weighting these 
data. As well there is a management perspective presented 
which raises questions about the need for the data that have 
been collected and about more efficient means for collecting 
information. There is an underlying theme in the paper that 
for efficiency there must be a concern with the objectives 
of collecting information and with the desired or necessary 
accuracy of the information to be generated from surveys. 
These matters must be pursued before survey work is carried 
out. 


OBJECTIVE 


This paper describes the survey methods that have been 
developed and used by Parks Canada during the last five 
years for monitoring visitor use of Canadian National Parks. 
In describing the strengths and weaknesses of the procedures 
adopted, the objective is to evoke constructive critique 
that will result in the improvement or replacement of the 
survey methods. 


INTRODUCTION 


In 1970, the Park Use Research Section of the Planning 
Division of Parks Canada embarked on a five-year research 
program to aid planning and managerial decision making by 
developing standardized and economical methods of 
calculating the number of visitors to National Parks and of 
discerning a profile of park visitors and their activities. 
Plans were made to begin a three-year cycle of surveys in 
1971 so that by 1973 users in all National Parks would have 
been surveyed. Part of the plan was to use a standard survey 
methodology, but major changes in the survey approach were 
made after the first year. This paper is particularly 
concerned with the surveys made in 1972 and 1973, which took 
place at the parks identified in Figure 1. 

The objectives of the surveys were twofold. First, they 
were designed to obtain traffic partition rates for the park 
gates of the National Parks studied. These were considered 
very important because estimates of the numbers of park 
visitors in future years were to be calculated using simple 
traffic counts. Second, the surveys were to obtain activity 
and socio-economic profiles of visitors to each park for 
general managerial and planning purposes. 


GENERAL SURVEY STRATEGY 


A survey method was developed which involved speaking 
to persons in 'selected' motor vehicles which entered 
National Parks by any of the major points of entry. In 
1972-3 Parks Canada's concern was with visitors entering 
during the peak use season of June 1 to Labour Day. Thus at 
selected times of selected days during this period survey 
stations were located as close as possible to appropriate 
park boundaries, and workers at these 'entrances' recorded 
information in the way described below. 

However, before describing the general strategy, one 
should note that the nature of visitor traffic in National 
Parks (and some other matters) was the cause of a number of 
concerns in developing a survey design. 


     a.   Traffic volumes varied greatly between 
          different park gates. 


     b.   Traffic volume varied greatly on different days 
          of the week. 


     c.   Traffic content in terms of, for example, the 
          proportions of Canadian visitors in the 
          entering traffic, varied over the summer. 


     d.   Information was needed on visitor use of parks 
          outside the high-use season from July until the 
          middle of August. 


     e.   Traffic tie-ups at park gates were likely to 
          occur when traffic was heavy if a high 
          proportion of the entering vehicles were to be 
          stopped. 


     f.   Surveying visitors at exit points would involve 
          problems for visitors in recalling their use of 
          park facilities and their other activities. 


     g.   The fact that high proportions of non-visitor 
          traffic passed through some park gates meant 
          that stopping a fixed percentage of incoming 
          vehicles could result in very few contacts with 
          visiting parties. 


There was no need to stop transport trucks or other vehicles 
that could be identified as non-visitor vehicles. But if 
interviews were only conducted with passengers in every 
tenth vehicle who turned out to be visitors, and if only one 
vehicle in five contained visitors, the sampling rate would 
be essentially one in fifty! 

Because the size of the universe of entering vehicles 
varied greatly at some park gates (a. and b. above) it was 
desirable to allow a survey crew to stop and interview as 
many vehicle parties as possible, but because of the 
inevitable problem of traffic hold-ups (e.), it was not 
feasible to conduct a census of entering traffic. Moreover, 
because of low proportions of visitor traffic to total 
traffic at various gates at various times, a standardized 
fractional sample applied to the whole park system was 
undesirable (g.). To avoid these problems, a sample was 
obtained by a procedure that involved an 'interviewer' 
stopping a vehicle, ascertaining specific information from 
the driver, 'passing' that vehicle and then stopping the 
next vehicle that passed the survey station that was not 
visually identifiable as a non-visitor vehicle. The type of 
sample obtained in this way was termed a 'floating' sample. 
It reduced the amount of time spent idle by interviewers 
during times of light visitor traffic, and eased traffic 
hold-ups at gates during heavy traffic. 
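
The selection rule behind the 'floating' sample can be 
sketched as follows. This Python fragment is an illustration 
under simplifying assumptions, not the field procedure itself: 
one interviewer is assumed, every interview is assumed to take 
the same number of minutes, and the function name and the 
arrival data are hypothetical. 

```python
def floating_sample(arrivals, interview_minutes):
    """Pick the vehicles a single interviewer would stop.

    arrivals: list of (arrival_minute, obviously_non_visitor) in time order.
    A vehicle is stopped only if it is not visually identifiable as
    non-visitor traffic and the interviewer has finished the previous party.
    """
    selected = []
    free_at = 0.0
    for minute, obviously_non_visitor in arrivals:
        if obviously_non_visitor:
            continue              # transport trucks and the like are never stopped
        if minute >= free_at:
            selected.append(minute)
            free_at = minute + interview_minutes
    return selected


# Hypothetical arrivals: (minute of arrival, identifiable as non-visitor?)
arrivals = [(0, False), (1, False), (2, True), (3, False), (4, False), (10, False)]
stopped = floating_sample(arrivals, interview_minutes=3)
print(stopped)
```

In heavy traffic the interviewer is busy continuously and the 
vehicles arriving in the meantime pass freely; in light traffic 
every eligible vehicle is stopped, so the sampling rate floats 
with the volume of traffic. 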

Actually, the preceding description is an 
oversimplification of the sampling procedure. In fact, three 
types of records of the numbers and characteristics of 
visiting parties were obtained in the surveys: (1) traffic 
counts, (2) entry records, and (3) handback questionnaires. 

Traffic count totals for each half hour on all vehicles 
entering each park gate during survey sessions were recorded 
on the form shown in Figure 2. (Fisher-Porter automatic 
traffic counters with a fifteen minute paper tape output 
have since been used successfully in a survey of Prince 
Edward Island National Park. The use of these counters 
reduces manpower requirements in future surveys by removing 
the need to have a person count cars.) Vehicles which could 
be identified without being stopped as not containing park 
visitors were excluded from the count and classified as "X" 
in Figure 2. This was done (1) to avoid unnecessary 
inconvenience to business and commercial traffic, and (2) to 
provide a more accurate measure of the percentage of the 
universe that was visitor vehicles than could be obtained 
using a sampling approach. These vehicles were also ignored 
by the interviewer, who stopped and interviewed people only 
in potential park visitor vehicles. 

Contact between an interviewer and the occupants of 
those vehicles identified as containing potential park 
visitors involved the filling out of an entry record by the 
interviewer. Entry records were short questionnaires 
completed by a surveyor during short interviews with those 
drivers of vehicles that were flagged down and who pulled 
off at the survey station. Part of the interviewer's task in 
obtaining entry record data was to determine, by a series of 
questions, whether or not the party should be given a 
handback questionnaire. 


                          Figure 2 


[Form: 1973 PARK VISITOR SURVEY TRAFFIC RECORD (FEUILLE DE 
DENOMBREMENT DES VEHICULES), Parks Canada, Terra Nova 
National Park. For each work period the form provides for 
the starting time and length, the universe of vehicles and 
the identified vehicles during the interview period, the 
universe of vehicles and the identified vehicles during the 
non-interview period, and the surveyor's initials.] 

The information requested (see Figure 3) was used by 
the surveyor to classify parties who were stopped and 
interviewed according to the purpose of their visits. 
Parties stating that they were entering to use facilities in 
the park were classified as park visitors. (Park visitors 
were also classified as repeat-entry visitors or first entry 
visitors but this distinction is not pursued here.) For 
these parties, the size and composition of the parties and 
their origin according to the vehicle licence plates were 
observed and noted on these forms. The driver was then asked 
about the party's proposed location of accommodation that 
night and their length of stay in the area, and the 
responses were recorded on the entry record. Handback 
questionnaires were given to these visitor parties, while 
for other parties only their trip purpose was recorded on 
the entry record. 

Handback questionnaires were longer forms than entry 
records and were attached to the bottom of entry records 
which had a common serial number (see Figure 3). The 
'handbacks' were torn from the bottom of entry records on 
completion of driver interviews and either placed in a waste 
bin or given to parties. When the handback questionnaire was 
given to park visitor parties, they were asked to 
have the party head complete and return it to deposit boxes 
situated near the park boundary beside the different roads 
that could be used to exit from the park. They were asked to 
keep the questionnaire for the duration of their visit and 
only return it on their final exit from the park. A mailing 
address was provided for respondents who omitted to do this. 

The reason for the use of a handback questionnaire 
after an interview at the park entrance was to provide 
information which described the party's use of recreation 
and accommodation facilities in the park, their actual 
(rather than intended) length of stay, and their preferences 
for types of accommodation and services in the park (see 
concern f. above and Figure 3). Having the handback 
questionnaire available during the entire visit to the park 
made it possible for parties to record information during 
the course of their visit, thus reducing recall problems 
that could have affected responses to interviews conducted 
with exiting visitor parties. In addition, had an exit 
survey been carried out: 


     1.   There would have been logistical problems as a 
          result of the varying length of time required 
          to complete the questionnaires. 


     2.   Traffic problems would occur if the required 
          number of cars was stopped at a given time in 
          order to obtain the same accuracy as an 
          entrance survey with about a 50% return of 
          handbacks. (A survey in 1972 at Riding Mountain 
          National Park was intended to assess the effect 
          of non-response in creating bias, resulting in 
          differences between entrance and exit surveys, 
          but the need to prepare a French questionnaire 
          resulted in data collection problems and 
          subsequent cancellation of work on the survey.) 


     3.   The interviewing of exiting visitors was of 
          concern to management because of the possible 
          reaction of some parties to being stopped when 
          they were "all packed and raring to go". 


     4.   Exiting parties could not be relied upon to 
          supply surveyors with accurate information on 
          their gate and time of entry; therefore traffic 
          partition factors derived from exit surveys 
          could only be used in subsequent years by 
          taking counts of exiting traffic, instead of 
          using the entrance data collection system 
          presently operated by permanent staff or 
          automatic traffic counters at most park 
          entrances. 


                          Figure 3 


[Forms: 1973 PARK VISITOR SURVEY ENTRY RECORD and 1973 PARK 
VISITOR SURVEY HANDBACK QUESTIONNAIRE, Terra Nova National 
Park, printed on one sheet. The entry record provides for 
the surveyor's number, interview time (24 hour clock), class 
of party (park business, through traffic, tour bus, 
commercial bus, resident, park visitor, repeat visitor), 
party size (adults, children), province or state of origin, 
language (English, French), accommodation (inside park, 
outside park, non-home, unknown) and length of stay in 
region (nights). The handback questionnaire asks: 

  1. How long do you plan to be away from home on your trip? 
  2. How long did your party stay in this Park on this trip? 
  3. If you stayed overnight away from home on this trip, 
     please indicate the number of nights spent in each of 
     the following types of accommodation (inside the Park; 
     rest of the trip to date). 
  4. Where is your present home located? 
  5. Which one type of overnight accommodation would you 
     prefer to use in or near this National Park? 
  6. Did you experience any difficulty in finding campground 
     accommodation within the Park? 
  7. How many times did your party use each of the following 
     Park facilities on this visit to the Park? 
  8. Which, if any, of the following would you have used if 
     they had been available in this Park? 
  9. If you have any comments you would like to make on your 
     visit to this National Park, please make them below. 

Respondents are asked to deposit the card in the box located 
at the park exit or, if unable to do so, to mail it to: 
Chief, Planning Division, National Parks Service, 400 Laurier 
Ave. West, Ottawa, K1A 0H4.] 


In order to sample as complete a cross-section of park 
visiting parties as possible using the 'floating sample 
entrance survey', the survey schedule for each gate in each 
Park was stratified on the basis of known variations in the 
content and volume of visitor traffic to National Parks at 
different times of day, on different days of the week, 
during different parts of the season. On the basis of the 
beginning and end of American and Canadian school holidays, 
which have a considerable influence on the volume and 
content of traffic entering National Parks, the season was 
divided into four parts: June 1 to June 16, June 17 to June 
30, July 1 to August 16, and August 17 to September 4. On 
the assumption that traffic volume was higher and content 
more varied on weekends than on weekdays, weekends were 
sampled more frequently. This was achieved by dividing each 
week in the season into four types of day, each of which was 
sampled with similar frequency: Saturdays, Sundays 
(including holiday Mondays), Fridays, and weekdays excluding 
Fridays. (Defining an optimal balance in the schedule 
between weekend and weekday sessions depends on knowledge of 
traffic distributions or in theory could be achieved with a 
Bayesian sampling approach. However, the latter would have 
presented severe practical problems in repeatedly changing 
schedules or only establishing them as data were received. 
Parks Canada's data for traffic distributions are being 
applied to optimal (cost-accuracy) survey strategies. See TN 
1GS.) Because traffic also varied with the time of day, each 
day was divided into two periods, mornings and afternoons. 
Thus, for each gate in each Park, at least one survey 
session in each part of day (3 parts in 1972, 2 parts in 
1973), in each type of day, in each part of the season was 
scheduled. In scheduling any additional sessions, preference 
was given to gates, periods and day-types which normally 
experienced heavy traffic or traffic that varied 
considerably in content, or to periods when heavy traffic at 
particular gates was known to occur. 
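
The stratification of the schedule can be expressed as a rule 
that assigns any survey session to a part of the season, a 
day-type and a period of the day. The Python below is a sketch 
only: the function names are hypothetical, the year 1972 is 
chosen for illustration, the second part of the season is 
assumed to end on June 30, and the caller is assumed to supply 
the dates of holiday Mondays. 

```python
from datetime import date

# Parts of the season as (first day, last day). Illustrative year 1972;
# the second part is assumed to run to June 30.
SEASON_PARTS = [
    (date(1972, 6, 1), date(1972, 6, 16)),
    (date(1972, 6, 17), date(1972, 6, 30)),
    (date(1972, 7, 1), date(1972, 8, 16)),
    (date(1972, 8, 17), date(1972, 9, 4)),
]


def season_part(day):
    """Return 1 to 4 for the part of the season containing the date."""
    for number, (first, last) in enumerate(SEASON_PARTS, start=1):
        if first <= day <= last:
            return number
    raise ValueError(f"{day} is outside the survey season")


def day_type(day, holiday_mondays=frozenset()):
    """Classify a date into one of the four day-types."""
    weekday = day.weekday()  # Monday is 0, Sunday is 6
    if weekday == 5:
        return "Saturday"
    if weekday == 6 or day in holiday_mondays:
        return "Sunday or holiday Monday"
    if weekday == 4:
        return "Friday"
    return "other weekday"


def stratum(day, period, holiday_mondays=frozenset()):
    """Stratum key: (part of season, day-type, period of day)."""
    return (season_part(day), day_type(day, holiday_mondays), period)


labour_day = date(1972, 9, 4)
print(stratum(date(1972, 6, 10), "morning"))
print(stratum(labour_day, "afternoon", {labour_day}))
```

A schedule that contains at least one session for every such 
key at every gate satisfies the minimum coverage described 
above. 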

The purpose of these stratifications was to obtain sets 
of entry records collected during surveyed sessions which 
could be 'imputed' into unsurveyed sessions at the same gate 
because they could be judged to be similar in traffic 
content according to their part of the season, type of day 
and period of day. In other words, it was assumed that 
variations in traffic content were greater between these 
stratifications than within them, so that the stratifications 
could be used to improve the accuracy of inflating the sample 
to represent the universe. 

On completion of the surveys, all records were returned 
to Parks Canada headquarters in Ottawa where they underwent 
manual screening and keypunching. The records were then 
computer-edited to check the internal consistency of all 
three types of record. Subsequently, traffic counts were 
cross-checked with the numbers of entry records in order to 
detect any errors which led to a larger number of entry 
records existing for a given half hour than there were 
vehicles counted during that half hour survey session. Error 
lists were produced and corrections made, and three separate, 
edited and sorted files were eventually produced. 
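
The cross-check of traffic counts against entry records 
amounts to a simple comparison for each half hour. The Python 
below is a minimal sketch of such a check, not the editing 
program actually used: the function name, the form of the keys 
(gate, date, half hour) and the counts are all hypothetical. 

```python
def find_count_errors(traffic_counts, entry_record_counts):
    """Return the half hours having more entry records than vehicles counted.

    Both arguments map a (gate, date, half_hour) key to a count. A half
    hour with entry records but no traffic count is treated as an error.
    """
    return sorted(
        key
        for key, n_records in entry_record_counts.items()
        if n_records > traffic_counts.get(key, 0)
    )


# Hypothetical counts for one gate on one day.
traffic_counts = {
    ("south", "1973-07-07", "10:00"): 42,
    ("south", "1973-07-07", "10:30"): 9,
}
entry_record_counts = {
    ("south", "1973-07-07", "10:00"): 12,
    ("south", "1973-07-07", "10:30"): 11,
    ("south", "1973-07-07", "11:00"): 3,
}

error_list = find_count_errors(traffic_counts, entry_record_counts)
print(error_list)
```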


WEIGHTING 


Entry Record Weighting 


The first task undertaken with the entry record file 
was to calculate sampling rate weights for each and every 
half hour during which entry records were distributed. This 
was done in order to inflate the entry records to be 
representative of the universe of traffic stopped at each 
gate. 

The size of the universe of visitor traffic during each 
half hour was estimated. The proportion of visitor vehicles 
among all vehicles which were stopped was assumed to have 
been the same for the total universe of vehicles which had 
been recorded as potential visitor vehicles. As explained 
above, this count had separated vehicles which could be 
identified without being stopped as non-park visitor vehicles 
from possible or 'interviewable' vehicles. Thus, for a 
particular half hour survey session at a particular gate, let 

     u = total universe of all vehicles entering, and 


     x = number of vehicles identified as non-park visitor 
         vehicles without being stopped. 


Then (u-x) is the number of potential park visitor 
vehicles of which a certain number was stopped. 


     If t = total number of potential park visitor 
            vehicles stopped 


The sampling rate is t/(u-x) and thus the sampling rate 
weight, its reciprocal, may then be expressed as: 


     Sampling Rate Weight for a given gate, 
                          for a given day, 
                          for a given half hour  =  (u-x)/t 


It is notable that where x is large, that is, where a high 
proportion of entering traffic could be identified as 
non-park visitor, the sampling rate weight is more accurate 
than would be the case if identifiable vehicles were not 
counted. Thus, by identifying that part of the traffic to be 
stopped for interviews, greater accuracy was obtained than 
would be the case by taking a sample from the universe of all 
entering traffic. 
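
As a worked illustration of the sampling rate weight, the 
Python sketch below takes the weight to be the reciprocal of 
the sampling rate t/(u-x). The function name and the counts 
are hypothetical. 

```python
def sampling_rate_weight(u, x, t):
    """Weight (u - x) / t for one half hour at one gate.

    u: all vehicles entering; x: vehicles identified as non-visitor
    without being stopped; t: potential visitor vehicles stopped.
    """
    if t <= 0:
        raise ValueError("no vehicles were stopped in this half hour")
    return (u - x) / t


# Hypothetical half hour: 120 vehicles entered, 40 were identifiable as
# non-visitor traffic, and 16 of the remaining 80 were stopped.
weight = sampling_rate_weight(u=120, x=40, t=16)

# If 10 of the 16 stopped vehicles turned out to be park visitors, each
# of their entry records stands for `weight` visitor vehicles.
estimated_visitor_vehicles = weight * 10
print(weight, estimated_visitor_vehicles)
```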

The use of weights to inflate the number of entry 
records to represent the universe of visitor traffic during 
surveyed sessions is shown symbolically in the five figures 
beginning with Figure 4. In Figure 4, the shaded areas of 
the left-hand portions of particular periods, represented by 
complete cells, indicate those half hour sessions during 
which surveys took place. For the sake of simplicity the 
shaded areas are not drawn in proportion to the sizes of the 
samples, nor are the cells drawn in proportion to the size 
of the estimated universes of visitor traffic. The diagonal 
shading in Figure 5 shows symbolically how sampling rate 
weights were used to inflate the number of vehicles stopped 
to represent the estimated universe of visitor vehicles for 
each half hour surveyed. 

In Figures 4 and 5 it may be noticed that there were 
'time gaps' when no interviewing took place. During some 
survey sessions rain prevented the interviewing of entering 
vehicles, while sickness of survey crews or transportation 
difficulties also precluded a record of entering vehicles 
for various lengths of time. In addition, during each survey 
session, a meal break of one hour was taken by crews, 
resulting in there being no entry or traffic records for 
those sessions. To fill these gaps, entry records of the 
same day from adjacent time periods were duplicated to 'fill 
in for' the missing records as an estimate of what would 
have been observed. 

The actual procedure used in making these 'time fills' 
depended on whether or not records existed for the time 
periods before and after a gap. Where such records existed, 
the records from the preceding and following strata were 
duplicated. Since the length of time of the period to be 
filled and the 'filling' periods could differ, and typically 
one half hour was 'filled' by two thirty-minute periods, the 
sampling rate weights of duplicate records were modified by 
multiplication by another weight calculated as: 


                        number of minutes in time period 
                        to be filled 
     Time Fill Weight = -------------------------------- 
                        number of minutes in 'filling' 
                        period 

When a gap occurred at the beginning or end of a day, 
records from the surveyed half-hour periods adjacent to it 
were used to fill the gap. The effects of the time filling 
procedure just described are shown symbolically in Figure 6 
by the vertical shading. 
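
The time fill weight can be illustrated with a short Python 
sketch. The function name and the example gaps are 
hypothetical; the calculation is the ratio of minutes given 
above. 

```python
def time_fill_weight(gap_minutes, filling_minutes):
    """Minutes in the period to be filled over minutes in the filling period."""
    if filling_minutes <= 0:
        raise ValueError("the filling period must contain some surveyed time")
    return gap_minutes / filling_minutes


# One missing half hour filled from the preceding and the following
# half hour, that is from 60 surveyed minutes.
half_hour_gap = time_fill_weight(30, 60)

# A gap at the start of the day filled from the single adjacent half hour.
edge_gap = time_fill_weight(30, 30)

# A duplicated record carries its own sampling rate weight (here an
# assumed value of 5.0) multiplied by the time fill weight.
duplicated_record_weight = 5.0 * half_hour_gap
print(half_hour_gap, edge_gap, duplicated_record_weight)
```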

The use of sampling rate weights and time fill weights
effectively provided complete sets of records for all
periods during which a survey took place. These completed
sets of entry records could then be given a further weight
in order to inflate the number of records, so that weighted
tabulations would give correct estimates of the total
numbers of visiting vehicles entering each park. The weights
used to do this, with the exception noted later, were
manually calculated, and thus have been called 'manual
weights'. These weights were equivalent to the number of
times that data for a given complete period would have to be
duplicated in order to 'fill' all periods of the same type,
on the same day-type and in the same part of the season at
the same gate. For example, suppose that at a particular
gate there are data from a Saturday morning which was the
only Saturday morning on which a survey took place at that
gate in that part of the season. If there were three
Saturdays in that part of the season, the weight of each
record from the particular Saturday morning would be
multiplied by a manual weight of 3.0. If there had been
surveying on two of the three Saturdays in that part of the
season, the manual weight would have been 1.5. This
weighting inflated observations to give estimates for a
major part of the park visitor traffic, as shown in Figure 7
by the horizontal shading.
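
The Saturday example can be checked with a minimal sketch. It assumes that the manual weight is simply the number of periods of a given type in the part of the season divided by the number actually surveyed, which is what the two worked figures in the text imply; the function name is hypothetical.

```python
def manual_weight(periods_in_stratum, periods_surveyed):
    """Number of times each surveyed period must stand in for periods of its type."""
    if periods_surveyed <= 0:
        raise ValueError("a manual weight needs at least one surveyed period")
    return periods_in_stratum / periods_surveyed


# Three Saturday mornings in this part of the season, one of them surveyed.
one_of_three = manual_weight(3, 1)

# The same three Saturday mornings with two of them surveyed.
two_of_three = manual_weight(3, 2)
```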

As Figure 7 shows, after time filling and manual
weighting the only traffic for which there are no estimates
occurred when, through error or logistical problems such as
sickness of crew members or rain, an entire period of a
given day-type within a part of the season, which was the
only period of that type, was missed. In Figure 7, for
example, this is shown as weekday afternoons at the south
gate of Terra Nova Park during part 2 of the summer. Under
these circumstances, 'estimates' of the missing data were
created from data of the same day-type in another part of
the summer, which were duplicated to fill the gap. The
weights on the records which were duplicated were multiplied
by a 'reassignment weight' which generally reflected the
difference between the total traffic for the period with no
data and the total traffic volume associated with the
records which were reassigned. (Space does not permit a
detailed explanation of how this weight was estimated;
further information on this and other details may be
obtained from Parks Canada on request.) Reassigned entry
records were duplicated to fill missing survey sessions, as
shown symbolically in Figure 8 by the speckled areas. Copies
of records were given the reassignment weights to correct
for the different traffic volume.

The variables 'date' and 'month' were invalidated by
both types of manual weighting described above. In order
that tabulations of data using the variables 'date' and
'month' would not contain incorrect frequencies, the values
of these variables were scrambled and made unavailable for
normal tabulations.

Thus, at the end of the weighting process, all original
entry records received a sampling rate weight and, since no
gate was surveyed on every occurrence of a particular period
and type of day in the same part of the season, a manual
weight. The duplication of records for time filling and
reassignments was undertaken simultaneously and the weights
assigned as shown in Figure 9. As the figure shows, the
weight of entry records was the product of three weights if
they represented entry records in a surveyed session which
was reassigned to another part of the season.
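
How the separate weights combine can be sketched as follows. The function name, the sample values, and the convention of passing 1.0 for a weight that does not apply to a record are assumptions for illustration; Figure 9 of the source defines the actual assignment.

```python
from math import prod


def final_entry_weight(sampling_rate_weight, manual_weight,
                       time_fill_weight=1.0, reassignment_weight=1.0):
    """Multiply together the weights that apply to one entry record.

    Weights that do not apply (for example, no time filling) are left at 1.0.
    """
    return prod([sampling_rate_weight, manual_weight,
                 time_fill_weight, reassignment_weight])


# Original record from a surveyed session: sampling rate and manual weights only.
original = final_entry_weight(4.0, 1.5)

# Copy of that record reassigned to another part of the season:
# the product of three weights, as described in the text.
reassigned = final_entry_weight(4.0, 1.5, reassignment_weight=0.8)
```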


Handback Record Weighting 


The weighting of handback records required an
additional special consideration. Since not all
questionnaires for each surveyed period were returned,
handback questionnaires could not be weighted simply by
matching each one to its 'parent' entry record by serial
number and then giving it the 'parent's' weight. This
procedure could be undertaken for returned handback records,
but obviously could not be used for unreturned handback
questionnaires.

So, the assumption was made that the content of
handback questionnaires of parties which did not respond was
similar to that of similar parties which did respond. In
other words, entry records with responses that were similar
according to a number of criteria should, it was assumed,
have 'fathered' similar handback records. Thus, entry
records for which handback questionnaires had not been
returned were each tested for their similarity to entry
records for which handback questionnaires had been returned.
Entry record 'similarity' was used as a criterion for
imputing that a given handback record should be assigned to
an entry record for which no handback had been returned.
(The authors' justification of this assumption is the
latent structure model of Lazarsfeld and Henry (see Reference
39), which suggests that different probabilities of handing
back questionnaires are associated with people in different
collectivities, which in this case are defined to a high
degree by the entry record variables.)

To accomplish the desired matching, a file of original
entry records was split into two files, one containing entry
records with handbacks and the other containing entry
records without handbacks. In this process, six new
variables were created by arbitrarily grouping the values of
the existing variables of origin, number of adults, number
of children, day of week, accommodation, and length of stay.
Entry records were judged to be similar if the values of the
newly created variables and of several original variables
matched.

The use of the newly created variables meant that entry
records were to be judged similar only if they were
distributed at the same gate, within an hour on the same
type of day and within 28 days of each other, to parties
having the same region of origin, size, number of adults,
similar intended length of stay and similar intended type of
accommodation. Specifically (as shown in Figure 10), in the
matching procedure the original ungrouped variables were
dropped one at a time from each series of comparisons of an
entry record with no handback with all entry records
with handback records. The number of variables dropped
before one or more matches was found determined the level of
match. Thus entry records with original handback records
were automatically assigned a match level of 1, those
matched after the relaxation of the variables 'number of
children' and 'number of adults' were assigned a level of
match of 2, and so on. A 'level 4 match' would occur where
two entry records agreed on the values of all variables
except serial number, number of adults and children, day of
week interviewed, and accommodation intended. When a match
to an entry record was found at a given level of match,
comparisons at this level were continued to the end of the
file in order that additional, equally similar, matches
would also be found. The matching procedure was then
repeated for the next entry record with no handback and
iterated until the end of the file of entry records with no
handbacks was reached.
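
The progressive relaxation described above can be sketched in a few lines. This is an illustration under stated assumptions, not the original program: the record layout, the variable names, the serial numbers 'D1', 'D2' and 'R1', and the particular variables compared at each level are all hypothetical, and the actual relaxation order is the one given in Figure 10.

```python
# Each level lists the variables that must still agree; later levels are looser.
# These variable sets are placeholders, not the rules of Figure 10.
LEVELS = [
    (2, ("gate", "origin", "day_of_week", "accommodation")),
    (3, ("gate", "origin", "accommodation")),
    (4, ("gate", "origin")),
]


def find_matches(recipient, donors, levels=LEVELS):
    """Return (level, donors matched at that level); level 0 means no match."""
    for level, keys in levels:
        # Scan the whole donor file so that all equally similar donors are kept.
        matches = [d for d in donors if all(d[k] == recipient[k] for k in keys)]
        if matches:
            return level, matches
    return 0, []  # the text creates a dummy handback record in this case


donors = [  # entry records whose handback questionnaire was returned
    {"serial": "D1", "gate": "south", "origin": "ON",
     "day_of_week": "Sat", "accommodation": "camp"},
    {"serial": "D2", "gate": "south", "origin": "ON",
     "day_of_week": "Sun", "accommodation": "camp"},
]
recipient = {"serial": "R1", "gate": "south", "origin": "ON",
             "day_of_week": "Sat", "accommodation": "camp"}

level, matches = find_matches(recipient, donors)
```

Stopping at the first level that yields any donor, while keeping every donor found at that level, mirrors the rule that comparisons continued to the end of the file once a match had been found.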

Each entry record in the file of those without handback
records which was matched to one or several entry records by
this procedure was then identified by its serial number and
that of the similar entry record(s) with the handback
records, as shown in Figure 11. The number of matches and the
level of match were also recorded. As shown in Figure 11,
when no similar case could be found for an unmatched entry
record after all the constraint rules on matching had been
relaxed, a dummy handback record was created with 'missing
values' of zero for each variable. In order to be able to
distinguish dummy from other handback records on the final
files, a level of match of zero identified dummy handback
IMPUTATION RELAXATION RULES

[Columns showing the imputation level at which each variable
was relaxed are not recoverable from the source.]

Variable Name

Park *
Gate *
Area of Origin
No. of Adults - Grouped
Party Size - Grouped
Day of Week Interviewed - Grouped
Accommodation Used - Grouped
Length of Stay in Days - Grouped
Time of Survey (same half-hour) *
Day of Survey *
[illegible] of Entry *
Length of Stay in Days *
Origin of Party *
Accommodation Used *
Day of Week Interviewed *
Adults *
Children *
Serial Number of Questionnaire *

* Original variables
records.

To make the searches for matching entry records as
brief as possible, the files of matched and unmatched entry
records were sorted by the values of each of the variables
used in assessing similarity. In this way, an entry record
with no handback record would only be compared with a group
of entry records having the same values of park, gate,
origin code, etc., and each comparison could be completed
with a single pass over a small part of the file of entry
records with handback records.


DISCUSSION 


The following discussion focuses on five areas that 
require some consideration in evaluating the procedures 
adopted for this and other similar surveys.


Survey Biases


At least two important sources of bias are found in the
survey procedure described above which require some
discussion.


1. Floating Sample Biases:


Using a floating sample with half-hour time intervals,
and assuming that the average volume of traffic does not
change drastically during a particular half-hour, a problem
regarding the randomness of observations is encountered.
Changes occur in the composition of entering traffic,
usually signalled by a rapid change in traffic volume, e.g.
from park staff driving to work to visitors, or from
non-resident arrivals to the rush of local people on Friday.
In half-hours where such changes take place, the low volume
traffic will be over-sampled. However, this bias occurred
only in a few sampled half-hour periods each year at only a
few parks and is unlikely to affect the highly aggregated
figures by more than a fraction.

An important problem with floating samples arises from
the perpetually high probability of stopping slow vehicles
that are at the head of line-ups of their own making. Such
cars may be driven by the elderly, may be touring recreation
vehicles, or may be slow for some other reason. A
consequence of this clumping problem is that recreation
vehicles travelling together, such as caravans, are likely
to be subject to only one interview. However, while the
'traffic volume equivalent' of the caravan would suggest a
large weight for the interview, the weighting procedures
will assign the caravan vehicle the same weight as a private
car travelling alone.

The latter problem could be remedied by simply applying
a special inflation weight to caravans depending on the
number of vehicles. To prevent the bias caused by selecting
cluster leaders, a survey procedure for selecting a random
potential visitor vehicle in a cluster could be used.
Figure 11a


EXTRACT FROM TYPICAL WORKING FILE PRIOR TO IMPUTATION PROCESS
(sorted by Park and Serial Number)

[?] marks a value that is illegible in the source.

PARK   ENTRY RECORD    WEIGHT   HANDBACK RECORD
NO.    SERIAL NUMBER            SERIAL NUMBER

01     00567           4.5      00567
01     00568           4.5      unreturned
[?]    [?]             [?]      unreturned
01     00571           3.0      unreturned
01     00572           3.0      00572
[?]    [?]             [?]      unreturned
01     00576           [?]      00576
01     [?]             [?]      00577


Figure 11b


EXTRACT FROM TYPICAL WORKING FILE AFTER IMPUTATION PROCESS
(sorted by Park and Entry Record Serial Number)

PARK   ENTRY RECORD    WEIGHT   HANDBACK RECORD   NUMBER OF   MATCH
NO.    SERIAL NUMBER            SERIAL NUMBER     MATCHES     LEVEL

01     00567           4.5      00567             1           1
01     00568           4.5      03692             1           [?]
01     00569           4.5      00985             1           [?]
01     00570           3.0      00570             [?]         [?]
01     00571           1.5      03980             2           [?]
01     00571           [?]      01197             2           [?]
01     00572           2.0      00572             1           [?]
01     00573           5.0      00573             0           [?]
01     00574           3.0      00574             1           [?]
01     00575           [?]      00575             1           [?]
01     00576           [?]      [?]               1           [?]
01     [?]             [?]      [?]               1           [?]

At least two advantages of a floating sample over fixed
sampling rates should also be noted:


(a) surveyors used their time more efficiently 
than in procedures using fixed sampling rates, and 


(b) the problem of traffic congestion that can be 
created when employing a fixed sampling rate 
was avoided.


2. Entrance Survey Biases:


Probably the most dependable method of checking the
reliability of an entrance survey is to undertake a
simultaneous exit survey and compare the results of the two
surveys. As mentioned above, Parks Canada attempted to carry
out such a nonresponse bias survey in 1972, in which the
underlying assumption, which may be questionable, was that
aggregated responses to exit interviews would be less
biased than estimates that depended on making corrections
for nonresponse to handback questionnaires. It was
unfortunate that it was not possible to finish this survey
as a result of difficulties in printing a second set of
handback questionnaires. Actually, it is likely that because
of biases occurring in how people answer questions on how
long their visit was, what they did when they were about to
leave, and other questions, both entrance and exit surveys
may be biased!

It is possible, however, to identify (if not measure) 
three important sources of bias that occur when using 
handback questionnaires: 


(i) The handback questionnaire may influence
behaviour by providing suggestions for
activities that would not have been
considered otherwise.

(ii) The handback questionnaire can be lost
during a recipient's visit to a park, and the
longer the stay the more likely is the loss
or destruction of a questionnaire. It was
hoped, however, that losses occurred at a
similar rate among visitors with similar entry
record characteristics. Returns from an exit
survey are not likely to be lost, nor are
they destroyed for any number of reasons.
Reasons for nonresponse at an exit are much
clearer than reasons for nonreturn of a
handback questionnaire!


(iii) Multiple entry bias is created when some
people enter a park many times on a single
visit while others enter only once. This
causes a bias similar to the one that
arises when a surveyor moves around a park
interviewing people at random. In the one
case the visitor who enters several times
has a higher probability of being
interviewed in an entrance survey than the
person who enters once; in the other type of
survey a person's probability of being
randomly selected rises in proportion to
the amount of time he spends being available
to be selected at random. This is, of
course, controlled by only collecting data
and distributing handbacks on first entry.
However, a number of 'day visitors' stay in
a campground just outside a park, and
'should', according to some definitions of
first entry visitors, be interviewed each
day, while people who stay a short distance
from these people, but within the Park,
should only be interviewed on first entry.
Do these different visitors really know what
their first entry is, or can an interviewer
clarify this easily? Evidence from a 1974
survey of P.E.I. National Park suggests that
the answer is no.


Apart from these disadvantages of using an entrance 
survey there are two notable advantages: 


(i) People were not inconvenienced by being asked
questions as they were leaving a Park but
instead had a questionnaire with them while
in a Park. This allowed people to use their
questionnaires to record activities as they
were performed, thus reducing problems of
recall bias.


(ii) Even if an exit survey had been carried
out using a 'floating sample', there would be
little difference in the number of completed
questionnaires obtained from a given input of
surveyor's time. Ten-minute interviews at the
Park exit, instead of five-minute interviews at
the Park entrance, would have produced a
similar number of records if 50% of handback
questionnaires were returned, as was the case
in 1973. Moreover, obtaining answers to all
questions for almost all interviews is less
likely to affect the reliability of results
than would the inflation of a 50% response to
represent the universe.


Survey Efficiency
This paper has introduced a number of survey design and
weighting considerations that should be of relevance to
persons carrying out such surveys or any similar surveys of
people, on foot or in vehicles, who enter a geographical
area and subsequently leave it. It must be recognized that
the paper has not dealt with a number of important issues
including:


(a) Whether a survey is the appropriate or most 
cost-efficient method of obtaining certain 
information.


(b) Why a simple random interviewing of people, 
for example in a camping area, is not a more 
effective strategy for obtaining data on park 
users than that described above.


(c) How to calculate the appropriate amount of
sampling and the appropriate numbers of survey
sessions on given day-types to achieve a cost
efficient survey that, for example, has 99%
probability that all estimates for the numbers
of people being in certain values of 10
critical variables will be within 1% of the
true number.


Much information on park visitor activities can be
measured in ways other than surveys more accurately than by
surveys. Campground registrations or adult and children
ticket sales are good sources of data (see (a) above). (A
comparison of total traffic volumes estimated by the 1973
surveys and by permanent traffic counters is shown in
Figure 12.) However, when data on a party's activities in a
park are to be used to analyse a park's operation, other
data that record the presence of a party or person at a
specific location are needed. Data on a person's movement
within a park may be collected by the use of plastic cards
similar to a credit or computerized library card (such a
system is in operation on Long Island in New York State).
Actually, a procedure tested at Gros Morne National Park in
1974 in a visitor survey carried out by Parks Canada
involved assessing the use of a National Park by keeping a
record of vehicle licence plates. Although the method is not
completely original, its success suggested its extensive use
in 1975. In some parks where activities are highly oriented
to moving from place to place by automobile, a large amount
of information can be gained from licence plates and visual
observations made by surveyors of the people travelling in a
vehicle or people getting into or out of a vehicle.

It is important to note that an optimal survey
technique makes efficient use of time and money, and the
1972-73 strategy described above does not! As suggested in a
footnote, an automatic traffic counter can perform all the
activities of the traffic counting member of a two-man
survey crew except for counting cars that are not park
Figure 12


COMPARISON OF ESTIMATED AND OBSERVED
ENTERING TRAFFIC 1972-3

[?] marks a value that is illegible in the source.

                   TOTAL ESTIMATED   TOTAL RECORDED
                   FROM VISITOR      BY TRAFFIC       PERCENTAGE
PARK               SURVEYS           COUNTERS         DIFFERENCE

1972
Banff              838,402           961,158          [?]
Kootenay           [?]               unknown          -
Yoho               400,985           405,359          1.1
Jasper             [?]               316,465          [?]
Fundy              174,269           [?]              1.6

1973
Terra Nova         [?]               not reliable*    -
Kejimkujik         38,088            [?]              [?]
Forillon           76,666            68,952           [?]
La Mauricie        [?]               [?]              [?]
Point Pelee        [?]               72,686           [?]
Riding Mountain    166,816           [?]              [?]
Prince Albert      [?]               48,419           [?]
Pacific Rim        81,347            86,959           [?]

* Traffic counters were not accurately calibrated.

Large differences may be accounted for by large
amounts of night-time traffic or by pedestrian or other
non-vehicle traffic during the daytime.

visitor vehicles. A mechanical counter costing $2,000 (1975
price) can be set up to count traffic in one direction, to
punch on tape the total traffic every 15 minutes, and to
work 24 hours a day, 7 days a week with little servicing and
no personnel, hiring or accommodation problems, and so
on. A survey crew member counting traffic for three months,
working 37 hours a week, costs well over $2,000 plus
expenses, and does not produce accurate data by 15-minute
intervals ready to be automatically converted to punch cards
and processed by a computer. The traffic counter performs
more than four times as much counting as a crew member,
who works one-fourth of the hours in any week. Thus, a crew
member and data processing staff can be replaced by a
traffic counter, and the remaining surveyor's instructions
modified slightly so that he counts obvious non-park visitor
traffic, achieving better results than in 1972-73 at a
lower cost even if the traffic counter were thrown out at
the end of a summer!
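
The 'more than four times' comparison follows directly from the hours quoted in the text. A minimal check, with hypothetical names and assuming the counter runs without interruption:

```python
COUNTER_HOURS_PER_WEEK = 24 * 7   # a counter working 24 hours a day, 7 days a week
CREW_HOURS_PER_WEEK = 37          # working week of a crew member, from the text


def coverage_ratio(counter_hours=COUNTER_HOURS_PER_WEEK,
                   crew_hours=CREW_HOURS_PER_WEEK):
    """How many times more hours the counter covers than one crew member."""
    return counter_hours / crew_hours


ratio = coverage_ratio()          # 168 / 37
crew_share = 1 / ratio            # share of the week covered by the crew member
```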

In a 1974 survey at Prince Edward Island National Park,
where the traffic counter strategy was used, students could
perform double the amount of surveying that they could using
the old strategy because they did not count traffic. It was
also useful for improvement of weighting to have hourly
traffic data, 7 days a week all summer. Also, since traffic
counters can be used for other purposes during the winter
and are good for 10 years, the cost efficiency of surveying
was greatly raised.


Survey Assumptions 


Having ascertained that a survey is needed and
efficiently designed vis-à-vis (a) and (b) above, and even
to the detail noted in (c), there are several sources of
inaccuracy in returns that result from making the
assumptions (1) that all people in a party would respond in
a similar manner to a question, (2) that all the people
in a party participate in the same activities, and (3) that
the responses of randomly selected party members can be
inflated by party size to obtain a 'universal' picture of
the users' opinions or activities.

It is hoped that in making these assumptions in 
collecting entrance survey data and in the weighting 
strategy used, returns provided: 


(1) Data on parties, obtained from answers to
questions that could be asked of whole
parties, e.g. 'How many nights did you stay at
the Park?' (A not uncommon problem occurs
when, for example, the husband leaves his
family in a park for two weeks and visits them
only on week-ends.)


(2) Answers from individuals to 'individually
focused' questions. Alternatively, by
obtaining a response from a random individual
in a party and data on party composition, the
appropriate weighting of responses would be
possible. This would provide estimates of the
percentage of individuals in the universe that
give a certain response.


It is the opinion of the authors that there is no
fallacy in the approach endorsed of filling time gaps in
data to complete daily data. It might be argued that there
is less chance of adequately estimating traffic for local
rush hours or other periods of traffic heterogeneity.
However, it is more reliable to time fill from 'nearby'
times of the same day rather than from identical time
periods on similar days. Since the composition of entering
traffic usually shifts slowly over a period of one or two
hours, time filling within a day should only result in
relatively slight biases, particularly when gaps are filled
from both sides. Given that time filling has resulted in the
completion of every survey session, it is valid to assign
'manual weights' to data which reflect how often each
afternoon or morning of surveying could have occurred during
a given part of the summer, on a particular day type (e.g.
weekday) at a particular gate, compared to how often it did
occur.

In addition, there is no reason why the handback
imputation procedure used should not produce results as good
as or better than simple inflation in correcting for
non-response (see Figure 13). Actually, since matches were
not found for some records, it was necessary to inflate a
number of estimates to obtain an estimate for the universe.
The authors hold that it was more accurate to inflate an
estimate by 10% (by multiplying the estimate by 1.1) than
it was to inflate a 50% response by 2.0.


Reliability and Validity of Data


In the preceding discussion it has been noted that
there has been very limited progress in assessing and
understanding the reliability of estimates obtained from the
park user survey, yet this field of investigation has
progressed considerably compared with that of assessing the
validity of such estimates. However, there persists the
problem that even if we know, for example, the true number
of camper nights that occurred in a park, it is not possible
to discuss how an estimate of this number based on a survey
reflects on the validity of the estimate unless the
reliability of the estimate is known. To determine whether
or not the estimate is biased, rather than whether the
estimation procedure is conceptually or mathematically
unsound, the reliability of an estimate must be assessed
before its validity can be determined. If the true value is
known, and if an estimate of the reliability has been made,
then a statistical test can be used to accept
or reject the hypothesis that the estimate agrees with the
true value. If the hypothesis is rejected, the validity of
the estimate may be questioned. If the hypothesis is
accepted, the validity of the estimate may be accepted
conditional upon there being a chance that a more subtle
analysis would show that the estimate was not totally
unbiased. Thus, if a larger sample were obtained or
improvements were made in weighting procedures, an estimation
procedure that is adequate for certain sample sizes and
weighting procedures could be recognized as unacceptable
when more accurate estimates can be made.


1. Validity of Entry Record Estimates


There are some data available from alternative sources
that permit a cursory examination of the validity of the
procedures endorsed in this paper with respect to the
weighting of entry records to estimate for the universe of
Park visitors. Questions asked in the survey make it
possible to estimate total entering traffic for surveyed
periods, which can be compared with 'observed' entering
traffic counted by automatic traffic counters located near
the entrances to many of the parks (Figure 12).

Unfortunately, Figure 12 may be misleading in that the
totals recorded by traffic counters are not perfectly
accurate. A project is under way that has shown that traffic
counts at one park gate were inaccurate by 100% because of
the improper functioning of a pneumatic traffic counter. It
should be noted that there are many reasons for the
discrepancies between counts of entering vehicles and
estimates of these counts based on the park user surveys.
These include such factors as:


(a) the survey only covers part of the day and 
night while traffic counts at almost all gates 
are collected over 24 hours; 


(b) all traffic counts obtained by the use of
pneumatic tube counters are biased by the axle
count factor that must be applied to convert a
raw count to vehicles. Error from this source
can be high unless an extensive investigation
is undertaken to provide an accurate
conversion factor;


(c) where lanes are not separated, exiting traffic
may be counted as if it were entering; and


(d) if either loop induction or pneumatic tube
equipment is not properly adjusted it may
systematically over- or under-count traffic.


Thus, Figure 12 shows that there are some parks for
which there is excellent agreement between estimated entry
counts and recorded counts. The recognition that no more can
be said concerning the validity of estimates, and that there
are serious problems with traffic counts, prompted the
initiation of two projects: to derive the best possible
traffic counts by analysis of all existing Parks Canada and
other counts, and to examine every traffic counter's
location, calibration, maintenance, and other
characteristics which affect its accuracy.


2. Validity of Handback Imputation Process


The only available independent figures that are known
to be accurate and can be related to questions asked in the
survey are those showing party nights of "developed party
campground" use. Figure 13 presents estimates and
observations of this figure for Terra Nova and Kejimkujik
National Parks based on 1973 results. A research project
which is under way will produce "split sample" estimates of
the variance in the estimated figures. At present it is not
possible to infer from Figure 13 whether the
differences between observations and predictions can be
accepted to be due to chance or whether a bias (lack of
validity) can be detected.

The estimation of a campground use figure from survey
results implicitly tested the adequacy of the handback
weighting procedure. However, unless weighting of handbacks
altered the various categories of people in a way that
changed estimates, Figure 13 actually shows the result of
simply inflating handback results as opposed to using the
imputation process. Figure 13 shows that the imputation
process does in fact change estimates! It is to be hoped
that this change is in the correct direction and that an
advantage has been gained by imputing handbacks for the 60%
of the people who did not return them. Inflating from a 46%
response rate to 77% of questionnaires having handbacks meant
that the very arbitrary assumption, that in making estimates
for the universe (or some subset of it) those people who did
not respond would behave like "some average person", was
made for only about 24% of all records rather than for 54%.

Work is currently being undertaken to evaluate more fully
the handback imputation process by determining whether
similar entry records tend to have similar handback records.
In this project, entry records with handbacks matched to an
entry record are checked to see how similar the matched
handbacks are and how this similarity relates to what would
occur by chance and by using other matching approaches.
Cluster analysis and multiple discriminant analysis are
being used to test the "adequacy" of matching.


ESTIMATES OF VARIANCE 


The reliability of the type of survey described above
was of considerable concern in Parks Canada's approach to
user surveys, to the extent that the design of the 1972 Park
User Surveys was specified so that a sub-survey and its
replicate were built into the overall survey design. However,
in the end it was recognized that the incompatibility of the
weighting system with the sub-survey and replicate made it
impossible to estimate variance using the original plan.
Instead, the results of the entire survey were split
synthetically into two sub-samples. This was achieved by
randomly dividing specially weighted original entry records
into two files. The error estimate was calculated as a
function of the difference between the total weights of the
two sub-samples using the formula:


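\[
S(A) = \frac{\lvert M(1) - M(2) \rvert}{2}
\qquad \text{(standard deviation for number of ``X'' based on entry records)}
\]

\[
S(B) = \frac{\lvert M(1)/p - M(2)/p \rvert}{2}
\qquad \text{(standard deviation for numbers of ``X'' based on handbacks)}
\]

\[
\%\ \text{error} = \frac{100\,(1.96)\; S(A \text{ or } B)}{\text{estimated number of ``X''s}}
\]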


where M(1) and M(2) are estimates of the numbers of "X"
(number of visitor days, number of U.S. visitor days, etc.),
p is the response rate for handbacks for the park or
park gate being considered, and 1.96 could be replaced
by other values if one wished to be less or more certain of
estimates in the way described subsequently.
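
The split-sample error calculation can be sketched as below. The function name and the numbers are assumptions for illustration; the estimated number of "X"s is passed in explicitly because the text does not state how it is formed from M(1) and M(2).

```python
def split_sample_percent_error(m1, m2, estimate, response_rate=1.0, z=1.96):
    """Percent error from two sub-sample estimates m1 and m2.

    response_rate is p in the formula: leave it at 1.0 for entry record
    variables (S(A)) and set it to the handback response rate for
    handback variables (S(B)).
    """
    if estimate <= 0 or not 0 < response_rate <= 1:
        raise ValueError("estimate must be positive and 0 < response_rate <= 1")
    standard_deviation = abs(m1 / response_rate - m2 / response_rate) / 2
    return 100 * z * standard_deviation / estimate


# Hypothetical sub-sample estimates of some quantity "X".
entry_error = split_sample_percent_error(10_500, 9_500, estimate=10_000)

# The same sub-sample figures treated as handback-based, with p = 0.5.
handback_error = split_sample_percent_error(10_500, 9_500, estimate=10_000,
                                            response_rate=0.5)
```

Dividing each sub-sample estimate by p widens the difference between them, which is why handback-based figures carry a larger percent error than entry record figures.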


The factor p appears in the formula because the way
data were processed means that estimates made using handback
information indicate the number of cases on which no
information was available or imputed. One must then make the
choice as to whether one wishes to assume that the
3000 people out of 10,000 who did not respond, and were not
similar to other people in terms of the imputation rules,
behaved like the people who did respond; if so, estimates
should be based on
dividing observed numbers by (10,000 - 3000)/10,000 = 0.7.
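
The division by 0.7 in the example can be written out as a small sketch. The function name and the observed count of 700 are hypothetical; only the 3000 out of 10,000 comes from the text.

```python
def inflate_for_missing(observed, total_cases, cases_without_information):
    """Scale an observed number up to the universe, assuming that cases with
    no returned or imputed handback behave like the cases that have one."""
    covered_fraction = (total_cases - cases_without_information) / total_cases
    if covered_fraction <= 0:
        raise ValueError("no cases carry handback information")
    return observed / covered_fraction


# 3000 of 10,000 cases have no handback information, so observed numbers
# are divided by 0.7.
inflated = inflate_for_missing(700, 10_000, 3_000)
```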

One may be interested to note some of the accuracy
estimates obtained for numbers of first
entry visitors to several National Parks. Obviously, the
accuracy in estimates obtained from handbacks is only
presented to show how accuracy becomes poorer when one must
use handback variables (because the sample is smaller). The
percent accuracy figures indicate that the predicted values
have less than a 5% chance of being more in error than the
value given. The numbers which follow provide a convenient
guide to the highest accuracy to be expected. If one wants
to know about the number of first entry U.S. visitors at
Forillon, for example, one is dealing with a group smaller
than the universe on which there are data, so the error in an
estimate of the number of U.S. visitors can be expected to
exceed 6.2%, the entry record accuracy figure given in
Figure 14 (U.S. origin is an entry record variable).
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.ACCURACY IN ESTIMATES OF NUMBERS OF FIRST ENTRY VISITOR 
BASED ON 1973 NATIONAL PARK USER SURVEYS 


Estimates based Entry Handbacks 
on Park Records 


Terra Nova 1.3 5254 
 =KejimkuJjik 202 5-9 
Forittlon 602 23067 
La Mauricie 4.0 11.4 
Point Pelee 3e9 12.3 
Riding Mountain 1-5 11-9 


OTL OLOLOLS: 


CONCLUSIONS 


This paper has touched on a number of design and
weighting considerations in park-user surveys. Still,
numerous matters raised in the sampling of relevant articles
on recreation survey research given in the Appendix have not
been broached. In particular, there has been a focus on a
number of practical considerations that are often not
recognized in survey design. In many park visitor surveys
carried out today, and even in the exceptional surveys where
objectives are related in a useable way to the questions
asked, common problems include:


1. Results are biased in a way that is not
   corrected by weighting and cannot be corrected
   because the necessary data for weighting do not
   exist.


2. Inefficient use of manpower occurs in spite of
   high levels of skills available.


3. Results are improperly or inefficiently
   weighted, when often there exists readily
   available information that can be used to
   improve the accuracy of weighting.


Researchers must start to analyze their needs and find
efficient ways of meeting them. If they do not take into
account some of the points made in this paper, and use some
of the survey techniques suggested as being practical in
given situations, current problems with user research will
continue. Moreover, given the growth in user research in
Canada, both the number of problems and the amount of wasted
resources will multiply.

Many researchers continue to believe that a planning
decision made with some information is better than one made
with none. In many cases the authors could not disagree
more! The use of inaccurate information in decision making,
used as if it were accurate, displays either ignorance or
deceit. The authors, one of whom is a statistician and the
other a planner, endorse intuitive planning decisions where
the data available are so inaccurate that estimates made
using them fall outside the accuracy bounds desired by
policy makers or planners. It is hoped that this paper will
bring closer the time when decision-makers are no longer
willing to accept pretentious survey conclusions or, at
least, pretentious claims to providing useful planning
information when the vast majority of the information
provided by the researchers is either inaccurate or remains
unused, or both.

Researchers recognize the need for information to aid 
planning decisions, but even when they are able to provide 
data promptly they often provide biased information that may 
misguide the planning of a new project, confuse the 
evaluation of a project, or lead to the acceptance of an 
unwanted policy.

In this regard the authors recommend that researchers
conducting user surveys order their planning and research
priorities before collecting more new information of dubious
accuracy at inflated costs. It is possible today to define
objectives in such a way that questions asked in a survey
can meet the needs of the planners requiring the
information. In the 1960's, methodological problems were
necessarily 'solved' by the imperative of making some
planning decision rather than none at all. The constraints
of the 1970's make it necessary, and expertise makes it
possible, to define objectives more rigorously. Once this
has been achieved, it will be possible to accept the
challenge to undertake unbiased research efficiently, and to
produce accurate information while working within manpower,
budgetary and time constraints.


CORD STUDY TECHNICAL NOTE 8


A SCHEME FOR DECOMPOSING 
PARK ATTENDANCE LOADING CURVES 
AND RELATED ANALYSIS METHODOLOGIES 


J. Beaman and S. Smith


ABSTRACT 


The development of a scheme for decomposing park
loading curves (the seasonal trend of daily use) is
described. This scheme allows one to estimate probable daily
use of a park when one has incomplete data by extrapolating
"through" the gap in existing figures. More importantly, the
procedure is a useful addition to existing procedures for
developing use models for parks in that it suggests a way in
which a total predicted use figure for a given activity in a
given park will be distributed on a day-by-day basis.

Park loading curves have been broken down into two
separate curves: (1) a continuous loading curve reflecting
weekday use patterns and (2) a peaked loading curve
reflecting weekend and holiday uses. The method is described.
It is argued that such curves should be developed for
specific types of users and for each park-origin pair when a
model is based on survey data collected during relatively
few days. The application of the procedure to study the
effects of weather on attendance patterns through analysis
of residuals is discussed.


PURPOSE 


The purpose of this technical note is to present a
procedure developed for analyzing park use attendance curves
for daily use of a park for a specific activity.


INTRODUCTION AND OVERVIEW 


Decomposing the daily use curves for parks, here called
"loading curves", into two components, weekday use curves
and weekend (and holiday) use curves, has several possible
applications. A rather simple one arises because the
construction of a curve describing the general nature of the
seasonal trend in attendance at a park can provide the basis
for making reasonable estimates of total attendance figures
for a park based on partial monitoring of use. A more
important and sophisticated application is the use of the
procedure to help increase the specificity and predictive
precision of use models such as the Cesario model described
in TN 4. Cesario's analysis was designed to determine the
factors which contribute to a park's attractiveness and the
factors which contribute to the tendency of a city's
population to use a park (which Cesario called "the
emissivity of an origin"). If one is to learn what these
various park and city factors are which influence a certain
type of park use, it is necessary to begin by stating in
adequately precise behavioural terms what use of a park is
being considered, what origins and population groups are
potential users, and what might be important to them in
choosing a park and activity.

Here there is a need to break down use between weekdays
and weekends. Simply applying models to total use figures
for a park can lead to considerable confusion when the
purpose of analysis centres on the understanding of
emissiveness and attractiveness parameters. This is so
because most sites serve a variety of users. An analysis of
total use figures for a site results in parameters for a
meaningless "average" situation (see Reference 52). If
behaviour is to be accurately analyzed, it is necessary to
disaggregate users of a park by type of use, time of use
(weekend, weekday, holiday, etc.), origin of users and
possibly by several other variables. The problem of
disaggregation is also discussed in TN 30 and in less detail
in a number of other notes, e.g. TN 14, 18, 40.

Consider for purposes of introduction a relatively
isolated campground which is not being used to capacity. If
this site is too distant from most of its potential users
for day use, they must come on a weekend or during a
holiday. The emissiveness of an origin, with respect to that
park, will thus be different for weekdays than for weekends
and holidays. (Beaman and Leicester, see Reference 1,
elaborate on the importance of time-budget constraints. See
also TN 33.) Attendance curves for the use of that park over
a season (loading curves) can be developed to reflect day-
by-day use and will more or less resemble the curves in
Figures 1 and 2. The curve in Figure 1 is more typical of an
isolated park, as is evident from the great use on weekends
in comparison to weekday use. Weekday use is almost
negligible for most of the season. For various reasons the
park with the loading curve in Figure 2 receives much more
use throughout the week; some may be enroute stopover use
(see TN 18) and some main destination camping use such as
that for which Cesario has developed a model (see TN 4).

The basic point here is that whether a person from a
given city goes to a given park on a weekday or a weekend
depends on the amount of time he has available, which varies
between weekdays and weekends and holidays, and depends on
how far he is from the park. Persons from any given city,
for a given activity at a given park, are assumed to behave
according to the same kind of attendance function or
pattern. So an important variable in the study of the
loading curve for a given "city-park relation" is the
relative amplitude of the weekend curve compared to the
weekday curve. This is a function of the distance between
the city and the park, of the attractiveness of the park,
and of the emissivity of the city for the different kinds of
trips. As well, the capacity of a site is a factor to be
considered. So looking again at Figures 1 and 2, two
questions should be asked: (1) what causes the difference in
the overall use pattern of the two parks, and (2) how is the
pattern a reflection of the different perceptions of the
park at different origins?

In Figure 1, for example, the relatively small amount
of daily use throughout the week comes from communities
which are fairly close to the park. The peaks which occur on
weekends are due to an influx of visitors from slightly more
distant communities. The amplitudes for these two separate
components of the various origin-destination use curves can
be used to develop a Cesario-type analysis for weekday and
for weekend use. And, to stress a point already made here as
well as in several other technical notes, such disaggregated
models have greater structural validity and are thus better
predictive devices than an aggregate model applied to data
on a given park for all origins, for use at all times of the
week. Specifically, for example, it would be a mistake to
use the same single-equation model to explain use for the
park for which camping use is shown by Figure 1 (Rowan's
Ravine) as for the park for which data are given in Figure 2
(Meadow Lake). For the majority of the readers who are not
familiar with Saskatchewan parks, it is useful to note that
there is little to recommend Rowan's Ravine for weekday use
from fairly far away. Nor is it the kind of park that one
would typically seek out for a couple of weeks of camping.
It is, however, an acceptable and convenient park for a
quick weekend campout. Meadow Lake Park, on the other hand,
draws and holds people more for extended periods than for
short weekend stays. It is distant from most origins from
which users come.

To further elaborate on the development of a loading
curve analysis procedure, it is necessary to point out that
here the focus is on one type of use, main-destination
camping, which occurs on both weekends and weekdays. The
loading curve for a park serving these types of campers for
a given origin can conceivably be constructed by summing two
separate curves:


(1) A continuous, smooth curve which reflects
    loading by main-destination campers with
    relatively unrestricted time-budgets (this is
    termed the continuous loading curve).


(2) A discontinuous, spiked curve which reflects
    the loading on the park by people with
    restricted time-budgets (this is termed the
    peaked loading curve).


The first curve can intuitively be considered to
represent a seasonal trend in the average weekday use of a
park while the peaked curve reflects attendance on weekends
and holidays. A more precise definition of these curves and
the method for deriving them constitute


FIGURE 1. ROWAN'S RAVINE

It is important to stress that in actual planning and
research problems it is essential that a set of loading
curves be developed for each origin which significantly
contributes to each type of use load of a park to be
considered. The procedures developed and the example worked
out here do not implicitly make this point clear. In fact
the example presented, for reasons explained later,
introduces some confusion.

Admittedly, all the disaggregation argued for in this
note (breakdowns by each park in a system, each activity,
each origin and time of use) will add to the number of
calculations and data tabulations a researcher must handle.
The authors argue, however, that unless extensive experience
and testing prove otherwise, the additional work will pay
off in better analyses being achieved with less data than
would usually be required. Analyses should allow more valid
predictions, and from the insights into the factors affecting
use levels the researcher should be able to give useful
advice to planners and policy-makers.


ESTIMATING LOADING CURVES AND AMPLITUDE PARAMETERS


Estimating the Continuous Loading Curve U(t)


Equation 1 defines the loading curve for a park as a
linear combination of a continuous loading curve U(t) and a
peaked loading curve P(t). The parameters A(j,1) and A(j,2),
each specific to park j, determine the form of the loading
curve of the park. The problem is how to derive U(t), P(t),
A(j,1,o) and A(j,2,o) for each park-origin pair.


$$ Y(j,t,o) = A(j,1,o)\,U(t) + A(j,2,o)\,P(t) \tag{1} $$


WHERE

Y(j,t,o) = participation at park j in the time period t
           from origin o

U(t) = the continuous loading curve

P(t) = the peaked loading curve

A(j,1,o), A(j,2,o) = amplitude factors (A-factors)
           which are calculated for each park-origin pair.


This representation is graphically depicted in Figure vt
which shows the addition of a peaked, discontinuous curve,
P(t), to form the total use curve, Y(j,t).
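To make Equation 1 concrete, the following minimal sketch builds a loading curve for one park-origin pair from a continuous curve and a peaked curve. The function name and the week of sample values are hypothetical and chosen only for illustration; the peaked curve is zero on weekdays, as in the definition of P(t).

```python
def loading_curve(u, p, a1, a2):
    """Return Y(t) = a1 * U(t) + a2 * P(t) for each day t.

    u, p   -- daily values of the continuous and peaked curves
    a1, a2 -- amplitude factors A(j,1,o) and A(j,2,o)
    """
    if len(u) != len(p):
        raise ValueError("U(t) and P(t) must cover the same days")
    return [a1 * ut + a2 * pt for ut, pt in zip(u, p)]


# One hypothetical week, Monday to Sunday
u_week = [1.0, 1.0, 1.1, 1.1, 1.2, 1.2, 1.2]
p_week = [0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 0.9]
y_week = loading_curve(u_week, p_week, a1=50.0, a2=100.0)
```

A park with a large a2 relative to a1 gives a curve of the Rowan's Ravine type, while a small a2 gives the flatter Meadow Lake type.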

The reader may find it desirable to assume that the
bell-shaped continuous curves represent a normal curve with
the mean around July 25 (appropriate variance to be
calculated). However, since "numerical methods" and
computers are to be used in the analysis, no great benefit
accrues from following such an approach. So to determine
U(t), it was originally assumed that the loading curve for a
given park was quite flat in the July 1 to August 15 period.
Curves for different parks could thus be normalized so that
large flows to a single park would not dictate the shape of
the uniform curve, U(t), estimated. However, the results
presented in TN 19 now indicate that the variance in visitor
flows into a park is proportional to the size of that flow
if the total flow is monitored, or to a multiple of the
number of observations made if only part of the use of a
park is monitored. If the concern is with day use, estimated
total use and its variance are given by the formulas listed
below, based on results presented in TN 19.


$$ X = \text{Estimated total daily use} = 2 \times (\text{Observed number of day-use parties for that day}) = 2Y $$

$$ \text{Variance in } X = 4Y $$


If one is considering continuous loading for overnight use
and the figures being analysed are for total campground use:


$$ \text{Variance in the continuous load campground use implied for weekdays} = \frac{\text{Total use for weekday}}{\text{Average length of stay}} $$

This is because total visitors/average length of stay gives
an estimate of the number of entering parties. Regardless,
one can suggest appropriate variance figures so that a
weighted estimate of the uniform function for weekday
continuous use of a park for a given purpose can be derived
by:


where t is a weekday; where the sum is over a
collection of parks for which there are data for the day t;
where the parks are parks that do not work at capacity
on weekdays (or where data on a park are only included for a
given day t when it was operating at less than 90%
capacity); and where N(t) = estimated use of a given type
and Var(t) = variance in the estimated use.
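One way such a weighted estimate could be formed is sketched below. It assumes inverse-variance weights and assumes that each park's use figure has already been normalized to a common scale; both are assumptions made for illustration, as are the function name and the tuple layout. Parks operating at 90% of capacity or more on the day are left out, as described above.

```python
def weighted_continuous_estimate(observations, capacity_cutoff=0.9):
    """Inverse-variance weighted mean of park use for one weekday t.

    observations -- one (n, var, load) tuple per park, where n is the
                    normalized estimated use N(t), var is Var(t), and
                    load is the fraction of capacity in use that day
    Returns None when no park qualifies for the day.
    """
    numerator = 0.0
    denominator = 0.0
    for n, var, load in observations:
        if load >= capacity_cutoff or var <= 0:
            continue  # skip parks at capacity or without a usable variance
        numerator += n / var
        denominator += 1.0 / var
    if denominator == 0.0:
        return None
    return numerator / denominator
```

Observations with a small variance pull the estimate toward themselves, so a single poorly measured park does not dictate the value obtained for the day.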


A variation of this formula was actually used in
computing the results presented subsequently. An "average"
curve was formed by adding up the data for all the parks
considered in a given analysis. This average curve is
generally irregular (due to error factors) and has gaps
caused by leaving out weekend days.

The next procedure carried out was the generating of
averages to fill in values for excluded weekends and for
weekdays for which (by chance) there had been no
observations in any park. Gaps were first filled by
inserting the average of the observation before the gap and
the observation after the gap. Then a smoothed curve was to
be generated. This was to be done using a 5-point running
average with weights 3,577;5,358. But examination of the
computer programme used indicated that the weighted average
was never run. Still, a curve was estimated with values
filled in for weekends, which is the estimate of the
continuous loading curve U(t) used here (see Figure 3). In
intuitive terms the procedure just described is a simple
averaging process.
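The gap-filling step, and the smoothing step that was intended but not run, can be sketched as follows. The function names are hypothetical, and the default smoothing weights are placeholders chosen for illustration rather than the weights intended in the original programme.

```python
def fill_gaps(values):
    """Fill each run of missing days (None) with the average of the
    observation before the gap and the observation after it."""
    filled = list(values)
    n = len(filled)
    i = 0
    while i < n:
        if filled[i] is not None:
            i += 1
            continue
        start = i
        while i < n and filled[i] is None:
            i += 1
        before = filled[start - 1] if start > 0 else None
        after = filled[i] if i < n else None
        neighbours = [v for v in (before, after) if v is not None]
        if neighbours:  # a gap at either end uses the single neighbour
            fill = sum(neighbours) / len(neighbours)
            for k in range(start, i):
                filled[k] = fill
    return filled


def running_average(values, weights=(1, 2, 3, 2, 1)):
    """Weighted running average; weights are renormalized at the ends
    of the series where the window is incomplete."""
    half = len(weights) // 2
    smoothed = []
    for i in range(len(values)):
        num = den = 0.0
        for k, w in enumerate(weights):
            j = i + k - half
            if 0 <= j < len(values):
                num += w * values[j]
                den += w
        smoothed.append(num / den)
    return smoothed


# Weekdays observed, weekend left out (hypothetical values)
weekday_curve = [10.0, 12.0, 11.0, 13.0, 12.0, None, None, 14.0]
u_estimate = running_average(fill_gaps(weekday_curve))
```

Filling before smoothing matters here, since the running average needs a value for every day, including the weekend days that were deliberately excluded.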

In summary, the rationale behind what was done is that
when it is assumed that all parks have a U(t) curve implicit
in their loading curves, one way of estimating U(t) is to
extract it. The average of all curves that contain U(t)'s is
obtained in an attempt to average out "error effects" that
lead to any particular park giving a poor estimate of U(t).
When U(t) has been obtained for non-holiday weekdays it is
possible to use estimated values of U(t) for weekdays to
"extrapolate through" holidays and weekends. This is because
U(t) is considered to be a smooth curve. In other words, use
of a park on a weekend is due to the addition of an extra
load or demand on top of the "average" weekday use. Finally,
even after a U(t) curve has been built up, it may be
expected that weather or other factors have resulted in
irregularities in U(t) that should not occur. These
irregularities can be removed to a certain degree by
smoothing, but obviously a 5-day running average does not
correct for August being a bad month all over Canada in a
given year. This matter is taken up subsequently.


The estimation procedure used to determine A(j,1,o) for
a given park-origin pair employs a least-squares fitting
approach where data for weekdays are fitted to the relation:


$$ \text{Use at } t \text{ from } o \text{ to } j = A(j,1,o)\,U(t) \tag{3} $$


The computations are carried out according to Equation 4
where the weights, W(t,j,o), reflect how much weight one
wishes to put on given observations using the kind of
weighting rules introduced earlier.


$$ A(j,1,o) = \frac{\sum_{t^*} W(t^*,j,o)\,U(t^*)\,Y(j,t^*,o)}{\sum_{t^*} U(t^*)^2\,W(t^*,j,o)} \tag{4} $$


WHERE

A(j,1,o) = amplitude of the continuous loading curve
           for park j

t* = weekdays for which there are data for the
     given park; there may be no flow from o to j
     but this just means Y(j,t*,o) = 0

W(t*,j,o) = weight to correct for varying accuracy of
            the estimate (Y)

U(t*) = the estimated continuous loading curve

Y(j,t*,o) = estimated use for day t*


FIGURE 3A
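Equation 4 is the usual weighted least-squares solution for a single multiplier with no intercept. A minimal sketch follows; the function and argument names are hypothetical, and the weights default to 1 when no weighting rule is supplied.

```python
def fit_amplitude(curve, observed, weights=None):
    """Weighted least-squares amplitude A minimizing
    sum of W(t) * (Y(t) - A * curve(t))**2 over the days supplied.

    curve    -- U(t*) on the weekdays with data
    observed -- Y(j,t*,o) on the same days (0 where there was no flow)
    weights  -- W(t*,j,o); equal weights when omitted
    """
    if weights is None:
        weights = [1.0] * len(curve)
    if not (len(curve) == len(observed) == len(weights)):
        raise ValueError("curve, observed and weights must align day by day")
    numerator = sum(w * c * y for w, c, y in zip(weights, curve, observed))
    denominator = sum(w * c * c for w, c in zip(weights, curve))
    if denominator == 0:
        raise ValueError("curve is zero on every weighted day")
    return numerator / denominator
```

If the observed use is an exact multiple of U(t) on every weekday, the function returns that multiple whatever weights are used; the weights only matter when the observations scatter around the curve.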


Estimating the Peak Loading Curve P(t)


Given the continuous loading curve U(t) and its
amplitude A(j,1) for park j, one may simply subtract A(j,1)
U(t) from the observed loading curve Y(j,t), for weekend days
only, to obtain an estimate of the weekend curve Z(j,t').
This relationship is expressed as follows:


$$ Z(j,t',o) = Y(j,t',o) - A(j,1,o)\,U(t') \tag{5} $$

WHERE t' = a weekend day.


A procedure analogous to the one employed to derive
U(t) can then be used to derive P(t), except that there is
no final smoothing of P(t), since the curve is assumed to
be peaked rather than smooth. The discontinuous function
P(t) is zero, by definition, on non-holiday weekdays (t*).

It may be argued that the assumption of a U(t) curve
moving smoothly through weekends and holidays has
theoretical deficiencies since the increased use on weekends
affects the "true" continuous loading usage. However, the
procedure introduced here is suggested as at least offering
a good first approximation to the behavioural pattern
actually involved in park use. Nevertheless, the peaked
function shown in Figure 3 was derived.


Estimating A(j,2,o), the Amplitude of the
Peaked Loading Curve for Park j


Given that the peaked curve has been determined from
Z(j,t,o) by an averaging procedure like that described for
obtaining U(t), one can use a formula similar to Equation 4,
namely:


$$ A(j,2,o) = \frac{\sum_{t'} W(t',j,o)\,Z(j,t',o)\,P(t')}{\sum_{t'} W(t',j,o)\,P(t')^2} \tag{6} $$


WHERE the summation is over t'.
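Equations 5 and 6 can be sketched together: the weekend residual is formed first, and the peaked amplitude is then fitted to it in the same way as in Equation 4. The function names and sample values are hypothetical.

```python
def weekend_residual(y_weekend, u_weekend, a1):
    """Equation 5: Z(t') = Y(t') - A1 * U(t') on weekend days t'."""
    if len(y_weekend) != len(u_weekend):
        raise ValueError("observed use and U(t) must cover the same days")
    return [y - a1 * u for y, u in zip(y_weekend, u_weekend)]


def peaked_amplitude(z, p, weights=None):
    """Equation 6: weighted least-squares amplitude of P(t') fitted to Z."""
    if weights is None:
        weights = [1.0] * len(z)
    numerator = sum(w * zt * pt for w, zt, pt in zip(weights, z, p))
    denominator = sum(w * pt * pt for w, pt in zip(weights, p))
    if denominator == 0:
        raise ValueError("P(t) is zero on every weighted day")
    return numerator / denominator


# Hypothetical weekend observations for one park-origin pair
z = weekend_residual([150.0, 160.0], [1.0, 1.2], a1=50.0)
a2 = peaked_amplitude(z, [1.0, 1.0])
```

Nothing in the fit prevents a negative result when weekend use falls below the extrapolated weekday curve; a park with essentially no weekend peaking can therefore show a small negative amplitude.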




THE DATA FOR A TEST ESTIMATION
AND THE RESULTS OF THE TEST


The data used in actually estimating the kinds of
functions described above were from the Canadian Outdoor
Recreation Demand Study's 1969 Park Users Survey. Data were
collected in various parks from April to September for a
total of 184 days. Analysis concentrated on main-destination
campers, although of course the procedures described here
can be applied to other types of uses. Information about the
CORD Study Park Users Survey is available in Volume III.

As described there, numerous difficulties were
encountered with the CORD Study Park Users Survey both in
field work and in information processing. Because of data
collection problems, one should consider the estimates
derived here as only illustrative of the procedures
developed and not as valid predictions.

The few negative park-specific peak function amplitude
values in Table 1 can be called zero. One will note that
these values occur for parks that essentially have no
weekend peaking. On the other hand, Birds Hill Provincial
Park in Manitoba and several other parks listed had, in
1969, a weekend use amplitude over 2-1/2 times the weekday
use amplitude.


DISCUSSION


Now that the method for estimating the coefficients and
functions of concern here has been presented, there may
still be serious concerns about the significance of the
results and how to use them. One of the simplest
applications of the results obtained is their use in
estimating the total attendance at a park or components of
this attendance. The area under the uniform curve can be
determined, as can the area under the peaked function
(e.g. by mechanically adding it up using graph paper). When
the areas under these curves are multiplied by amplitude
factors the results are total use estimates for the periods
for which area was computed. This is true when the original
data have been weighted and processed in such a way that
they represent the universe for which totals are to be
generated. All the amplitude factors do is tell a person how
many times the basic area defined by the peak or the uniform
function is included under the function for the particular
park being considered.
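The area calculation amounts to summing the daily values of each curve and scaling by the amplitude factors. A minimal sketch, assuming the curves are tabulated as one value per day so that a plain sum stands in for the area under each curve (the names and values are hypothetical):

```python
def total_use(a1, u, a2, p):
    """Total use over the days covered by u and p:
    A1 * (area under U) + A2 * (area under P), with daily steps."""
    if len(u) != len(p):
        raise ValueError("U(t) and P(t) must cover the same days")
    continuous_part = a1 * sum(u)
    peaked_part = a2 * sum(p)
    return continuous_part + peaked_part


# Hypothetical week: weekday load plus a weekend peak
week_total = total_use(50.0, [1.0] * 7, 100.0, [0, 0, 0, 0, 0, 1.0, 1.0])
```

The two terms can also be reported separately when the weekday and weekend components of attendance are of interest in their own right.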

If, as before, it can be assumed that estimation can be
carried out in such a way that multiples of the uniform and
peaked functions give the actual origin-specific loading
curve for a park, then observations of the amount of use on
certain days define the appropriate amplitudes for the
peaked and uniform functions. Thus there is a reversible
procedure if the uniform and peaked functions are known. One
can use a function that has been obtained to make estimates
of use on particular days on which there were no
observations. However, once one recognizes the possibility
of doing this, one must also consider that if it rains on a
given day, or something else is known about that day such as
cold temperatures, it is possible to infer that an estimate
based on the uniform function would be in error. Some of the
variation around an unconditional predicted value is
understandable in terms of information that is known after
an event. If one knows how weather usually varies, one can
see that a profile can be developed as to how use is likely
to vary, for example, taking into account how weather causes
the expected value to deviate. This latter matter is taken
up subsequently.


TABLE 1


AMPLITUDE FACTORS FOR SELECTED PARKS
FROM DATA COLLECTED DURING
THE 1969 CORD STUDY PARK USERS SURVEY


Province    Name of Park         A1        A2

Quebec      Bon Amie           38.73     63.08
            Mont               50.83     57.05
            Oka                48.72     34.44
            Vincennes          25.33      2.41
            Stone Ham          30.69     17.35
            Matere             17.53      6.19
            Tremblant          53.04     81.92

Manitoba    Bird Hill          51.40    135.37
            Watchorn            5.63     12.49
            Helco Island        6.54     43.03
            Rivers             12.16     53.69
            Grand Valley       24.63     -0.12
            Spruce Woods        6.21     -0.51
            Rainbow Beach      19.80      7.20
            Clear Water         2.37     -0.07
            Wekusko             2.82     -0.08

Alberta     Wabamum            41.23     22.68
            Crimson Lake       48.50    147.218
            Bow Valley         19.09     48.35
            Bragg Creek         4.97     42.05
            Willow Creek        8.03     13.92
            Beauvais Lake       6.79     63.17
            Cypress Hills      34.38     22.43
            Chain Lake         14.53     45.22
            Dutch Creek         5.81     23.15
            Lethbridge         11.73     32.88

B.C.        Bamberton Bee      26.39     30.91
            Golden Ears         7.67    441.65
            Shuswap Lake      109.97     44.16

For now, consider that a researcher is able to 
recognize a day which may be considered a typical week day 
or typical weekend day for the usual summer seasone In the 
context of the preceding paragraph this day can be described 
as a day on which the park will have “expected attendance". 

Now, consider that it is desirable to get estimates of
the expected use of a park based on a minimum use of
information; one can make the conscious choice to collect
data on days where the expected or typical use of a park
will occur. From such use data it is quite conceivable that
the shape of the loading curve, actually just the peak
function amplitude and the continuous loading function
amplitude, can be determined very accurately, say from only
two or three weekend days and two or three weekdays of
observations. One is not concerned with having 20 days of
data collection or some other relatively large number to be
able to counterbalance the effects of bad weather on one
survey day and exceptionally good weather on another data
collection day. Obtaining information about the effect of
weather could be a separate project which could be carried
out with equally small amounts of information to be used to
estimate the parameters in a more generalized model
(introduced subsequently). If parks are not working at
capacity, so that a uniform loading curve appears to be
appropriate for explaining use, it seems clear that
researchers should take advantage of similarities in the use
patterns for different parks that reflect similarities in
behaviour of people from different origins, and minimize the
manpower that (1) is tied up in collecting data and (2) is
subsequently tied up in its processing, etc. The
distribution of flow theory in TN 19 is relevant in
determining how much data collection would be necessary to
expect a certain accuracy in park use tables or other
estimates to be made.

Obviously, one source of systematic error that may occur
when use of a park is estimated using uniform functions and
peaked functions is that the representation (the model) is
incorrect because the park is operating at or near capacity.
Actually, the uniform function for a park could give very
good results while weekend peaks could be so high that
during the middle of the season the capacity of a park is
exceeded, so that people from certain origins do get into
the park and other people from other origins do not: the
people who can get there early from the close origins get in,
while those people who would ordinarily come to the park but
would get there too late to get a camping spot do not come.
However, such problems can be easily recognized as long as
any kinds of decent records allow one to know if a park did
operate at capacity and how frequently it operated at
capacity.

The preceding paragraphs have been explicit in 
suggesting that it may be possible to use the loading curve 
method to define some rather efficient ways to obtain 
information on parks and thereby to make estimates of total 
season use at a minimal coste Still the authors, at this 
point in time, accept that this application may be 
questionablee Yet it is no more questionable in terms of the 
accuracy that will be produced than what is obtained in many 
costly surveys that are now carried out. Recent work at
Parks Canada has confirmed that, even with a very extensive
expenditure of manpower in park visitor surveys, total use
figures for many parks still have a high probability of
being in error by more than 4 or 5 percent. When one begins
to disaggregate information to get origin-destination flows,
user types, etc., it is easy to imagine that the accuracy of
figures produced from surveys involving 5 to 10 thousand
interviews at a park leads to a distressing concern for the
usefulness of survey results. As is indicated in CORD Study
Technical Note 21, there is definitely room for creative
innovations in defining ways to measure park use. Whether
one method or another is appropriate depends on what kind of
use one is trying to monitor and one's objectives in
monitoring it. If one is trying to monitor developed
campground use, then campground registration forms with
origin information are obviously the place to get data.
Large amounts of information can be collected and processed
very cheaply. However, measuring day use of a park presents
drastically different problems, particularly when parks have
numerous entrances and numerous day use areas. In trying to
assess the amount of day use in a park, it is quite plausible
that one of the most accurate approaches to obtaining use
information, and one which can involve the least manpower,
is to collect license plate information on cars entering a
park on particular days with "expected attendance". By also
obtaining license plate information for the vehicles of
people who stay in the campground, it is feasible to identify
day users' vehicles. Incidentally, as part of the visual
recording of information, one can record the number of
occupants in vehicles. So the use of a license plate
technique to get day-use information, in conjunction with
the total or partial enumeration of users of campgrounds,
provides a good alternative to surveys or other methods that
have been used in the past to obtain park use information.
License plate data can be collected for locations other than
park entrances and campgrounds, so researchers not only get
the usual origin-destination information but also profile
information on what visiting parties have done. One test has
also used the selection of plates with certain final numbers
or letters to draw a manageable sample size from a large
universe.
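
The license plate approach described above can be sketched in a few lines. This is only an illustration under stated assumptions: the plate strings, occupant counts, function names and the choice of final characters are all hypothetical and are not taken from any CORD Study procedure.

```python
def day_use_vehicles(entrance_records, campground_plates):
    """Return entrance records whose plate was not recorded in the campground.

    entrance_records: list of (plate, occupants) tuples noted at park entrances.
    campground_plates: plates noted for parties staying in the campground.
    """
    campers = {plate.upper() for plate in campground_plates}
    return [(p, n) for p, n in entrance_records if p.upper() not in campers]


def sample_by_final_character(records, final_chars):
    """Keep only records whose plate ends in one of the chosen characters."""
    chosen = {c.upper() for c in final_chars}
    return [(p, n) for p, n in records if p and p[-1].upper() in chosen]


# Hypothetical observations for one day (plate, number of occupants).
entrance = [("ABC123", 2), ("XYZ907", 4), ("KLM455", 3), ("QRS120", 1)]
campground = {"XYZ907"}

day_users = day_use_vehicles(entrance, campground)
day_use_people = sum(n for _, n in day_users)
sample = sample_by_final_character(day_users, "05")
```

Sampling on the final character of the plate keeps the field rule simple, but whether it yields an unbiased sample depends on how plates are issued, which would need to be checked for the jurisdictions involved.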

But, to return to the main theme of loading curves,
there is the possibility that, for example, weather effects
are being confused with errors resulting from the way the
model is specified. Although some procedures can be used to
remove residuals that appear to be a result of the
deficiencies of the model, this avoids the issue of why the
model fails.

It may be noted that since there can be seasons or
months of adverse weather, care is necessary to avoid
confusing long-term weather patterns with structured error.

The following discussion assumes that the residuals
vary because of the influences of the weather. Consider the
class of models incorporating weather factors C(j,1,t) and
C(j,2,t) defined by:


(7)  Y(j,t) = C(j,1,t) A(j,1) U(t) + C(j,2,t) A(j,2) P(t)
              + EU(t) + EP(t)


WHERE
EU(t) and EP(t) are "error terms" reflecting "natural"
variance of the continuous load and peaked load,
respectively.


Also the conditions below are satisfied:


(7.1)  E(C(j,1,t)) = E(C(j,2,t)) = 1


and


(7.2)  E(EU(t)) = E(EP(t)) = 0

Equation 7 implies that attendance Y(j,t) for a given
park j and for a given day t is a function of continuous
load and peak load, modified by weather conditions.
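
To make the roles of the terms in Equation 7 concrete, the following minimal sketch evaluates its systematic part for a single park and day. The amplitudes, load values and weather factors are invented for illustration, the function name is hypothetical, and the error terms EU(t) and EP(t) are simply left out.

```python
def expected_attendance(c1, a1, u, c2, a2, p):
    """Systematic part of Equation 7 for one park j and one day t.

    c1, c2: weather factors C(j,1,t) and C(j,2,t)
    a1, a2: amplitudes A(j,1) and A(j,2)
    u, p:   continuous load U(t) and peaked load P(t)
    """
    return c1 * a1 * u + c2 * a2 * p


# Hypothetical amplitudes and load values for a peak (weekend) day.
a1, a2 = 400.0, 250.0
u_t, p_t = 0.8, 1.0

# Average weather: both weather factors equal 1 (conditions 7.1).
average_day = expected_attendance(1.0, a1, u_t, 1.0, a2, p_t)

# Poor weather, assumed here to depress the peaked load more strongly.
poor_weather_day = expected_attendance(0.9, a1, u_t, 0.6, a2, p_t)
```

With these invented numbers the poor-weather day falls from 570 to 438 visitors, most of the loss coming from the peaked component.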

The use of Equation 7 raises many issues that are not
immediately obvious. If it is recognized that there can be
good and bad seasons in terms of weather, the A(j,1) and
A(j,2) as determined by the procedure outlined previously
assume an average season (e.g., C(j,i,t) = 1 for i = 1,2) for
the days surveyed. If the season was not average, there is
an "identification problem" since the A(j,i) and C(j,i,t)
become interrelated when the suggested estimation approach
is employed. However, if the C-functions are known or if
simultaneous estimation techniques are employed, the
identification difficulty can be overcome.

Issues also arise regarding the distribution of the
EU(t) and EP(t) error terms. In estimation there should at
least be consideration of the variances of observations and
consideration of problems introduced by serial correlation.

Regarding the C(j,1,t) and C(j,2,t) and their
statistical relationship to the A-factors, one may note that
the C-functions are in a certain sense random variables that
fluctuate about 1.0. One way to express these variables is:

(8)  C(j,i,t) = F(j,i,V(t)),   i = 1,2


WHERE

V(t) refers to weather conditions at time t (see Reference
1).


To a good approximation one may expect that an
estimation problem may be stated as:


(9)  C(j,i,t) = 1 + Σ (B(j,i,k) D(j,i,k))


WHERE the sum is on k
WHERE D(j,i,k) is the deviation, at time t, of weather
variable k from an appropriate site-specific mean based on
an average season.


One may also consider partial derivatives of Equation 8. In
Equation 9 the B(j,i,k) are then assumed to give, at least
to a good approximation, the nature of the "response
surface" relating to behaviour of people in response to
weather in the neighbourhood of the "average point".
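
The linear form of Equation 9 can be illustrated with a short sketch. The two weather variables, their coefficients and the deviations used below are assumptions made purely for illustration; nothing here is an estimate from CORD Study data.

```python
def weather_factor(b, d):
    """Equation 9 for one park j, one load type i and one day t.

    b: coefficients B(j,i,k), one per weather variable k
    d: deviations D(j,i,k) of each weather variable from its
       site-specific mean for an average season
    """
    if len(b) != len(d):
        raise ValueError("one coefficient is needed per weather variable")
    return 1.0 + sum(bk * dk for bk, dk in zip(b, d))


# Hypothetical coefficients for two weather variables:
#   k = 0: temperature, per degree above the seasonal mean
#   k = 1: rainfall, per millimetre above the seasonal mean
b = [0.02, -0.03]

average = weather_factor(b, [0.0, 0.0])     # no deviation, so C = 1
cool_wet = weather_factor(b, [-5.0, 10.0])  # cooler and wetter than average
```

Because the deviations are measured from the average-season means, the factor equals 1 at the "average point", which is consistent with condition 7.1 provided the deviations average out to zero over the season.
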

A couple of closing observations about the difficulty
of studying weather effects are in order. People who have
come a long distance and planned to stop at a given park are
not in a good position to alter their plans at the last
minute because of poor weather. Parks which serve mainly as
main-destination campgrounds for communities within a fairly
short distance would exhibit total loading curves more
directly tied to the weather. Parks which have national
drawing power, however, would have loading curves not as
closely tied to daily variations in weather. This emphasises
the importance of disaggregating use figures by origin or
type of visitor before trying to explain behaviour on the
basis of use figures. From another perspective, people not
only respond to actual weather conditions, but also to their
perceptions and anticipation based primarily on short-term
forecasts of weather conditions. A forecast of a rainy
weekend, when in fact the weather turns out to be sunny and
warm, is a situation where a lower level of use would be
expected than if the forecast had been accurate.

Referring to Figures 1 and 2, the reader will notice a
relative depression of use of both parks in late July and
early August. Meadow Lakes exhibits a more sustained dip in
attendance, reflecting its use as a long-term,
main-destination campground. It is several hundred miles from
the large communities from which users come, and few campers
are likely to drive that distance when the weather has been
bad and is predicted to remain so for some time. Rowan's
Ravine draws campers from 30 to 40 miles away. In this case
the decision to use Rowan's Ravine can be made quickly if
there is a break in the poor weather for a weekend. Without
belabouring this point, it should be clear that the
situation of a park vis-a-vis the origins of potential
campers will affect the use of that park in many diverse and
often subtle ways.


CONCLUSION


Regarding one purpose of this paper, the derivation of
functions for the estimation of weekend and weekday use, one
can see that establishing the kind of peaked functions and
continuous functions for different parks for different
origins gives one insight into the fundamentals of park use.
The amplitudes of these functions are obviously better
objects for analysis than total use estimates. As well, sums
of such functions for various origins for a given park give
a curve that can be used by planners and managers in
estimating the use of a park on a daily basis. These matters
relate to the general need for researchers, planners and
managers to take time to develop an understanding of the
components of park use figures, for it is these, rather than
the gross figures, which are really useful in answering
research, planning and management questions. What is more,
the fact that relatively limited data can be used to develop
a broad understanding of general use patterns at parks makes
the procedures introduced in this note not only conceptually
useful but potentially analytically powerful, in the
statistical sense that they allow one to make efficient use
of information.


CORD STUDY TECHNICAL NOTE 10


ANALYSIS OF CORD STUDY NATIONAL SURVEYS 
ON PARTICIPATION IN OUTDOOR ACTIVITIES 
TO DEFINE (1) CLUSTER OF ACTIVITIES AND 
(2) AGGREGATES OF PEOPLE WITH SIMILAR PARTICIPATION 
PATTERNS: SIMILAR ACTIVITY PACKAGES 


R. Gillespie, J. Beaman, G. Romsa


AUTHORS 


Authorship of this TN has been by a number of people,
and it is for this reason that the following notes are
provided. The material presents two rather disparate points
of view. Therefore the reader should note that the material
on the Burton approach to analysis, which was written by
Gillespie, is not necessarily accepted or condemned by the
people who wrote material for other parts. Beaman wrote
some of the material on cluster analysis and edited
additional material on other writings by Romsa, Currie,
Peebles and White. Beaman takes responsibility for his
interpretation of the writings of these other authors in
editing parts of their report "Recreation Activity Packages
Derived From the 1969 Household Data" into this article.
However, by indicating these persons as authors, their
substantial research work in deriving clusters and reporting
on the derivation of these clusters is acknowledged.
Peebles' work in arranging for the computer analysis to
derive aggregates of people using cluster analysis is
further acknowledged by presenting his discussion of cluster
analysis programs only slightly edited from the way that it
appeared in the original report to Parks Canada.


ABSTRACT 


This TN is a straightforward document. Background
information about two strategies for analyzing National
Survey information is presented. In the one strategy,
proposed by Burton, factor analysis is used to define groups
of activities. The other methodology is one for defining
clusters of people with which activity packages can be
associated. The paper presents the results of applying
these two analysis approaches to the 1969 Canadian Outdoor
Recreation Demand Study National Survey data on people's
participation in outdoor activities. Results of
"clusterings" by the use of the cluster analysis technique
based on the information statistic (2ΔI) are presented. The
results are for Canadian residents based on their
participation in 26 outdoor recreation pursuits. The (2ΔI)
analysis indicates that when individuals were broken into
groups based on the activities in which they participated,
eight activity packages can be determined. In contrast to
the results of the (2ΔI) analysis, factor analysis produced
a number of "clusterings of activities". The clusters found
allowing six factors in an oblique solution when
participation in activities was analysed were described as:
(1) physically active activities requiring little equipment,
(2) physically passive activities with attractiveness of
"general" destination areas but develops a site specific
measure.


PURPOSE 


The purpose of this paper is to present the results
obtained when data on people's participation in activities
are analyzed in two different ways, both of which have been
described as giving useful information about the
interrelationship between recreation activities as these
relate to planning. Some would say that two methodologies
that can be used to study substitutability between
activities are presented.


INTRODUCTION 


The Burton Method of Clustering


A general review need not be presented here of the
issues that arise in relation to the analysis of people's
recreation regarding (1) substitutability, (2) equity of
access to recreation opportunities, (3) the need to consider
groups of people in making projections of future
participation, and some other matters. These are covered in
articles by Burton (Reference 7), Beaman (TN 37), Hendee and
Burdge (Reference 32), Romsa (Reference 49), Beaman and
Lindsay (TN 32) and in literature cited in these papers.
Here, all that is important are some details about the two
methods of studying substitutability actually discussed.
Many of the views of Burton introduced in the following
paragraphs are questioned in other works (TN 32, 37), so
their presentation should not be taken as an endorsement of
using the factor analysis method: the dispute is not over
the general ideas but over the appropriateness of the factor
analysis methodology.

Burton's analysis method may be considered to be
founded on the technique of defining hierarchical groups of
activities based on the participation patterns of
individuals, a technique first introduced by Proctor, then
extended and refined by Burton. (See Reference 7.) Taking
results from 1,056 respondents, Burton applied r-mode factor
analysis to their scores of participation or
non-participation in various activities, thus combining the
activities in question into groups of highly correlated
pursuits. According to this theoretical approach, once the
groups of activities have been established and the
underlying roots or characteristics of the activity group
identified, a "recreation type" has been described. Burton
suggests that all individuals who are closely related by
their participation pattern to a certain group of activities
belong to the recreation type. As an extension of the
process, he suggests that the socio-economic characteristics
of various individuals could be analyzed to reveal any
relationships between certain variables and the recreation
types of individuals.

He maintains that the definition of "recreation types"
is an important step towards overcoming some of the problems
associated with attempts to measure and express recreation
demand in a manner which can be truly useful to the planner
concerned with recreation policy, investment and facility
provision. He also states that the kind of approach to
analysis which he proposes is a basic prerequisite to any
attempt to relate motivation or any other variables (such as
socio-economic variables) directly to activities.


Burton defends the view that, from a practical
perspective, activity groupings derived from the analysis of
participation data are a useful tool for planners
responsible for making decisions related to the supply of
facilities. He states that decision-makers can be provided
with a hierarchical list of activities, with all members of
a group being the most viable substitutes for other group
members. Furthermore, he indicates that by comparing the
groups with an inventory of existing facilities,
deficiencies in the supply for certain "recreation types"
can be recognized, resulting in the provision of a more
satisfactory mix of facilities and activities to accommodate
a wider range of participants.

It is his view that the identification of "recreation
types" may facilitate the prediction of future demand for
various recreation facilities. "If participation in any
particular pursuit can be linked to participation in other
pursuits, and to certain socio-economic characteristics,
then, as the socio-economic characteristics of a given
population change, it should become possible to predict, at
least in general terms, how participation in given pursuits
and groups of pursuits will change." (See Reference 7.) The
use of activity groups is defended because "these groups
serve as a more stable base for forecasting" than individual
activities. This is because participation in a group is less
subject to fluctuation due to change in the influential
factors. If, for example, due to a change in personal
disposable income, an individual changes his participation
from one activity to another, it is highly probable that he
will change to another activity in the same group as the
first. This change, then, would not affect forecasts based on
activity groups as much as it would those based on
individual activities, leaving the activity group method the
more stable of the two. (See similar points made in TN 13,
32.)

Burton describes the activity participation approach to
"recreation types" as valuable in drawing out the critical
factors underlying participation in certain activities.
Analysis of characteristics of campers, for example, will
result in a description of campers but it will do little to
explain why the people are participants in the camping
activity instead of another, and how campers differ
consistently and significantly from participants in other
activities.


It is argued that the use of analytical techniques to
form activity groups identifies the critical characteristics
which make people choose among activities. The groups being
established on the basis of being common choices of many
individuals, the characteristics most prominent in all
activities in a group are those which identify the nature of
the group: that which binds it together and makes it a
choice distinctive from other types of pursuits.

Table 1 is an example of the results obtained by Burton
in applying factor analysis to British participation data.


TABLE 1


SOME RELATIVELY STABLE RECREATION GROUPS


Group I             Group II
------------        --------------------------
Soccer              Roller Skating
Cricket             Ice Skating
Table tennis        Youth Club
Tennis              Horse riding

Group III           Group IV
------------        --------------------------
Rugby               Picnicking
Athletics           Driving in the countryside
Cycling             Gardening
Basketball          Dining out
Keep Fit
Badminton


The Clustering of Individuals: 
A Second Analysis Approach 


The preceding has introduced the reader to one
perspective on how the inter-relationship between activities
can be considered quantitatively so that results obtained
allow one to take into account the inter-relationships
between activities when one plans for facilities, programs,
etc. The perspective concentrates on activities and
intercorrelations between participation rates in different
activities. A different approach to the problem concentrates
on similarity between people. In the remainder of this
paper the work by Burton is referred to as defining clusters
of activities, whereas the use of cluster analysis (as
described below) is referred to as producing groups or
aggregates of people. It is said that these groups are
defined on the basis of the activities in which members of
an aggregate participate.

The conceptual model behind this second perspective is
that people naturally divide into groups on the basis of the
activities in which they participate. The focus of attention
is not on the interrelationship between activities but
rather, in terms of the way some analysis programs work, on
defining which people are alike according to the activities
in which they participate.

From a practical point of view one may consider that if
there is information about an individual that gives the
number of times he participated in each one of a number of
activities during a given year, this person can be compared
with other people. Let us say that, for each activity, the
participation information for a person is categorized to
indicate that he did not participate, participated a little
bit, was a fairly regular participant or was a frequent
participant.

One may proceed to define a way in which a person's
"status" in terms of these categories of participation in
various activities may be used to compute a "distance"
between individuals. Obviously one of the simplest ways is
to give the level of participation categories a value
between 1 and 5 and then simply compute distance by taking
differences between category values for two individuals
on an activity-by-activity basis, squaring these
differences and adding them up as indicated below:


distance X to Y = Σ ((score for individual X for activity a)
                     - (score for individual Y for activity a))²


WHERE summation is over all activities.
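
The distance just defined is easy to compute once each person's participation has been coded. The sketch below is only an illustration: the three individuals, the activities and the coding (1 for no participation up to 4 for frequent participation) are hypothetical.

```python
def squared_distance(x, y):
    """Sum over all activities of the squared difference in category score."""
    if x.keys() != y.keys():
        raise ValueError("both individuals need a score for every activity")
    return sum((x[a] - y[a]) ** 2 for a in x)


# Hypothetical category scores: 1 = did not participate ... 4 = frequent.
person_x = {"swimming": 4, "hunting": 1, "picnics": 3}
person_y = {"swimming": 2, "hunting": 1, "picnics": 4}
person_z = {"swimming": 1, "hunting": 4, "picnics": 1}

d_xy = squared_distance(person_x, person_y)  # (4-2)^2 + 0 + (3-4)^2 = 5
d_xz = squared_distance(person_x, person_z)  # 9 + 9 + 4 = 22
```

On this measure X is much "closer" to Y than to Z, so a clustering procedure would tend to place X and Y in the same aggregate.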


Though there are numerous reasons to criticize such an
ad hoc approach to defining a distance function (or metric),
the reader is asked to consider that something like this is
what could be done in defining distances between
individuals. Then, with a measure of distance computed
between every pair of individuals, the clustering of
individuals problem becomes putting those individuals who
are "close" together into groups. This is what one may
visualize cluster analysis programs as being about (the
reader interested in a more sophisticated discussion should
see the appendix to this Note and the literature cited
there).

When a person has either manually (see Reference 28) or
by a computer sorted out the people in a universe into
clusters (aggregates), one can see from the way that the
clustering has been done that clusters are aggregates of
people who are relatively homogeneous in their behaviour in
terms of the activities in which they participate. Of
course, this homogeneity is relative to the way that a
particular distance function was defined and to the
resulting distances between individuals. When using this
type of clustering, rather than obtaining a group of
intercorrelated activities (activities with a high factor
loading on a given factor) as the first analysis output,
cluster analysis as described here produces groups of people
with whom a collection of activities may be associated. It
is this collection of activities in which the people in a
particular aggregate participate that here is called the
activity package of the people in that aggregate.


THE DATA 


The data used in this study were the 1969 CORD Study
data on Canadians' participation in outdoor activities.
Details on the data collection, editing, etc. are given in
the CORD Study Data Documentation Volume. In summary, 3000
interviews were carried out with people resident in Canada.
The data shown in Table 1 were collected with information on
participation for the year preceding November 1969, which
is when the interviews were carried out.

It would be nice to be able to claim that the sample
was a random sample of Canadians 18 and over, but this is
not quite the case. Sampling proceeded in such a way as to
yield a random sample until an exact interview location was
selected. The manner of choosing an interview location and
the manner of selecting individuals after that (including
the use of replacements for non-response, sampling
quotas, and not-at-home weighting) result in a sample that,
at least technically, is not a random sample. Still,
comparison of weighted survey results with the census shows
good agreement on comparable variables.

One paper has been written (see TN 24) commenting on
the reliability and validity of the data collected in the
survey of 1969 and similar surveys in 1967 and 1972. The
reader may find the results presented there to be of some
interest as they show possible large discrepancies between
independent studies. One may also wish to compare the
similar survey of 1972 with Ontario Household Survey (see
Reference 44) results. This shows good agreement with a
much better designed and executed survey.


DEFINITION OF ACTIVITY PACKAGES USING FACTOR ANALYSIS 


Factor analysis can be used in data analysis for a
variety of purposes, but in this application the object is
to identify an underlying pattern of relationships suggested
by the correlations between a collection of variables.
Veldman (see Reference 59) explained how, "in some
situations, factor analysis may be considered a procedure
for exposing the essential, determining constructs behind a
set of observable behaviours", but here a factor suggests an
abstract "form" of recreation behaviour (see Reference 4).

The explanation of the mathematical principles and
operations on which factor analysis is based is a fairly
complex task, and it is not really appropriate or feasible
in this paper (see Reference 29). However, it is necessary
to comment on the technique of rotation. The factors
extracted initially by most factor analysis programs are
orthogonal: that is, they are uncorrelated or, represented
graphically, they are at ninety degrees to each other. The
first factor defines the most important underlying dimension
(in terms of the percentage of variance explained); the
second factor is the next most important dimension
orthogonal to the first, and so on. However, the resulting
structure of factors is not statistically unique and can be
transformed, or rotated, into many equivalent definitions of
the underlying dimensions, all of which explain just as much
variance. In fact, sometimes it is desirable to depart from
an orthogonal solution to an oblique one. Such a solution
often has factors which are composed of more closely
correlated variables, although the factors are no longer
totally independent from each other.
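
The statement that rotated solutions explain just as much variance can be checked numerically for the simplest case, an orthogonal rotation of two factors. The loadings below are hypothetical and the helper names are assumptions made for this sketch; oblique rotation, where the factors become correlated, is not covered here.

```python
import math


def rotate(loadings, degrees):
    """Rotate a two-factor loading matrix through the given angle."""
    t = math.radians(degrees)
    c, s = math.cos(t), math.sin(t)
    return [(a * c + b * s, -a * s + b * c) for a, b in loadings]


def communalities(loadings):
    """Variance of each variable accounted for by the two factors."""
    return [a * a + b * b for a, b in loadings]


# Hypothetical unrotated loadings of four activities on two factors.
unrotated = [(0.6, 0.5), (0.7, 0.4), (0.5, -0.6), (0.4, -0.7)]
rotated = rotate(unrotated, 45)

explained_before = sum(communalities(unrotated))
explained_after = sum(communalities(rotated))
```

The individual loadings change with the rotation, which is what makes a rotated solution easier or harder to interpret, while each activity's communality and the total variance explained stay the same.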

When the factor analysis procedure proposed by Burton
was applied to the Canadian Outdoor Recreation Demand Study
data described above, activity packages such as those
presented in Tables 2 through 7 were derived. The selected
results summarized here represent 5 and 6 factor orthogonal
solutions to the 26 activity set (Tables 2 and 3), a
6-factor obliquely rotated solution based on 26 activities
(Table 4), and a 16-factor orthogonal solution based on 78
activities representing different levels of participation in
the original 26 activities (Table 6). In the 78-activity
set, for example, "swimming No. 1" represents the "0-5"
level of frequency of participation while "swimming No. 2"
represents the "6-10" level of frequency of participation,
and "swimming No. 3" represents the "more than 10" level.


TABLE 2


Factor Analysis Groups Based Upon 26 Activities and 5 Orthogonally Rotated Factors


Factor I (Group I)              Factor Loading

Sightseeing
Other Park Use                      .61
Historic Sites Use
Picnics                             .47
Pleasure Driving                    .41
Swimming                            .30
Photography                         .28
Snowmobiling                       -.03
Snow Skiing                        -.01
Hunting


Factor II (Group II)

Water Skiing
Snow Skiing                         .47
Power Boating                       .46
Canoeing                            .41
Tennis                              .40
Sailing
Golfing
Swimming
Horseback Riding
Pickup Camping
Picnics
(2)
(2)


Factor III (Group III)

Ice Skating
Snow Sledding                       .49
Bicycling                           .46
Swimming
Horseback Riding
Sightseeing
Nature Study
Sailing


Factor IV (Group IV)

Snowmobiling                        .43
Hunting                             .32
Tent Camping                        .25


Factor V (Group V)

Nature Study
Pleasure Walking                    .35
Climbing                            .34
Photography
Ice Skating
Water Skiing                       -.04
Snowmobiling                       -.02
Power Boating


Notes: Weakly or negatively associated activities below double lines.
Factor loading rank in parentheses for activities listed in more than one group.

The method of determining which groups an activity
should belong to was necessarily fairly subjective. In the
formation of activity packages, each variable or activity
was placed in the group (factor) on which it had its highest
loading. Not all activities, however, loaded strongly on
only one factor. Because some activities are associated with
the underlying dimensions of more than one activity group,
they loaded moderately on the factors for all these
particular groups. In order to give recognition to all the
dimensions of all activities, those which are "closely"
related to more than one factor were placed in all those
groups. In the tables, activities with multiple group
membership have the rank of their loading indicated in
parentheses. For example, see Swimming in groups I, II and
III in Table 4. In rare cases, where an activity loaded very
high on a factor to the point where it would be an important
group member, but loaded much higher still on another
factor, it was placed in both groups. See, for example,
Power Boating in groups III and IV in Table 4.
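
The placement rule described above might be sketched as follows. Because the assignment was, as noted, fairly subjective, the cut-off `close` used here to decide when a loading counts as "closely" related is purely an assumption, as are the activity names and loadings. The rare case of an activity admitted to a second group despite a much higher loading elsewhere is not covered.

```python
def assign_groups(loadings, close=0.05):
    """Place each activity in the factor of its highest loading and in any
    other factor whose loading is within `close` of that highest loading.

    Returns {activity: [(factor, rank), ...]}, rank 1 being the highest loading.
    """
    result = {}
    for activity, row in loadings.items():
        ranked = sorted(row.items(), key=lambda kv: kv[1], reverse=True)
        top = ranked[0][1]
        result[activity] = [
            (factor, rank)
            for rank, (factor, value) in enumerate(ranked, start=1)
            if top - value <= close
        ]
    return result


# Hypothetical loadings of two activities on three factors.
loadings = {
    "Activity A": {"I": 0.30, "II": 0.28, "III": 0.27},
    "Activity B": {"I": 0.05, "II": 0.40, "III": 0.10},
}
groups = assign_groups(loadings)
```

Here Activity A would be listed in all three groups with ranks (1), (2) and (3), while Activity B belongs to group II only.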

Just as the highest loading activities on each factor
can be used to describe the characteristics of that group,
so can the lowest loading activities. These most weakly
associated or even negatively associated activities may be
thought of as not possessing the characteristics represented
by the activities included in the factor or, if loaded
negatively, as representing the antithesis of what the
people who score positively on the factor like. In the case
of the 78-activity set described subsequently, where each of
the original 26 activities is represented at three different
levels of participation, each individual can only
participate at one level (3 x 26 = 78). If one is recorded
as participating at the Swimming-3 level, for example, he
cannot be a participant in Swimming-1 or Swimming-2.
Therefore, for any given activity of the 78, the most
negatively associated activities will be the other two
levels of the same activity. In interpreting the group
characteristics through the lowest or negatively loaded
activities, the other levels of any group member are,
therefore, ignored.
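
The reason the other levels of the same activity are always negatively associated can be seen in a small sketch. The level boundaries follow the "0-5", "6-10" and "more than 10" levels given for the 78-activity set; the six yearly frequencies and the function names are hypothetical.

```python
def level_indicators(times):
    """Return 0/1 indicators for the levels "0-5", "6-10", "more than 10"."""
    if times <= 5:
        level = 1
    elif times <= 10:
        level = 2
    else:
        level = 3
    return [1 if level == k else 0 for k in (1, 2, 3)]


def correlation(x, y):
    """Pearson correlation of two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


# Hypothetical yearly swimming frequencies for six respondents.
frequencies = [0, 3, 7, 12, 20, 8]
rows = [level_indicators(f) for f in frequencies]
swimming_1 = [r[0] for r in rows]
swimming_3 = [r[2] for r in rows]
r_13 = correlation(swimming_1, swimming_3)
```

Since every respondent has exactly one indicator set to 1, a person counted in Swimming-1 can never be counted in Swimming-3, and the correlation between the two is necessarily negative whenever both levels occur in the sample.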

As stated above, many alternative "solutions" to the
problem of extracting the underlying dimensions in the data
were developed for both the 26 and 78 activity data sets.
Only a summary of results is presented in this paper. The
choices of the type of analysis applied (orthogonal or
oblique), and the number of factors in the accepted
solution, were based on an attempt to derive activity
groupings which were both statistically viable and
analytically meaningful.

To determine the most acceptable solution, then, for
each activity set, all solutions for that set were compared
on the basis of four tests. First, the "reasonableness" of
each activity grouping in the solution was considered to
determine in which solution the activities of each group
were most closely related in terms of associated
characteristics such as cost of participation and
relationship to nature. The second test, which was also
subjective, involved a consideration of which solution was
most representative of all the solutions for the particular
activity set.

The final two tests concerned the statistical
properties of the solutions. Test three was a consideration
of the eigenvalue of each factor in a solution, or the
amount of variance explained by a factor relative to the
original variables. A factor with an eigenvalue of less than
1.0 has weaker power of explanation than one of the original
variables. The final test was the percentage of total
variance explained by the various solutions.

Of all the different factor analyses performed on the
26 activity set, the obliquely rotated 6 factor solution
(Table 4) was the most acceptable. Strictly on the basis of
reasonable groupings, all solutions were fairly acceptable.
Indeed, the groupings were so stable throughout the various
solutions that there were only minor distinctions between
solutions on both criteria of reasonableness and
representativeness. The 5 and 6 factor orthogonal solutions
(Tables 2 and 3) were almost identical except for the
combination, almost intact, of two groupings. Because both
groups consistently appeared separately through all
solutions of more than 5 factors, a 6 factor solution was
considered preferable. In a 7 factor orthogonal solution
(not shown in this paper) the groupings again remained
almost unchanged. The extra factor extracted was very
similar to one of the other factors, and was, therefore,
deemed unnecessary. A 7 factor oblique solution (not shown
in this paper) only saw increased similarity between the two
factors, making them more redundant and reinforcing the
acceptance of a 6 factor solution.

The two final tests supported, in their consideration
of variance, the acceptance of a 6 factor solution. Only the
first 6 factors had an eigenvalue equal to or greater than
1.0. The acceptance of only a 5 factor solution would mean
ignoring an important factor, while the 7 factor solution
involved consideration of one factor which had less valuable
powers of explanation than each of the original 26
variables. In terms of total variance of the original data
explained, the 6 factor solutions revealed 44.3%, the 5
factor solution 40.2% and the 7 factor solution 48%.
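
The two variance tests can be cross-checked against each other with a little arithmetic. The sketch below assumes that the analysis was run on a correlation matrix of the 26 activities, so that total variance is 26, and that the quoted percentages (read as 40.2%, 44.3% and 48% for five, six and seven factors) are cumulative shares for the unrotated factors. Both are assumptions, and the 48% figure is evidently rounded.

```python
N_VARIABLES = 26  # activities analysed

# Cumulative percentage of total variance explained, by number of factors.
cumulative_percent = {5: 40.2, 6: 44.3, 7: 48.0}


def implied_eigenvalue(k):
    """Eigenvalue of factor k implied by the cumulative percentages."""
    share = cumulative_percent[k] - cumulative_percent[k - 1]
    return share / 100.0 * N_VARIABLES


sixth = implied_eigenvalue(6)    # about 1.07, above the 1.0 cut-off
seventh = implied_eigenvalue(7)  # about 0.96, below the 1.0 cut-off
```

Under these assumptions the percentages are consistent with the statement that only the first six factors reach an eigenvalue of 1.0.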

Little change in the groupings resulted from the
application of an oblique rotation to the six factors,
although their order, or relative weight in the solution,
was changed. Factor III became the first and most powerful
factor, with Factors I and II becoming II and III
respectively. Horseback riding was dropped from Factor II of
the orthogonal solution, but remained with Factor III.
Tennis was transferred from Factor II to Factor III. The
matrix of correlations among factors in the oblique solution
(Table 5) shows fairly low correlations in general, the
largest being between the two factors between which the
exchange of activities took place, indicating that the
exchange had little impact as far as changing the essential
nature of the factors is concerned.

In the oblique solution (Table 4), hunting is
associated to much the same degree with three factors, III,
IV and VI, rather than only Factor IV of the orthogonal
solution. Swimming is also associated with three factors to
a similar degree, an indication that there are distinctly
different motivations for participation in swimming, as well
as hunting, which are revealed by the descriptions (which
follow shortly) of the different factors with which the
activity is associated. The hunting in Group III, for
example, may be motivated by the physical activeness of
hunting, while the hunting in Group IV may be more
associated with the desire to establish mastery over nature,
and the hunters in Group VI may be attracted by the camping
and "outdoorsmanship" elements of hunting.

An oblique solution is more realistic for the purpose
of identifying recreation types as it does not assume that
the factors are completely unrelated to one another and,
since there were in this case only a few logical differences
from the orthogonal solution, the oblique, 6-factor solution
groupings were accepted as the best explanation of the
underlying relationships in participation patterns. The
next step was to consider the activities in each group to
determine their common characteristics (see Table 4).

Group I includes physically active winter activities
requiring a minimum of facilities and equipment, and
physically active summer activities. All except tennis can
be pursued on an individual basis but can also involve
groups of individual participants. Very little organization
is required and cost is minimal, although tennis and
horseback riding can, in some cases, require both
organization and money. All activities are enhanced by some
degree of skill and can be pursued at a competitive level,
excepting possibly snow sledding-tobogganing-sleighing. The
lowest scoring activities are characterized by their passive
nature.

A physically passive dimension emerges in Group II. All
activities but swimming and outdoor photography involve
travel as an essential part of the pursuit and, as a
result, tend slightly towards a rural setting. All but
swimming are directly related to simple appreciation of both
natural and man-made environments. Only outdoor photography
tends to be expensive and requires any degree of skill.
These characteristics are reinforced strongly by the lowest
loaded activities, which are very active, expensive and
skill-oriented.

Group III's activities are highly physically active, as
emphasized by the weakly associated pursuits: picnicking and
pleasure driving, for example. The activities are water
based, except for snow skiing, and facility oriented. All
except swimming are expensive, require a great deal of
skill, and involve personal risk and danger to at least some
degree, with the added exception of golfing. All involve
individual participation, but usually in a group setting.

Group IV is characterized by a need for freedom or 
TABLE 3

FACTOR ANALYSIS GROUPS BASED UPON 78 ACTIVITIES
AND 16 ORTHOGONALLY ROTATED FACTORS

                                      Factor
                                      Loading

Factor XII (Group XII)
    Photography-2                     [illegible]
    Pickup Camping-[illegible]        [illegible]
    Water Skiing-3                    [illegible]
    Canoeing-3                        [illegible]
    Horseback Riding-[illegible]      [illegible]

Factor XIII (Group XIII)
    Pickup Camping-2                  .24
    Tent Camping-2                    [illegible]
    Swimming-1                        [illegible]
    Trailer Camping-1                 [illegible]
    Pickup Camping-3                  .10
    Water Skiing-2                    [illegible]
    Bicycling-3                       [illegible]
    Pleasure Walking-3                [illegible]
    Climbing-1                        [illegible]
    Ice Skating-2                     -.09
    Golfing-3                         [illegible]

Factor XIV (Group XIV)
    Playing Tennis-3                  .34
    Snow Skiing-3                     [illegible]
    Sailing-1                         [illegible]
    Golfing-3                         [illegible]
    Golfing-2                         [illegible]
    Nature Study-2                    [illegible]
    Swimming-2                        [illegible]
    Pleasure Driving-3                [illegible]

Factor XV (Group XV)
    Tent Camping-1                    [illegible]
    Trailer Camping-1                 [illegible]
    Pickup Camping-1                  [illegible]
    Snowmobiling-3                    [illegible]
    [illegible entry]                 [illegible]
    [illegible entry]                 [illegible]
    Photography-2                     -.07
    [illegible entry]                 [illegible]
    Nature Study-2                    -.06

Factor XVI (Group XVI)
    No members

NOTES: Weakly or negatively associated activities below double lines.
       Factor loading rank in parentheses for activities listed in
       more than one group.
       Appended numbers 1 to 3 indicate depth of participation
       categories.
       See text explanation of depth categories.

mobility in the outdoors combined to some extent with a
certain element or feeling of man conquering, mastering or
rising above nature. This is reinforced by the presence of
activities such as sailing, climbing and park visitation
that involve harmony with nature as negatively associated
activities. The group activities are physically active,
expensive and require skill. All involve an element of speed
and/or danger, and take place in a rural setting.

Appreciation of and harmony with nature are common to
Group V. Hunting and snowmobiling are among the lowest
loaded activities on this factor. There is little
requirement for facilities or equipment other than a camera.
The activities can be physically active or passive and
generally involve an element of both. They also tend to
be individual pursuits requiring some degree of skill or
knowledge and are heavily concentrated in the rural
environment.

Group VI involves travel and outdoorsmanship. All
pursuits are active to some degree, and require some
knowledge and skill. Facilities and equipment are
prerequisites. All pursuits occur in a rural setting and
tend to be group activities, although individuals can
participate alone. The presence of bird watching-nature
study, hiking-walking for pleasure, and sailing as the least
related activities may indicate that the majority of campers
are motivated by factors other than the appreciation of
nature, as one might have expected. Possibly, the need for
cheap accommodation on holidays, and mobility, are most
important in the growth of camping.

For the purpose of monitoring the effects of depth of
participation on the groupings, four factor analyses in total
were applied to the 78-activity set, yielding orthogonally
rotated solutions of 10, 16 and 18 factors, and an obliquely
rotated solution with 14 factors. Only the 16-factor
solution is presented in this paper. Both the 14- and
16-factor solutions had a tenth factor which measured a
dimension not easily described by the activities. No
activities in the 16-factor case and only one in the
14-factor case loaded most highly on this factor. The final,
sixteenth factor also measured a dimension to which none of
the activities was most directly related, so the 16-factor
solution resulted in only 14 actual groupings. The 16-factor
solution was also a good representation of all 4 solutions.
Since we ignore Factor XVI of the 16-factor solution,
there is an explanation of somewhat less than 36.5% of the
total variation. The following is a discussion of the 14
groups produced by the 16-factor solution (refer to Table
6).
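
As a rough illustration of how a 78-activity set can arise from
26 activities, the sketch below expands each activity into three
0/1 variables, one per depth category, using the "-1", "-2",
"-3" suffixes that appear in the tables. The cut points are
assumptions: the text places the moderate/heavy division at ten
occasions, and the light/moderate division at five is inferred
here from the remark about golfers; the function names and the
sample respondent are hypothetical.

```python
from typing import Dict

# Assumed cut points (occasions per year); not confirmed by the source.
LIGHT_MAX = 5       # 1 to 5 occasions: light
MODERATE_MAX = 10   # 6 to 10 occasions: moderate; more than 10: heavy

def depth_category(occasions: int) -> int:
    """Return 0 (non-participant), 1 (light), 2 (moderate) or 3 (heavy)."""
    if occasions <= 0:
        return 0
    if occasions <= LIGHT_MAX:
        return 1
    if occasions <= MODERATE_MAX:
        return 2
    return 3

def expand_to_depth_variables(occasions_by_activity: Dict[str, int]) -> Dict[str, int]:
    """Turn each activity into three 0/1 variables named like 'Golfing-2'."""
    expanded = {}
    for activity, occasions in occasions_by_activity.items():
        category = depth_category(occasions)
        for depth in (1, 2, 3):
            expanded[f"{activity}-{depth}"] = int(category == depth)
    return expanded

# Hypothetical respondent: occasions of participation in four activities.
respondent = {"Golfing": 7, "Swimming": 2, "Sailing": 0, "Snowmobiling": 25}
variables = expand_to_depth_variables(respondent)
print(variables)
```

With 26 activities this coding yields 26 x 3 = 78 variables, and
a respondent scores 1 on at most one of the three variables for
any given activity.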


Group I is characterized by general light participation
in a large number of activities, all of which are very active
in the physical sense and require some skill. All of the
solutions of the 78-activity set produced a somewhat similar
group.

Heavy participation in the passive appreciation
activities comprises Group II. All activities are generally
cheap with the possible exception of photography, and none
involve risk or danger. The negatively associated
activities, heavy participation in ice skating and
snowmobiling, are both active and can involve risk.

Group III also involves heavy participation, but in
active, non-water-based activities requiring a fair amount
of skill. Light participation in pleasure driving and
moderate participation in active water-based activities
score negatively.

Another more general light participation group emerges
as Group IV, this time of passive appreciation activities.
Some knowledge, but generally little skill, is required by
these pursuits, which occur in a rural environment and
include travel as an integral part of the activity. Group V
has only two activities but is an indication of a very
strong dimension of moderate, highly travel-related
passiveness. This nature is emphasized by the negative
loading of heavy participation in the specific-purpose
activity of visiting historical sites, and heavy
participation in very active and sometimes risky water
skiing and ice skating.


Heavy participation in active, water-based activities
requiring skill is measured in Group VI. These activities,
sometimes involving the elements of speed and danger, are
countered by the very passive nature of the negatively
related pastimes.

Group VII's only member, extensive pleasure driving, is
another indication of the powerful automobile-related
passive factor in Canadian recreation. Those who do a great
deal of pleasure driving tend not to participate heavily in
active pursuits such as tennis and snow skiing, which score
negatively on this factor.

Heavy participation in a movement-into-the-outdoors
factor is indicated by Group VIII. Travel is an integral
part of these activities, which generally require some
equipment and can be active, but are not usually physically
demanding. The negatively associated activities are active
and sometimes involve speed and personal risk.

Group IX sees a recurrence of the active water-related
activities, this time at a moderate level of participation.
Snow skiing still is a member of this group because of its
active and speed factors, and hunting and climbing are
related, probably by their active nature.

Factor X does not produce a group as there are no
highly related activities for it. Group XI is defined by the
moderate level of participation in the active,
non-water-based activities. The group is quite similar to
those formed by the other levels of participation in the
same type of activities: Groups I and III.

Moderate participation in passive-appreciation pursuits
forms Group XII. Sightseeing and pleasure driving, which are
usually in this type, have combined to produce the
travel-based passive Group V.

Group XIII sees emergence again of the
camping-outdoorsman factor, this time at the moderate
participation level. It is interesting that camping with a
pick-up camper is a member of this group at both moderate and
heavy levels of participation. This occurrence indicates that
there is little motivational difference behind moderate and
heavy pick-up camping.

Active, skill-requiring activities comprise Group XIV.
All are at the moderate or heavy participation level except
sailing, which is also the only water-based activity.
Perhaps the availability of accessible water facilities
limits participation in sailing, which remains attractive to
this recreation type. The presence of golfing at both
moderate and heavy levels may indicate that, generally,
there is a common set of attractions for golfers who
participate more than five times per year.

Group XV is the light participation camping-outdoorsman
group, but is joined by two activities that are physically
active and at the heavy level of participation. This
combination may result from a relationship between
occasional camping and heavy participation in snowmobiling
and hunting, i.e., camping may sometimes be a part of the
latter activities. Factor XVI, like Factor X, does not
produce a viable group.

Several interesting characteristics of Canadian
participation patterns have emerged from the analyses based
on depth of participation. The most striking, perhaps, is
that Canadians tend to confine their participation in all
their activities to the same level; those who participate
heavily in certain activities tend to participate heavily in
all their activities. This trend is revealed by the fact
that, with only a few exceptions, groups were formed by the
same level of participation in the member activities.

Depth of participation in activities would seem to
depend more on the individual's general desire to
participate than on the nature of the activities themselves.
While there certainly must be motivational differences
between marginal and heavy participation in an activity,
these differences seem to apply to all activities within the
individual's range of participation. That is, those
motivations which prompt a person to participate extensively
in one activity also result in his extensive participation
in his whole range of activities. Thus activity groupings
remain very similar at all levels of participation.

Only a few activities were placed at one level in a
group comprised of a different level of participation. This
placement resulted from various causes, including the dual
nature of some activities such as moderate climbing, whose
characteristics are illustrated by its membership in both
the active Group IX and the passive appreciation Group II. A
basic difference between one activity and a group of
otherwise highly related pursuits may be a second cause of
the grouping of activities at different participation
levels. The inclusion of only light participation in sailing
in Group XIV, possibly due to its water requirement, has
been mentioned as an example. A third cause may be the
occasional participation in some activities in the course of
pursuing others, as has been mentioned for snowmobiling,
hunting, and camping in Group XV. Finally, a fourth cause
may be the arbitrary delineation of the different levels of
participation. Golfing and camping with a pick-up camper
have been mentioned as activities for which the division
point of ten occasions of participation between moderate and
heavy participants may not be valid.

While Canadians seem to be either light, moderate, or
heavy participants in their whole range of activities rather
than light participants in some of their group of pursuits
and heavy participants in others, they also seem to be very
dependent on passive recreation relying on the use of the
automobile. Of the 14 viable groups resulting from the
16-factor analysis, several possessed an element of travel,
and two (Groups V and VII) were formed only by moderate
pleasure driving and sightseeing, and heavy pleasure driving,
respectively.

In all of the analyses of the 78-activity set, the
first factor, and therefore the strongest dimension of
Canadian outdoor recreation participation, was light
participation in a broad range of activities. The members of
this group varied slightly from analysis to analysis, but
generally they were active outdoor recreation pursuits.

In an attempt to determine the stability of the
groupings based on depth of activity, the groupings produced
by all four solutions involving 78 activities were compared.
Twelve groups were formed by members that were associated
with each other in at least three of the four analyses. They
are listed in Table 7.


DETERMINING AGGREGATES OF PEOPLE: CLUSTER ANALYSIS RESULTS


One of the easiest ways to understand how a particular
cluster analysis proceeded and how to interpret the results
is to look at Figure 1 and to see how the sequence of events
depicted relates to the actual cluster description presented
in Table 8. The reader who is knowledgeable about cluster
analysis will see from the "tree" in Figure 1 that the
clusters were formed by what is described in the Appendix as
a monothetic divisive algorithm. This was done because the
use of this algorithm was relatively cheap and in that
respect appropriate for a preliminary study. However, as
indicated later, there is good reason to believe that an
alternative algorithm should be used in subsequent studies,
if only to avoid the biases that arise when terminal
clusters become a function of the sequence of division that
is carried out.

From Figure 1, one sees that the initial split of the
universe of 1100 people (used because of available core
storage in the computer) was on swimming. By breaking the
population into two groups on the basis of whether they did
or did not participate in swimming, it was possible to get
two more homogeneous groups of people: more homogeneous in
terms of the activities in which they did or did not
participate. A group of people who did not swim (not
swimming is shown by a "No" on the right-hand branch in the
figure) was further divided on the basis of whether they did
or did not go picnicking. As shown by the "maximum 2¢I" axis,
the splitting on picnicking further contributed to defining
homogeneous groups. Other splits were made on the basis of
visiting historic sites, driving for pleasure, sightseeing,
walking and water-skiing, and can be seen from the figure.

FIGURE 1

NATIONAL ACTIVITY PACKAGES BASED ON 1969 CORD STUDY NATIONAL SURVEY DATA
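
The sequence of splits described above can be sketched in a few
lines. This is only an illustration under stated assumptions: it
uses the summed chi-square association between one activity and
all others as the split criterion (the measure the Appendix
attributes to Williams and Lambert), which may not be the
criterion plotted on the axis of Figure 1. The function names,
the stopping rule `min_size` and the simulated participation
data are hypothetical.

```python
import numpy as np

def chi_square_2x2(a, b):
    """Chi-square association between two 0/1 vectors (0.0 when undefined)."""
    n = len(a)
    n11 = np.sum((a == 1) & (b == 1))
    n10 = np.sum((a == 1) & (b == 0))
    n01 = np.sum((a == 0) & (b == 1))
    n00 = np.sum((a == 0) & (b == 0))
    denom = (n11 + n10) * (n01 + n00) * (n11 + n01) * (n10 + n00)
    if denom == 0:                      # one activity is constant in this group
        return 0.0
    return float(n * (n11 * n00 - n10 * n01) ** 2 / denom)

def best_divisor(data):
    """Index and score of the activity with the largest summed chi-square."""
    n_attr = data.shape[1]
    totals = [
        sum(chi_square_2x2(data[:, j], data[:, k]) for k in range(n_attr) if k != j)
        for j in range(n_attr)
    ]
    return int(np.argmax(totals)), max(totals)

def divide(data, names, min_size=20, path=()):
    """Split recursively on presence/absence; return (path, size) per terminal cluster."""
    if len(data) < min_size:
        return [(path, len(data))]
    j, score = best_divisor(data)
    if score == 0.0:                    # no associations left to remove
        return [(path, len(data))]
    yes = data[data[:, j] == 1]
    no = data[data[:, j] == 0]
    return (divide(yes, names, min_size, path + ((names[j], "Yes"),))
            + divide(no, names, min_size, path + ((names[j], "No"),)))

# Simulated 0/1 participation for 400 people (hypothetical, not CORD data).
rng = np.random.default_rng(1)
names = ["swimming", "picnicking", "pleasure driving", "sailing"]
active = rng.random(400) < 0.5
probs = np.where(active[:, None], [0.8, 0.7, 0.6, 0.3], [0.2, 0.3, 0.5, 0.05])
data = (rng.random((400, 4)) < probs).astype(int)

clusters = divide(data, names)
for path, size in clusters:
    print(size, path)
```

Each terminal cluster is identified by its path of "Yes" and
"No" answers, which corresponds to reading one branch of the
tree from the top down.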

Dividing people into groups in the way suggested means
that some of the aggregates, which have been called terminal
clusters, will have people who do not participate in certain
activities and some of them will have people who do
participate in certain activities. This is the way that the
clusters were defined in this analysis. This is obvious from
Table 8 when one looks at the percentage occurrence figures
and sees that, for example, in cluster 2, 100% of the people
drove for pleasure. Driving for pleasure is a defining
characteristic of this cluster along with non-participation
in picnicking and non-participation in swimming. So, also,
one sees that there was no participation by people in
cluster 2 in the activities swimming and picnicking.
However, one only really learns something new when one sees
that for cluster 2 there was also no participation in the
activity sailing or in the activity pick-up camping. These
latter activities do not have zero participation by
definition. Having zero participation conveys real
information about other activities that people in this
aggregate do or do not participate in. For further
illustration, cluster 1 of Figure 1 indicates that not
swimming, not picnicking and not driving for pleasure are
defining characteristics. In the percentage occurrence
figures in Table 8, one sees that zero participation in
these activities occurs. Zero participation also occurs in
pick-up camping, water skiing, and snow skiing; very little
participation, even compared to people in cluster 2, occurs
in many other activities. So a picture begins to emerge of
the people in cluster 1 being far less active than people in
cluster 2.

The results presented in Table 8 are interesting in a
number of respects that have not been covered in the
descriptive discussion above. They certainly make clear that
there are groups of population that have different activity
packages, so a planner should not think of the population as
a homogeneous body of people who have a certain probability
of participating in each of a number of activities
independently of other activities they participate in. The
importance of this in making projections is commented on
earlier in this paper (also in TN 13, 29, 32).
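
A small worked example may make the point about independence
concrete. The sketch compares the observed share of people who
take part in two activities with the share that would be
expected if participation in each were independent. The numbers
are invented for illustration and the function name is
hypothetical.

```python
import numpy as np

def joint_vs_independent(data, i, j):
    """Return (observed joint rate, rate expected under independence)."""
    p_i = data[:, i].mean()
    p_j = data[:, j].mean()
    observed = np.mean((data[:, i] == 1) & (data[:, j] == 1))
    return float(observed), float(p_i * p_j)

# Invented population: 50 people swim and picnic, 50 people do neither.
both = np.ones((50, 2), dtype=int)
neither = np.zeros((50, 2), dtype=int)
data = np.vstack([both, neither])

observed, expected = joint_vs_independent(data, 0, 1)
print(observed, expected)
```

Here half of the population does both activities, whereas
treating the two 50% participation rates as independent would
suggest only a quarter. A projection built on the independent
figure would misstate how activities combine into packages.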

Still, the analysis leaves something to be desired. The
algorithm used does not allow a person who participates in
a number of the activities that are included in one of the
more active clusters to be in that cluster unless he does or
does not participate, as required, in the two or so
critical activities that define that cluster. This is
because a monothetic divisive type of cluster algorithm was
employed to define clusters. But the influence of supply on
what people participate in may automatically eliminate
people from a cluster because, where they live, it is not
convenient to participate in a certain key activity which
defines a cluster.

One who has taken some biology will recall that the
classification problem is one of defining critical
characteristics. Many characteristics are not critical in
determining whether a certain plant should be grouped with
other plants. At first examination it may even appear
curious that certain plants or animals are grouped together.
This kind of consideration carries over to studying the
groupings of recreation activities if one wants activity
clusters for aggregates of people to be truly behaviourally
meaningful. It is the belief of the authors that if one wants
to consider substitutability and recognize the fact that it
may be operative for different individuals who would be in
the same cluster, it is critical to carry out cluster
analyses with a "natural class-seeking algorithm" (see
Appendix).

In terms of guidelines for further research one may
note that it is possible to achieve more natural clusterings
by incorporating supply information into a cluster analysis.
The way that this can be done is to use existing supply
information on two different areas to form a weight that is
used in determining how significant the difference in
participation in a given activity is between the areas in
which the two people being considered live. If two persons
who live in different areas are being compared, then instead
of simply taking the difference between some numbers that
reflect whether a person is a non-participant, or a regular
or irregular participant, squaring these and adding them
up over all activities, one may take this squared value and
weight the difference by reducing it if there is a difference
in supply that reduces the difference in scores (depending
on whether the difference in supply in the two areas is
large, medium or small, defined according to some arbitrary
scale that is set up). In particular one might use the kind
of potential map generated for camping in TN 5, but for each
activity have a similar kind of map so that the potentials
in every geographic area would be known. On the basis of
such a map, differences would be taken as reflecting true
differences if the potentials were within 25% of each other;
in other cases a correction would be introduced to reflect
supply, whereby the participation level of one person would
be shifted towards the potential of the other person by
varying degrees depending on how much the potential for the
activities actually did deviate between the two areas.
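
One possible reading of this proposal is sketched below. It is
not a procedure given in the text: the participation scores (0
for a non-participant, 1 for an irregular and 2 for a regular
participant), the measurement of the supply gap relative to the
larger potential, and the linear discount applied beyond the 25%
tolerance are all assumptions made for illustration, as are the
function names and the sample figures.

```python
def supply_weight(potential_a, potential_b, tolerance=0.25):
    """Weight of 1.0 when potentials are within `tolerance`, smaller otherwise."""
    larger = max(potential_a, potential_b)
    if larger == 0:
        return 1.0
    relative_gap = abs(potential_a - potential_b) / larger
    if relative_gap <= tolerance:
        return 1.0                      # difference treated as a true difference
    # Assumed linear discount: the wider the supply gap, the less the
    # participation difference counts as a behavioural difference.
    return 1.0 - (relative_gap - tolerance)

def weighted_squared_distance(scores_a, scores_b, potentials_a, potentials_b):
    """Sum of supply-weighted squared score differences over all activities."""
    total = 0.0
    for activity in scores_a:
        diff = scores_a[activity] - scores_b[activity]
        weight = supply_weight(potentials_a[activity], potentials_b[activity])
        total += weight * diff ** 2
    return total

# Hypothetical persons in two areas with different camping potential.
person_a = {"camping": 2, "sailing": 0}
person_b = {"camping": 0, "sailing": 0}
area_a = {"camping": 100.0, "sailing": 10.0}
area_b = {"camping": 40.0, "sailing": 10.0}

d = weighted_squared_distance(person_a, person_b, area_a, area_b)
print(d)
```

In this example the unweighted squared difference for camping
is 4; because the camping potentials differ by 60% of the larger
value, the sketch discounts it to about 2.6, so the two persons
are treated as more alike than their raw scores suggest.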

This proposal does not get into the more tricky matter
of substitutability or what some people refer to as
substitute supply. Until some work is done in the more
straightforward way just suggested, and until actual
behavioural information becomes available on what
substitutions people do make and why, it is not really
plausible to make a suggestion about how to take into
account substitution in defining clusters. At this point it
is simply important to note that substitution must be
considered when one is aiming at defining natural classes as
opposed to defining classes which, when corrected for
availability of supply, may have a strong bias toward
meeting certain mathematical conditions rather than telling
one truly about behavioural influences that are operating on
people.

In this discussion the matter of examining the
socio-economic characteristics of people who are in various
aggregates that are determined using cluster analysis
programs has been almost totally ignored. In the
introductory discussion, particularly that based on the work
by Burton, the value of knowing socio-economic
characteristics was stressed with respect to the matter of
making projections. This theme is taken up in CORD Study
Technical Notes 13 and 32, in particular. Here, it is only
necessary to note that in work already published by Romsa
(Reference 49) results have been presented on the
characteristics of people in relation to the aggregates to
which they belong. It is not a difficult matter, when a
person has been classified into a given cluster, to carry out
analyses of the individual clusters to see what
characteristics people have, but it is not a simple matter
to characterize a cluster by the characteristics of the
people who are in it because there may be a wide range of
ages of people and there may be wide deviations in the
socio-economic characteristics. Giving a "typical profile"
based on average education, average income, etc. for a
cluster is not a very adequate way of indicating what
socio-economic characteristics are really important with
respect to a cluster. In fact, there may be somewhat the same
problem associated with characterizing clusters as there is
in saying what activities really constitute the important
activities in defining an activity package. The income
variable may not be important for one activity even though
people in the particular cluster involved have an above
average income: it may in the case of this cluster be the
rural-urban variable that is important and the income
difference may only express the highly urban concentration
of people in a particular cluster. Regardless, it should be
clear from the preceding examples that the issue of
characterizing clusters is seen as important and is yet
another area that must receive attention if practical or
even academic use is to be made of the procedures for
deriving aggregates of individuals from information on their
participation in various activities.
CONCLUSION


From the results presented in this article one can see
that the two analysis strategies used produced results that
are quite different in terms of what they mean, but which
may be confused. Clearly when one speaks about activity
packages, unless there is general acceptance that this is
not a collection of inter-correlated activities determined
by R-mode factor analysis, there is the possibility that one
will misunderstand what is meant. Similarly when a list of
activities is presented and it is said that these define a
"recreation type", one may become confused and not realize
that this is the collection of activities with which people
have been associated in the way described by Burton. Thus
one can see that from a practical perspective it is very
important to indicate whether lists of activities define
groups in the sense of Burton or whether the lists relate to
the activity package of a group of individuals in the spirit
of the other methodology.

Obviously, the two different sets of results do not
have the same implication for planning. Knowing that a
certain proportion of the universe of individuals is
characterized by participating in a certain group of
activities means that a planner can know what proportion of
the universe being studied is served by a certain activity.
By looking at cluster after cluster he is able to see how
important a certain activity is in the different activity
packages of the different aggregates of people into which a
universe has been divided. He can even put this importance
in perspective by presenting figures on the size of the
various aggregates. He may say that for 15% of the
population a certain activity is one of the two activities
usually participated in (these people have only 2
activities) while for 30% of the population this activity
was only 9% of their activities (meaning these people had
about 12 activities in their activity packages). Such a
statement could be made for several groups with activity
packages ranging in size from 8 or 9 activities to 15 or 16
activities.

The reader may wish to attempt to formulate for himself
an example of where the Burton cluster of activities
information would be useful. As indicated in the beginning
of this paper, there are other studies that comment on the
uses of cluster analysis and on the Burton factor analysis
methodology (TN 32, 37, Reference 49). Since these matters
need not be pursued here, the example just introduced is not
elaborated on. However, it is in the context of the example
and the kind of planning concerns just raised that the
future of the kind of research presented here lies.

This article has only shown the feasibility of certain
lines of analysis without pursuing in depth even what the
implications of carrying out these analyses are. An obvious
next step is the preparation of papers where actual
management applications of one or other of the techniques
are made. Another area for further research is that of
theory development in relation to which of the models should
be used and in what circumstances. This involves clarifying
the behavioural considerations relevant to the use of factor
analysis in cluster analysis, as has already been begun (see
TN 32, 37).


APPENDIX 


DESCRIPTION OF SOME PROGRAMMES 
USED IN CLUSTER ANALYSES 


C. Peebles


Cluster analysis is a term which covers a multitude of
strategies and algorithms designed either to group similar
individuals as a function of their attributes or, more
rarely, to group a number of related variables scored over a
series of individuals. Excluding parametric multivariate
techniques such as factor analysis and joint metric space
and partial order scalogram analysis, Wishart (Reference 67)
has classified cluster-analytic strategies into two basic
groups: (1) "natural class" seeking algorithms and (2)
optimum solution methods. The first of these major divisions
has received attention from only a few workers (e.g.,
Wishart (Reference 65, 66, 67) and Jardine and Sibson
(Reference 33)). It is the second of Wishart's two categories
that has seen the bulk of activity in cluster analysis. This
is largely because it offers solutions that are optimal in
terms defined by the investigator rather than in "natural"
terms that are defined by complex methodological and
epistemological dimensions of the problem and data. (See
Reference 33 for a discussion of "natural" class problems
in numerical classification of biological organisms. In the
assignment of individuals to species, the requirements of
both proper taxonomic assignment and phylogenetic grouping
must be met.)

Within the second major group of clustering strategies a
further division can be made. As Wishart says:


Probably the most common technique is the
hierarchic fusion algorithm which has the
advantage (although very expensive with large
populations) in presentation of the resulting
"dendrogram". The construction of "keys" was a
requirement of botanical applications that gave
rise to the early monothetic divisive techniques,
enabling observers to identify plant communities
by the presence and absence of certain key
species. The third recurring technique ...
improves a given classification by iterative
relocation of cluster members so as to optimise
some objective measure of overall homogeneity in
terms of the similarity between individuals and
clusters. Also known as the "K-mean", "transition"
and "euclidean cluster" methods it is economical
in computer processor time and appears to find
global optimum solutions for most small
populations.


In general, most hierarchic-fusion algorithms have in
the past been called polythetic-agglomerative to contrast
them with the monothetic-divisive strategies.
Polythetic-agglomerative clustering groups individuals on
the basis of a measure of similarity or dissimilarity
generated from the measures of their attributes. Based on
these measures of similarity or dissimilarity between all
members, clusters of related individuals are grouped by a
"rule" in the algorithm. An almost inexhaustible array of
measures of relationship between individuals, measured over
both binary and continuous attribute states, has been used.
The number of methods - or rules - for grouping individuals
has been equally large: single, average, and complete
linkage, error sum, information gain, and cliques, clumps,
and stars, to name a few. In each case the measure is either
minimized or maximized as a criterion for inclusion of an
individual in a group or for the fusion of two groups.
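
The fusion rule can be made concrete with a minimal sketch. The Python below is an illustration only and is not the CLUSTAN routine used in the study: it assumes Euclidean distance as the dissimilarity measure and single linkage as the grouping rule, and the function name `single_linkage` is hypothetical.

```python
from itertools import combinations


def single_linkage(points, n_clusters):
    """Polythetic-agglomerative sketch: repeatedly fuse the two closest
    clusters (single linkage) until n_clusters remain."""
    clusters = [[i] for i in range(len(points))]

    def dist(a, b):
        # Euclidean distance between individuals a and b
        return sum((x - y) ** 2 for x, y in zip(points[a], points[b])) ** 0.5

    def linkage(c1, c2):
        # single linkage: distance between the closest pair of members
        return min(dist(a, b) for a in c1 for b in c2)

    while len(clusters) > n_clusters:
        # the measure is minimized as the criterion for fusing two groups
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda ij: linkage(clusters[ij[0]], clusters[ij[1]]),
        )
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]  # j > i, so index i is unaffected
    return [sorted(c) for c in clusters]
```

Swapping `min` for `max` inside `linkage` would give complete linkage; the other rules named above need a different criterion function but the same fusion loop.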

The monothetic-divisive algorithms work in exactly the
opposite manner. Instead of being built into more and more
inclusive groups, a population of individuals is partitioned
into more and more homogeneous sub-sets. Williams and
Lambert (Reference 62), the originators of
monothetic-divisive analysis in plant ecology, defined the
problem as the subdivision of


... a population so that all associations
disappear; but there will in general be a large
number of alternative subdivisions fulfilling this
requirement. We therefore propose the concept of
efficient subdivision, by which we intend
subdivision of that species which, in the two
subclasses resulting, produces the smallest total
number of residual significant associations.


They propose $\chi^2$ as the measure and the variable with
the highest $\sum \chi^2$ as the divisor. That is, if the
population is divided into individuals with and individuals
without the attribute which shows the highest $\sum \chi^2$,
the largest number of associations will be eliminated. The
resulting sets will be the most homogeneous of any possible
pair drawn from the initial population.
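
A short sketch of one division step may help. It is an assumption-laden illustration rather than the procedure of Reference 62: each attribute is scored by the sum of its 2 x 2 chi-square values against every other attribute, and the function names and the presence/absence data layout (rows are individuals, columns are attributes) are hypothetical.

```python
def chi_square_2x2(x, y):
    """Chi-square for the 2 x 2 table formed by two binary attributes."""
    n = len(x)
    a = sum(1 for p, q in zip(x, y) if p and q)
    b = sum(1 for p, q in zip(x, y) if p and not q)
    c = sum(1 for p, q in zip(x, y) if not p and q)
    d = n - a - b - c
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    # an attribute that is constant carries no association
    return 0.0 if denom == 0 else n * (a * d - b * c) ** 2 / denom


def best_divisor(data):
    """Index of the attribute with the highest summed chi-square."""
    n_attr = len(data[0])
    cols = [[row[j] for row in data] for j in range(n_attr)]
    totals = [
        sum(chi_square_2x2(cols[j], cols[k]) for k in range(n_attr) if k != j)
        for j in range(n_attr)
    ]
    return max(range(n_attr), key=totals.__getitem__)


def divide(data):
    """Split the population on the best divisor attribute."""
    j = best_divisor(data)
    with_attr = [row for row in data if row[j]]
    without_attr = [row for row in data if not row[j]]
    return j, with_attr, without_attr
```

Applying `divide` again to each resulting subset gives the successive, more homogeneous sub-sets described above.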

The use of $\sum \chi^2$, however, has several undesirable
properties.


... in the qualitative case we have become
increasingly dissatisfied with the $\chi^2$ model, which
has proved excessively sensitive to skewness of
the underlying distribution; as a result, the
simultaneous possession by an individual of two
uncommon attributes assumes a quite
disproportionate importance. (Reference 37)


Lance and Williams, Orlocki (Reference 46), McNaughton-Smith
(Reference 42) and others have proposed that the Information
Statistic $2\Delta I$ be used in place of $\sum \chi^2$. The
Information Statistic asymptotically approaches the $\chi^2$
distribution when the population over which it is computed
is large. The Information Statistic I can be symbolized


$\Delta I(AB) = I(1:2)(AB) - I(1:2)(A) - I(1:2)(B)$


WHERE the I's represent the mutual information between two
frequency distributions, $\chi(1)$ and $\chi(2)$,
representing the row and column classifications in
table A or B or in their union A + B (Reference 42).


For a 2 x 2 table where both variables are binary,
$2\Delta I$ becomes 2(B+C) log(2) where B and C are the
upper right and lower left cells of the table. The
population is divided on that attribute for which the
$\sum 2\Delta I$ is largest. Such a division results in the
greatest amount of information fall in the two resultant
populations.

Cluster analysis, whether agglomerative or divisive,
monothetic or polythetic, is a data reduction device. The
sampling distributions and statistical properties of various
techniques have not been worked out. Thus any statistical
inference based solely on the results of a cluster analysis
is suspect at best. If, however, the results of a cluster
analysis fulfill some theoretical expectation or are
subjected to further, external statistical testing then
cluster analysis can be a very powerful technique. The
programs used in this study were part of Wishart's CLUSTAN
IA Suite and were run on the University of Windsor's IBM
360-50.


TN 10 CHAPTER VII page 54% 


CORD STUDY TECHNICAL NOTE 19 


OBTAINING EFFICIENT ESTIMATES OF PARK USE 
AND TESTING FOR 
THE STRUCTURAL ADEQUACY OF MODELS 


J. Beaman, J.L. Knetsch, H.K. Cheung


ABSTRACT 


This paper provides an examination of the problem of
heteroscedasticity as it relates to estimating park use,
although the results can also be applied to a wide variety
of flow problems involving traffic, people or commodities.
The major issue is that estimates of flows obtained using
ordinary least squares, OLS, often yield statistically
significant results while still giving rise to large
differences between observed and predicted flows.

The paper presents results which show that for the flow
estimation problem of concern, more accurate use estimates
may be obtained by using generalized least squares, GLS,
rather than using OLS. Weights to use in a GLS regression
are derived. These are presented in a covariance matrix
which is developed taking into account the variance to be
expected in origin-destination flows.

It is shown that deriving the correct weights,
estimates of variances, to use in a regression analysis
results in an "absolute" test for the structural
appropriateness of the regression model. Tests related to
the "absolute" adequacy test are introduced and their use to
identify specific structural problems with a model is
illustrated.


INTRODUCTION 


Increasingly, more formal methods of estimating
attendance at proposed parks and recreation areas are being
used in the planning and justification of such areas.
Ordinary least squares regression (OLS) models are
characteristically developed to estimate the relationship
between (1) measures of the use of parks and recreation
areas, usually specified as the volume of origin-destination
flows for a number of existing sites, and (2) various
independent variables influencing a park's use. These latter
variables are usually chosen to reflect the characteristics
of the sites under consideration and to be measures of the
size and proximity of populations from which visitors come.
(See References 1, 2, 9 and 10.)

While the "relations" established by these kinds of
regression are statistically significant, there is typically
an undesirable lack of precision in the prediction of actual
origin-destination visitor flows (attendance figures).
Estimates often differ from observed values on which they
are based by several hundred percent; thus there is a
problem with the accuracy of estimates. (See References 5,
6.)

The difficulty can be illustrated by examining the
origins of visitors to a provincial park in Saskatchewan
(Rowan's Ravine) and employing a simple model to explain the
variation in flows as a function of the distance from the
visitors' origins to the park (see Table 1) and the sizes of
the populations at the origins. The total observed 1969
attendance at this park was 9,828 parties, coming from
fifteen different origin areas. However, of the total, 5,868
parties (or about sixty percent) came from a single origin
area located fairly close to the park and containing Regina,
the largest population centre in the region.

Ordinary least squares regression was used to estimate
the parameters in Equation 1. The following relationship
between the flows from the different origins to Rowan's
Ravine was obtained:


(1)  $\log_{10}\left(\dfrac{V(o,d) + 1}{P(o)}\right) = 2.811 - 0.0241\,D(o,d)$


WHERE V(o,d) is the number of visiting parties coming from
an origin, o, to a park, d; P(o) is the population in
thousands of o; and D(o,d) is the distance from o to the
destination, d, in road miles. The constant 1.0 was added to
visit numbers to avoid the problem created by taking the
logarithm of zero. Logarithms used were base 10.

Equation 1 proved a reasonable explanation of the
variation in the dependent variable. The R² for the
regression is 0.77. The regression coefficients are highly
significant according to the usual F-test. Also, the
standard error of the regression coefficient of D(o,d) is
only 0.00783. Nevertheless, the explanation of the use of
the park is not particularly good, as can be seen from Table
1. Even though the fitting was done using the data shown,
the estimate of a total of 5,887 visiting parties from all
origins is nearly 4,000 below the actual total visitor flow.
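
The size of the shortfall can be checked by inverting the fitted equation for the largest origin. The sketch below assumes the intercept reads 2.811 and takes the distance coefficient as -0.02413 (the five-decimal value quoted later in the text); the function name is hypothetical, and the inputs for origin unit 4 (population 132,432 and 61 road miles) are taken from Table 1.

```python
A_OLS = 2.811      # assumed reading of the OLS intercept (base-10 logs)
B_OLS = -0.02413   # OLS coefficient of distance quoted in the text


def predicted_parties(population_thousands, distance_miles, a=A_OLS, b=B_OLS):
    """Invert log10((V + 1) / P) = a + b * D to obtain V."""
    return population_thousands * 10 ** (a + b * distance_miles) - 1


# Origin unit 4: the origin area containing Regina.
estimate = predicted_parties(132.432, 61)
observed = 5868
percent_error = 100 * (observed - estimate) / estimate
print(round(estimate), round(percent_error))
```

Under these assumptions the estimate comes out at roughly 2,900 parties, about half of the 5,868 observed, which is the order of error the text reports for this origin.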

In using OLS regression to estimate the relationship
between visits and distance, each observation point was
treated as if it were as important as any other. In the
example, the use of OLS regression treats the observation
for origin unit 4 (which contributed 60 percent of the
observed total use) as the equal of the observation for
origin unit 13 (which contributed approximately 0.3 percent
of the total use) or the same as observation 16 which
involved no visits and therefore contributed nothing to
total use.

While errors in prediction are partly due to omission
of causal factors and 'measurement errors' (see Reference
9), a further and major cause of poor predictions using the
model is clearly the heteroscedasticity among the
observations, and this is not properly dealt with when
estimating the parameters. The preceding statement is made
without applying a test for heteroscedasticity (see
References 7, 8) because, in the following, the nature of
the variance in observations is derived.


TABLE 1


STATISTICS PERTAINING TO OBSERVED AND ESTIMATED DAY VISITS
TO ROWAN'S RAVINE PROVINCIAL PARK, SASKATCHEWAN


Observation    Distance to Park
Unit           in Miles             Population

  1               133                 32,489
  2               126                [?]1,923
  3               104                 17,813
  4                61                132,432
  5                34                 11,594
  6                14                  1,632
  7                21                  3,871
  8                67                  2,829
  9               110                 36,889
 10               107                  3,271
 11               109                  6,181
 12                40                  4,237
 13               139                 21,104
 14                84                 16,284
 15               154                117,405
 16               117                  4,456
 17               129                 23729

[The columns headed Observed Visits (Vehicles), Estimated
Visits (unweighted regression), Estimated Visits (weighted
regression) and Per cent error* (unweighted, weighted) are
not legible in this copy; only the values 11,870 and 227
survive from them.]

* % error = ((Observed Visits - Estimated Visits) /
  Estimated Visits) * 100


A FIRST STEP TOWARD *PROPER* ESTIMATION: 
DERIVATION OF A COVARIANCE MATRIX 
FOR THE OBSERVED VISITOR FLOWS 


When the parameters of a model are estimated using
generalized least squares (GLS), it is necessary to know the
covariance matrix of the observations. Subsequently, a
covariance matrix for observed visitor flow, $\Sigma_v$, is
derived which is critical in obtaining the covariance matrix
of transformed observations, $\Sigma_T$, that is eventually
used in a regression analysis. To facilitate the discussion
involved in deriving $\Sigma_v$ the following notations and
definitions are used:


v(o,d,t,g)  observed number of vehicles, with parties in
collectivity g, going from origin o to
destination d on day t;


WHERE the collectivity g is a set of parties which tend to
have similar behaviour in terms of their probability of
participating in a given package of recreation activities at
destination d on a day t with its given weather, park
crowding and traffic conditions, etc.


$\hat{v}(o,d,t,g)$  the predicted value of v(o,d,t,g), an
estimate;


p(o,d,t,g)  the probability that a party in collectivity g
would go to d from o on day t;


V(o,d,t,g)  the random variable that generates the observed
values v(o,d,t,g);


E(V(o,d,t,g))  expected value of V(o,d,t,g);
VAR(V(o,d,t,g))  variance of V(o,d,t,g).

Additionally, when some subscripts are removed from
v(o,d,t,g), the resulting expression implies a sum over the
given subscript(s). The number of parties going from
origin o to a destination d on day t is
$v(o,d,t) = \sum_g v(o,d,t,g)$; the number over all days is
$v(o,d) = \sum_t \sum_g v(o,d,t,g)$; and the total
attendance at the park is
$v(d) = \sum_o \sum_t \sum_g v(o,d,t,g) = \sum_o v(o,d)$.

The most usual assumption in regression analysis (the
OLS or homoscedasticity assumption) is that $\Sigma_v$ is a
diagonal matrix of the form:


$\Sigma_v = \sigma^2 I$


WHERE $\sigma^2$ is the variance that applies to all
observations.

But, now consider that visitor flows from o to d on day
t for group g depend on N(o,g), the number of people in g
available to participate, and p(o,d,t,g), their probability
of participating. The very nature of the definition of
V(o,d,t,g) in terms of N(o,g) and p(o,d,t,g) implies that
V(o,d,t,g) is a binomial random variable with mean and
variance as follows:


(2)  $E(V(o,d,t,g)) = N(o,g)\,p(o,d,t,g)$
(3)  $\mathrm{VAR}(V(o,d,t,g)) = N(o,g)\,p(o,d,t,g)\,(1 - p(o,d,t,g))$


and, if p(o,d,t,g) is small, Equation 4 holds so that
Equations 5 and 6 follow.


(4)  $\mathrm{VAR}(V(o,d,t,g)) \approx N(o,g)\,p(o,d,t,g) = E(V(o,d,t,g))$
(5)  $E(V(o,d,t)) \approx \sum_g \mathrm{VAR}(V(o,d,t,g))$
WHERE the summation is over g.


(6)  $\mathrm{VAR}(V(o,d,t)) \approx \sum_g \mathrm{VAR}(V(o,d,t,g))$
     $= \sum_g E(V(o,d,t,g))$
     $= E(V(o,d,t))$


One should note that v(o,d,t) can be observed and it is
an estimate of E(V(o,d,t)); an observation is an estimate of
its expected value (if the expected value exists). So, from
Equation 5 one can see that it is possible to obtain
estimates of the variance of V(o,d,t).
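
The small-probability approximation behind Equation 4 is easy to check numerically. The sketch below uses a hypothetical collectivity of 10,000 parties and three illustrative participation probabilities; none of these figures come from the study.

```python
def binomial_mean_and_variance(n_parties, p_visit):
    """Mean and variance of a binomial count, as in Equations 2 and 3."""
    mean = n_parties * p_visit
    variance = n_parties * p_visit * (1.0 - p_visit)
    return mean, variance


# Hypothetical collectivity: the ratio variance / mean equals 1 - p,
# so it approaches 1 as the participation probability becomes small.
for p in (0.5, 0.05, 0.005):
    mean, variance = binomial_mean_and_variance(10_000, p)
    print(p, mean, variance, variance / mean)
```

For a probability of 0.005 the variance is within half a percent of the mean, which is why an observed flow can serve as an estimate of its own variance.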

Similarly, for total use from an origin to destination,
one obtains:


(7)  $\mathrm{VAR}(V(o,d)) \approx \sum_t \mathrm{VAR}(V(o,d,t))$
     $= \sum_t E(V(o,d,t))$
     $= E(V(o,d))$


WHERE the summation is over t.

Thus the variances in V(o,d,t) and V(o,d) are
approximately proportional to their respective expected
values, meaning that either observations or predicted values
can be used as estimates of the variance of V(o,d) (see
Equation 4). The merits of using observations to define
weights to obtain estimates that can then be employed as
variance estimates in a second cycle of estimation are not
discussed.

Now it will be the exception, rather than the rule,
that a change in vehicle flow to one site will be correlated
with change in visitor flows to other sites. This is because
a trip by a single visiting party of a given type on a given
day from a given origin to one site or another is not
expected to influence the decision of other parties in other
vehicles. Obviously, parties may make decisions based on
what they think other parties will do, but this is not the
issue.

Assuming that the fluctuations in daily flows to one
origin-destination pair for one type of user are not
correlated with similar flows to another origin-destination
pair, it follows that the estimated covariance matrices of
v(o,d) can be written as follows (with the matrix for
v(o,d,t) or other observations being written in a similar
way):


(8)  $\Sigma_{v(o,d)} = \begin{bmatrix} \ddots & & 0 \\ & E(V(o,d)) & \\ 0 & & \ddots \end{bmatrix}$

This matrix would be the appropriate covariance matrix
of observations if non-linear regression were being used
with the v(o,d) as the dependent variable. (See Reference
5.)


OBTAINING MODEL PARAMETERS USING LINEAR REGRESSION 


To regard the estimates of the parameters as efficient,
certain distributional properties of the error term e(o,d)
must be assumed. The form of Equation 1 suggests that the
fluctuations in the random variable V(o,d) define the
variance of the error term e(o,d). It is complicated to give
an exact relationship that shows how the fluctuations in
V(o,d) define fluctuations in e(o,d). However, a Taylor
series expansion of log((V(o,d)+1)/P(o)) around E(V(o,d))
results in Equation 1 taking the form shown in Equation 9
which, after 'simplification', results in Equation 10.

(9)  $f + \dfrac{df}{dV(o,d)}\,\Delta V(o,d) \approx a + bD(o,d) + e(o,d)$


WHERE
$f = \ln\left(\dfrac{V(o,d) + 1}{P(o)}\right)$ and


the derivative is evaluated at E(V(o,d))


(10)  $f + \dfrac{\Delta V(o,d)}{E(V(o,d)) + 1} \approx a + bD(o,d) + e(o,d)$


Because the random fluctuations on the two sides of
Equation 10 must be approximately equal:


$\dfrac{\Delta V(o,d)}{E(V(o,d)) + 1} \approx e(o,d)$


Because the series expansion is about E(V(o,d)), by
definition $\Delta V(o,d)$ is the fluctuation of V(o,d)
around its expected value, so its variance is the same as
the variance of V(o,d). What is more, the variance of V(o,d)
is VAR(V(o,d)) which, as was shown earlier, is approximated
by E(V(o,d)). So, by well known statistical theorems:


(11)  $\mathrm{VAR}(e(o,d)) \approx \dfrac{E(V(o,d))}{(E(V(o,d)) + 1)^2}$
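
Equation 11 translates directly into regression weights. The sketch below is a hedged illustration with hypothetical function names: it treats the weight of an observation as the reciprocal of its error variance, which is the usual convention for a diagonal covariance matrix. The expansion is written for natural logarithms; with base-10 logarithms every variance is multiplied by the same constant, so the relative weights are unchanged.

```python
def error_variance(expected_flow):
    """Equation 11: VAR(e) is approximately E(V) / (E(V) + 1)**2."""
    if expected_flow <= 0:
        # a zero flow gives zero variance; a predicted (non-zero)
        # value would have to be supplied instead
        raise ValueError("expected_flow must be positive")
    return expected_flow / (expected_flow + 1.0) ** 2


def gls_weight(expected_flow):
    """Weight = 1 / variance; roughly E(V) itself when E(V) is large."""
    return 1.0 / error_variance(expected_flow)


# A flow of 5,868 parties receives about 200 times the weight of a
# flow of 27 parties (27 is an illustrative figure).
print(gls_weight(5868) / gls_weight(27))
```

The guard against non-positive flows reflects a practical point rather than anything stated in the source: an origin with no observed visits cannot supply its own variance estimate.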


Since Equation 8 shows that the v(o,d)'s are
uncorrelated, an appropriate covariance matrix for GLS
estimation of the parameters in Equation 1 is one that is
defined using estimates of the expected values in the
following:


(12)  $\Sigma_T(o,d) = \begin{bmatrix} \ddots & & 0 \\ & \dfrac{E(V(o,d))}{(E(V(o,d)) + 1)^2} & \\ 0 & & \ddots \end{bmatrix}$

It should be noted that the preceding discussion has
implied that all use of a park during a given period is
monitored. Usually, however, the total traffic flow (or the
components of that flow) to a site from an origin is not
observed but estimated. The dependent variable in a
regression thus will likely be a function of vv(o,d), a
weighted sum of observations. Yet the fact that in a
particular regression vv(o,d) is the dependent variable
presents no particular problem. One may simply use the
variance in the vv(o,d)'s in a GLS regression by entering
them in place of E(V(o,d)) in Equation 12. The weights
w(o,d,t) used to multiply the flows v(o,d,t) to get vv(o,d)
can be used to obtain the variance in vv(o,d). Using both
the results presented in Equation 8 and the well known
statistical theorems that deal with variance, it follows
that if v is used as a notation to indicate survey
observations with time in hours, days or some appropriate
unit and if $vv(o,d) = \sum_t w(o,d,t)\,v(o,d,t)$ where the
sum is over t, then:


(13)  an estimate of variance in vv(o,d) =
      $\sum_t w^2(o,d,t)\,v(o,d,t)$ summed over all sample times


Alternatively, if a survey design allowing variances in
origin-destination visitor flow estimates to be calculated
was used, the estimates obtained could be employed in
defining the covariance matrix.
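
As a minimal sketch of Equation 13, the function below expands sample counts to a use estimate and returns the matching variance estimate. The expansion weights 18 (weekday) and 27 (weekend) are the ones quoted in the note to Table 3; the questionnaire counts and the function name are hypothetical.

```python
def weighted_flow_and_variance(sample_counts, expansion_weights):
    """Return (sum of w * v, sum of w**2 * v) over the sample times.

    The second value is the variance estimate of Equation 13, which
    relies on each count having variance roughly equal to its mean.
    """
    total = sum(w * v for w, v in zip(expansion_weights, sample_counts))
    variance = sum(w * w * v for w, v in zip(expansion_weights, sample_counts))
    return total, variance


# Hypothetical origin: 10 weekday and 20 weekend questionnaires returned.
use, var_use = weighted_flow_and_variance([10, 20], [18, 27])
print(use, var_use)  # 720 17820
```

Because the weights enter squared, the variance of an expanded estimate is larger than the estimate itself, so treating expanded flows as if they were direct counts would overstate their precision.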


AN APPLICATION


The preceding discussion implies that it is appropriate
(1) to accept the heteroscedasticity of variances in flows
when using the 'logarithmic additive' model, and (2) in
determining regression coefficients, to give larger weights
to origins contributing large flows of visitors than to
those that contribute small visitor flows to the total use
of an area.

For reasons cited earlier, the covariance matrix of the
observations used in making GLS estimates of the parameters
of Equation 1 is essentially given in Equation 12. When the
parameters in Equation 14 were estimated using GLS for the
same set of data as used to derive Equation 1, the following
equation was obtained:


(14)  $\log_{10}\left(\dfrac{V(o,d) + 1}{P(o)}\right) = 3.13701 - 0.0246\,D(o,d)$


The predicted numbers of visits from each origin to
Rowan's Ravine and the percent of error between predictions
and observations are presented in Table 1 as they were for
the OLS regression. The R² attained was 0.87, which was up
from 0.77. As in the OLS regression, the regression
coefficients are highly significant. And, in this case, the
standard error of the regression coefficient of D(o,d) was
0.00246. Comparing Equations 14 and 1, one sees that the
coefficient of D(o,d) is relatively constant (at 0.02413 for
OLS and 0.0246 for GLS). These coefficients also have small
standard errors (0.00783 and 0.00246) in comparison to their
actual values.
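
Because the covariance matrix is diagonal, GLS estimation here reduces to weighted least squares with weights equal to the reciprocal variances. The sketch below fits a straight line that way; it is not the program used in the study, the function name is hypothetical, and the four origins in the usage lines are invented for illustration.

```python
import math


def weighted_line_fit(x, y, weights):
    """Weighted least squares for y = a + b * x (weights = 1 / variance)."""
    sw = sum(weights)
    xbar = sum(w * xi for w, xi in zip(weights, x)) / sw
    ybar = sum(w * yi for w, yi in zip(weights, y)) / sw
    sxx = sum(w * (xi - xbar) ** 2 for w, xi in zip(weights, x))
    sxy = sum(w * (xi - xbar) * (yi - ybar)
              for w, xi, yi in zip(weights, x, y))
    b = sxy / sxx
    a = ybar - b * xbar
    return a, b


# Hypothetical data: distance (miles), population (thousands), visits.
distance = [20.0, 60.0, 100.0, 140.0]
population = [4.0, 130.0, 18.0, 21.0]
visits = [900, 5900, 70, 25]

y = [math.log10((v + 1) / p) for v, p in zip(visits, population)]
w = [(v + 1) ** 2 / v for v in visits]   # reciprocal of Equation 11

a_ols, b_ols = weighted_line_fit(distance, y, [1.0] * len(y))  # equal weights
a_gls, b_gls = weighted_line_fit(distance, y, w)               # flow weights
print(a_ols, b_ols)
print(a_gls, b_gls)
```

With equal weights the function returns the ordinary least squares line, so the same routine can produce both fits for comparison.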

An obvious difference between the results of the two
regressions is seen in Table 1. The residuals obtained using
GLS regression range from 1 to 1,595, as compared to the
residuals of the unweighted regression analysis which range
from 1 to 2,978. But, because of the bias involved when
antilogarithms of predicted values are taken to obtain
estimates of the individual flows, it is not clear to what
extent the larger residuals for the OLS regression are due
to model specification error, to measurement error and to
"pure" logarithmic transformation bias.

One should recognize that GLS regression analysis
resulted in an increase of percent error for some flows,
such as those from observation units 5, 7 and 12. However,
all of these flows are small compared to the flow from
observation unit 4 to Rowan's Ravine. As one can see from
Table 1, on the average the percent error in the individual
flows was greatly reduced by using weighted (GLS)
regression. Had a weighted average been used to compute
average error, GLS results would have appeared even better.
Observation 4, which contributed about sixty percent of the
total visitor flow, had its error reduced from 103 percent
to two percent. Regardless, whether a percent RMS error
measure or variances of parameters is considered, the GLS
model is superior to the OLS model. (See Table 2.)
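
The percent error and percent RMS error measures referred to here can be written out as a short sketch. The function names are hypothetical; the definitions follow the notes to Tables 1 and 2, and the single check uses the observed flow of 5,868 parties for observation 4 together with its GLS prediction of 5,731 from Table 3.

```python
def percent_error(observed, estimated):
    """(observed - estimated) / estimated, expressed in percent."""
    return 100.0 * (observed - estimated) / estimated


def percent_rms_error(observed, estimated):
    """Root mean square of the percent errors over all observations."""
    errors = [percent_error(o, e) for o, e in zip(observed, estimated)]
    return (sum(err ** 2 for err in errors) / len(errors)) ** 0.5


# Observation 4 under GLS: an error of a little over two percent.
print(round(percent_error(5868, 5731), 1))  # 2.4
```

Since the measure is unweighted, a small flow with a large percent error counts as much as the dominant flow, which is the point made above about weighted averages.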


AN *ABSOLUTE* MEASURE OF MODEL APPROPRIATENESS 


From a critical perspective, the model described in
this paper has been assumed to be structurally sound. The
wary reader may be disturbed by this assumption. Actually,
the theoretical error distributions developed can be used to
see if the observed residuals are distributed as they should
be if the model is structurally appropriate for the data.
The results already presented suggest that one consider:


(15)  $\chi^2(M-N) \approx \sum \dfrac{(\text{residual})^2}{E(V(o,d))}$
      over all o,d flows

WHERE M = number of flows observed,
      N = number of parameters estimated,

and, for evaluating individual flows:


(16)  $\chi^2(1) \approx \dfrac{(\text{residual})^2}{E(V(o,d))}$


The rationale for Equations 15 and 16 is that for the
distribution being considered, a residual squared divided by
its variance is approximately the square of a normal
zero-one variable. This is by definition a chi-square with
one degree of freedom. It is recognized that the residuals
are not orthogonal to each other since degrees of freedom
are lost when parameters are estimated. So, in Equation 15,
degrees of freedom M-N are suggested with N being the number
of regression parameters. The authors believe that using
observations as GLS weights does not result in the loss of
further degrees of freedom.
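
A sketch of the two tests follows. The function names are hypothetical and the residuals and variances in any call would be supplied by the analyst. The one-degree-of-freedom tail probability uses the identity P(chi-square(1) > x) = erfc(sqrt(x / 2)); for the overall statistic the sketch returns only the value and its degrees of freedom, to be compared with a chi-square table.

```python
import math


def chi_square_terms(residuals, variances):
    """Equation 16 for each flow: residual squared over its variance."""
    return [r * r / v for r, v in zip(residuals, variances)]


def upper_tail_1df(x):
    """P(chi-square with one degree of freedom exceeds x)."""
    return math.erfc(math.sqrt(x / 2.0))


def adequacy_statistic(residuals, variances, n_parameters):
    """Equation 15: summed statistic and its M - N degrees of freedom."""
    terms = chi_square_terms(residuals, variances)
    return sum(terms), len(terms) - n_parameters


print(round(upper_tail_1df(3.84), 3))  # 0.05
```

Listing the individual terms alongside the total shows which origin-destination flows are responsible for a poor overall fit.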

Using Equation 16, it is possible to see that, overall,
there are structural problems with the model. The large
$\chi^2$ values in Table 3 actually make it clear that the
model does not do as well as it should in explaining the
observed flows. The $\chi^2(M-N)$, having a highly
significant value, is "absolute" proof that the structure of
the model used is not totally adequate to explain the
observed flows, so its value is an "absolute" criterion for
the structural adequacy of a model. Obviously, several
models could be accepted as structurally adequate and, if
this is the case, then it is possible that the methodology
of Smith (see Reference 12) should be employed to select one
model as the best.


TABLE 2


PERCENT ROOT MEAN SQUARE (RMS) ERROR
AND RELATED STATISTICS OF OLS AND GLS ESTIMATES


                                         Improvement
Error                                    Factor
Measure            OLS        GLS        GLS/OLS

% RMS Error*       200.00     95.91      0.48
S.D. of b**        0.00783    0.00567    0.72
Average            -          -          0.60


*  % RMS Error = $\left(\frac{1}{17}\sum_i (\%\ \text{error of observation } i)^2\right)^{1/2}$
   WHERE percent error of observation i =
   ((observed visits - estimated visits) /
   estimated visits of observation i) x 100


** It is recognized that the estimate of the standard
   deviation of the regression coefficient, S.D. of b, is
   biased when OLS is used with heteroscedastic data. One
   can of course use the data provided to calculate an
   unbiased OLS estimate of the S.D. of b but it is not
   relevant to the problem under consideration. This is
   because the concern is with comparing a procedure
   accepting the OLS model with a GLS result.

As well as recognizing that an overall structural
adequacy test can be made, it should be noted that the very
large $\chi^2(1)$ values associated with observation units 5
and 7 certainly reflect problems with the model used because
those observed for the origin-destination flows have
essentially zero probability of occurring. However, careful
examination of how origin areas 5 and 7 were defined and how
the distances from these areas to Rowan's Ravine were
measured suggests that the poor agreement between
predictions and observations is a result of D(o,d) being
given a value that is smaller than the value it should have.
Similar considerations allow one to understand other
significant residuals. A table such as Table 3 has good
information from which to improve a model and such a table
should be computed as a routine part of analysis if a model
is not structurally adequate.


CONCLUSION 


This paper has dealt with obtaining efficient estimates
of the parameters in an equation that defines a relationship
between visitor flows and other variables. The rather
obvious conclusion that has been reached is that, in
regression analysis of flows using an equation such as
Equation 1, greater weights should be given to the more
accurate observations of visitor flows so that efficient
flow and parameter estimates can be obtained.


A practical consequence of having more efficient
estimates is that research costs can be reduced or planning
accuracy improved without increasing existing data
collection costs. The fact that, on the average, accuracy
improvement was sixty percent (see Table 2) means that to
achieve the GLS level of accuracy using OLS, about three
times as much data, (1/0.60)² ≈ 3, would be required. Using
GLS regression costs no more (or little more) than using
OLS.
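
The arithmetic behind the "three times as much data" figure can be spelled out. The sketch assumes, as the expression (1/0.60)² implies, that the error of an estimate shrinks in proportion to one over the square root of the number of observations; the function name is hypothetical.

```python
def relative_sample_size(error_ratio):
    """Data multiple needed to shrink errors by the given ratio,
    assuming errors scale as 1 / sqrt(n)."""
    return (1.0 / error_ratio) ** 2


# Average GLS/OLS improvement factor from Table 2.
print(round(relative_sample_size(0.60), 2))  # 2.78
```

The result, about 2.8, is what the text rounds to three.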

Finally, regarding point 3, it is admitted that in
deriving the covariance matrices for v(o,d) and v(o,d,t), a
number of assumptions were important in reaching the
expressions derived. The validity of these assumptions about
the behaviour of recreators must be checked. The specific
concern must be whether probabilities that are assumed to be
small are small. However, one should not make too much of
this. As noted for point 2, small or even moderate errors
(320 or 40 percent) in the variance elements of the
covariance matrix - errors due to poor approximations - have
less effect on the parameter values estimated using the
covariance matrix than one might expect.

In conclusion, an example helps illustrate the
importance of having both efficient estimates and an
absolute measure of a model's structural adequacy. The work
of Van Doren and Ellis cited early in this paper compares
the "goodness" of a gravity model and a systems model to
explain trip distribution in Michigan. They show that a
systems model is 30 to 40 percent 'more accurate' than a
gravity model. But Van Doren used OLS in estimating the
parameters of his gravity model. If Van Doren had used GLS
he would have achieved about 60 percent improvement in
accuracy and it would have been concluded that both models
were equally good or that the gravity model was slightly
better.


TABLE 3


COMPARISON OF RESIDUALS AND OBSERVED VALUES


                         Residual
Origin    Predicted      (observed -      Residual
area      Flow           predicted)       Squared

  1       [illegible]         14                196
  3           66              -3                  9
  4        5,731             137             18,769
  5        2,315          -1,595          2,544,025
  6        1,001             969            938,961
  8           86             -50              2,500
  9           98               1                  1
 11           16               2                  4
 12          601            -187             34,969
 13           10              17                289
 14          190            -127             16,129
 15           38              88              7,744


          Est. Observed Use***     Approx.     Probability
Origin                             Chi-        of the
area        E1          E2         square      value

  1          .38         .66        .39         *
  3         1.13        1.98        .006        *
  4        98.81      173.50        .14         *
  5        39.91       70.08      49.11         **
  6        17.43       30.60      41.50         **
  7        27.82       48.85      42.30         **
  8         1.48        2.59       1.30         *
  9         1.68        2.95        .0004       *
 11          .27         .47        .011        *
 12        10.36       18.19       2.60         *
 13          .17         .29       1.32         *
 14         3.27        5.74       3.80         *
 15          .65        1.14       9.18         *


*    This value of chi-square(1) has a high probability of
     occurring.

**   This value has a low probability of occurring by chance.

     Probabilities that remain legible in these notes:
     P(chi-square(1) > 3.84) = .05;
     P(chi-square(1) > 9) = .0027;
     P(chi-square(1) > 10) = .00157.
     The thresholds .0039 and .00016 also appear, but the
     probabilities printed beside them cannot be read.

***  E1 and E2 values are estimates of the observed weekday
     and weekend use respectively. These figures were used in
     calculating chi-squared values because total use was
     estimated by:

     Use = (18)(number of weekday questionnaires returned)
           + (27)(number of weekend questionnaires returned)

     Since it was known that for all handbacks E1/E2 =
     168/295, this ratio and the equation predicted use =
     18E1 + 27E2 were used to derive estimates of E1 and E2
     for each observed flow. Then the variance in the use was
     calculated as described in the paper: Var(use) =
     (18)²E1 + (27)²E2. Thus chi-square(1) =
     (residual squared)/variance(use).


It should also be noted that if a significant $\chi^2$ value
of 'absolute' fit were found for one or both Michigan
models, one would be forced to face the fact that neither
Van Doren nor Ellis had done that well in explaining
behaviour. Certainly there is need to be more concerned with
adequacy of model structure before comparing R²'s or
correlated measures to show that a model is better.


TN 19 CHAPTER VII page 563


CORD STUDY TECHNICAL NOTE 35 


STATISTICAL CONSIDERATIONS 
USING GRAVITY TYPE MODELS 
TO EXPLAIN VISITOR FLOWS 


M.F. Goodchild


ABSTRACT 


This paper reports on an inquiry into the problems of
fitting aggregate spatial interaction models to empirical
data. The concern is with flows of visitors to recreation
sites from a variety of origins, and with the class of
models for those flows, normally referred to as gravity
models. The paper reviews the conventional approach to
spatial interaction analysis, using standard measures of
success. Then four basic problems are discussed in the
context of a small data set. These relate to: non-linearity
of models; integral values of visitor flows; volume of flow
observed and goodness of fit; and the weak theoretical basis
for the models used. Finally, the paper examines other
problems which arise in more complex situations, and makes
general recommendations.

The early sections on problems are statistical and will
be of more interest to technical readers; later sections are
more general, and the statistical results are restated
non-technically in the conclusions.


INTRODUCTION 


This paper reports on an inquiry into the problems of
fitting aggregate spatial interaction models to empirical
data. More specifically, it is concerned with flows of
visitors to recreation sites from a variety of origins, and
with the class of models for those flows, normally referred
to as gravity models from a rather tenuous analogy to the
Newtonian inverse square law of gravitational attraction.

The most general form of the gravity model is as
follows:


Idi,gj) = £01 )0FC1i)If°0290ACj ) 03 )C DCI, y)) 
WHERE ICisj) is the flow from origin i to destination jy 


P(i) is a measure of the potential supply of visitors 
from origin iy 


AC j) is a measure of the attractiveness ot place jy; 
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D(C iyj) is a measure of the trip from i to jy and 


£(C1), £02), and £(3) are functions calibrated for a given 
activity and set of origins and destinationse 


Since the early sections of the paper consider the
problems of fitting models to a single destination, f2 can
be ignored. Assume that P(i) is the population of the ith
origin, that f1 is linear and that f3 takes one of the
following forms: exp(bD(i,j)) or D(i,j)**b. Then there are
two possible expressions of the general form presented
above:


$$I(i,j) = P(i)\,\exp\bigl(a + b\,D(i,j)\bigr) \tag{1}$$


$$I(i,j) = P(i)\,a\,D(i,j)^{b} \tag{2}$$


While Equation 2 is the historic gravity model, recent
studies have shown increasing interest in Equation 1, both
on empirical and a priori theoretical grounds. (See
References 11 and 63.) Furthermore, the algebraic difference
gives rise to rather different methodological problems in
each case.
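
As a minimal sketch of the two functional forms, the following
Python functions evaluate Equations 1 and 2 for one origin. The
function names and the numerical values are hypothetical and are
not taken from the Saskatchewan data.

```python
import math

def flow_exponential(pop, dist, a, b):
    """Equation 1: I(i,j) = P(i) * exp(a + b * D(i,j))."""
    return pop * math.exp(a + b * dist)

def flow_power(pop, dist, a, b):
    """Equation 2: I(i,j) = P(i) * a * D(i,j) ** b."""
    return pop * a * dist ** b

# Hypothetical origin: 10,000 people, 50 distance units from the site.
print(flow_exponential(10_000, 50.0, a=-0.5, b=-0.05))
print(flow_power(10_000, 50.0, a=100.0, b=-2.0))
```

Note that a enters Equation 1 inside the exponent and Equation 2 as
a multiplier, so the two fitted values of a are not comparable.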

There are several such problems. First, the non-linear form
of both models means that if standard linear regression
techniques are to be used in calibration, there must be a
transformation of the variables, so that the results of the
analysis appear in units which are often misleading, and
which make comparison with the original data difficult.
Second, visitor flows are composed of integral numbers of
people. If flows are sampled over some limited time period
and used as estimates of long-term interaction, then the
sampling process will be quite different to that normally
assumed in regression analysis. Third, the success of the
analysis, or the degree to which the model fits the data,
will depend on the length of the sampled period, so that the
greater the total visitor flow sampled, the better the fit.
Finally, gravity models have a rather weak theoretical
basis. Such explanations as do exist (see References 41, 51,
63) contain strong assumptions which are easily broken in
the real world, so that the model may suffer from structural
inadequacies in specific situations.

The early sections on basic problems are statistical and
will be of more interest to technical readers; later
sections are more general, and the statistical results are
restated non-technically in the conclusions.


THE CONVENTIONAL APPROACH 


The data set used for illustration and simulation in
this study is the one discussed by Beaman, Knetsch and
Cheung (see TN 19). It gives the visitor flows to Rowan's
Ravine Provincial Park in Saskatchewan, from seventeen
origin areas (see TN 1). In the sampled period, the year


TABLE 1


ANALYSIS OF EQUATION 1


             a        b        R2*      PE***    RMS****
OLS        -.488    -.0547    .755     138.4    217.3
                             (.685)*
NLLS       -.812    -.0381    .905      75.0    108.7
WNLLS      -.365    -.0585    .296     191.8    317.7
                                                (104.1)**
MIN RMS     .368    -.0495    .486      67.8     71.9

*    R2 given first for log I(i,j)/P(i); in parentheses for
     I(i,j).
**   Figure in parentheses computed using weighted
     observations.
***  Mean absolute % error in flow.
**** Root mean square error in flow.


(OLS=Ordinary Least Squares; NLLS=Non-Linear Least Squares;
WNLLS=Weighted Non-Linear Least Squares; MIN PE=Minimum Mean
Absolute Percent Error; MIN RMS=Minimum Root Mean Square
Error.)


19602, there were 9,828 visitor vehicles, including 5,862 
from the observation unit containing the city of Regina, and 
none from four of the areas. Following the earlier work,
each observed flow has been arbitrarily increased by 1 when
calibrating logarithmic models to avoid having to take the
logarithm of zero.

The conventional approach to calibration is to
transform the equations to linearity by taking logarithms.
Equation 1 can be transformed as follows:


$$\ln\bigl(I(i,j)/P(i)\bigr) = a + b\,D(i,j)$$

and Equation 2 thus:

$$\ln\bigl(I(i,j)/P(i)\bigr) = \ln a + b\,\ln D(i,j)$$

so that both equations are of the form y = c + dx, and both
involve two fitted constants. In the first, the negative




TABLE 2


ANALYSIS OF EQUATION 2


             a        b        R2       PE       RMS
OLS        10965    -3.40     .767     111.7    141.5
                             (.344)*
NLLS         163    -2.01     .903      70.8     75.9
WNLLS       8770    -3.50    -.023     236.3    308.6
                                                (93.2)**
MIN PE
           30000    -3.42    -.020      46.7     59.4
MIN RMS


* See footnotes to TABLE 1.


exponential case, ln (I(i,j)/P(i)) is regressed against
D(i,j) to estimate values for a and b, while in the second
ln (I(i,j)/P(i)) is regressed against ln D(i,j) to estimate
ln a and b.
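
The two log-linear calibrations can be sketched as follows. This
is an illustration only, assuming a plain two-variable least
squares fit; the function names are hypothetical, and the +1
adjustment to each flow follows the practice described above.

```python
import math

def ols_line(x, y):
    """Return (c, d) for the least-squares line y = c + d * x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    d = sxy / sxx
    return mean_y - d * mean_x, d

def log_rate(flows, pops):
    """ln((I + 1) / P): each flow is increased by 1 to avoid ln(0)."""
    return [math.log((f + 1) / p) for f, p in zip(flows, pops)]

def calibrate_exponential(flows, pops, dists):
    """Equation 1: regress ln(I/P) on D; returns (a, b)."""
    return ols_line(dists, log_rate(flows, pops))

def calibrate_power(flows, pops, dists):
    """Equation 2: regress ln(I/P) on ln D; returns (a, b)."""
    ln_a, b = ols_line([math.log(d) for d in dists], log_rate(flows, pops))
    return math.exp(ln_a), b
```

In the power-law case the regression estimates ln a, so the
intercept is exponentiated before it is reported as a.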

The results of these OLS, or Ordinary Least Squares,
regressions for the Saskatchewan data can be found in
Figures 1 and 2. The R² values, describing the degree to
which ln (I(i,j)/P(i)) can be predicted from D(i,j) and ln
D(i,j), were .755 and .767 respectively, and the respective
equation coefficients a and b can be found in Tables 1 and
2. Statistically, both analyses were highly significant and
led to a confident rejection (at the 99.9% level) of a null
hypothesis that the same R² values could have been obtained
from a correlation of independent variables.


THE TRANSFORMATION PROBLEM 


The largest residual observed in the negative
exponential regression (Equation 1) was -3.477; in the power
law regression (Equation 2) -3.670. In terms of the
ordinate, ln I(i,j)/P(i), which ranged from -12 to 0, these
residuals were not large and were visually acceptable (see
Figures 1 and 2). But in terms of I(i,j)/P(i), in which the
range is from .00001 to 1, the residuals represented an
overprediction by a factor of roughly 40. Specifically, the
observed flow I(i,j) for the point with the largest residual
was 1; the first OLS model predicted a flow of 32.3, the
second 39.3.
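
The arithmetic behind these figures can be checked directly. The
short sketch below assumes only that a residual is the observed
minus the predicted value of ln I(i,j)/P(i); the function name is
hypothetical.

```python
import math

def predicted_from_residual(observed_flow, log_residual):
    """Undo a log-scale residual.

    residual = ln(observed) - ln(predicted), so
    predicted = observed * exp(-residual).
    """
    return observed_flow * math.exp(-log_residual)

# Largest OLS residuals quoted in the text, for an observed flow of 1.
for model, residual in (("negative exponential", -3.477), ("power law", -3.670)):
    print(model, round(predicted_from_residual(1, residual), 1))
```

A residual of about -3.5 on the logarithmic scale therefore
corresponds to a predicted flow some 30 to 40 times the observed
one.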

Residuals were then recalculated by taking the 
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difference between the observed flow, and the flow predicted | 
values of In I(i,j)/P(i)e Unlike the earlier values, these | 
residuals will not sum to zero for either modele Equivalents | 
to the earlier R* were calculated from the ratio of the sum | 
of squared residuals to the visitor flow sum of squares for 
each model, to give e685 for the negative exponential and | 
0344 for the power law. Clearly these models are much less > 
successful if measured in terms of their ability to predict | 
I(iygj) rather than tn (1CigJj)/PCAI) do 

Computing the difference between observed and predicted
flows inevitably gives most weight to origins which
contribute a high flow, and very little to smaller, more
distant origins. Yet the planner who is interested in the
proportionate difference between the observed and the
predicted would give as much weight to a 10 percent error in
a flow of 50, as to the same percentage error in a flow of
5000. Various authors have suggested that a percentage error
in observed flows is a more useful measurement of residual
variation in the recreation context (see References 16, 17)
and have computed the root mean square percentage error
(RMS) as a substitute for the conventional R² (see TN 19).
The respective values of RMS error for the two models were
217.3 and 141.5, and the means of the absolute percentage
error (PE) 138.4 and 111.7. Despite the high R² values, the
predictive power of the models is rather weak when expressed
in these terms.
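
A minimal sketch of the three measures follows. Two points are
assumptions rather than statements from the text: the flow-based
R² is taken as one minus the ratio of the residual sum of squares
to the sum of squares of the flows about their mean, and
percentage errors are taken relative to the predicted flow, as in
the RMS criterion written out later in this note. All names are
hypothetical.

```python
import math

def r2_in_flows(observed, predicted):
    """1 - SS(residual) / SS(flows about their mean) (assumed definition)."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot

def pct_errors(observed, predicted):
    """Percentage errors, with the predicted flow as denominator (assumed)."""
    return [100.0 * (o - p) / p for o, p in zip(observed, predicted)]

def mean_abs_pct_error(observed, predicted):
    """PE: mean of the absolute percentage errors."""
    errors = pct_errors(observed, predicted)
    return sum(abs(e) for e in errors) / len(errors)

def rms_pct_error(observed, predicted):
    """RMS: root mean square of the percentage errors."""
    errors = pct_errors(observed, predicted)
    return math.sqrt(sum(e * e for e in errors) / len(errors))

# A 10 percent error in a flow of 50 counts as much as one in a flow of 5000.
print(mean_abs_pct_error([55, 5500], [50, 5000]))
```

Switching the denominator to the observed flow is a one-line
change in pct_errors, but zero flows would then need special
handling.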

A more direct approach to the fitting of the general
model would be to avoid the logarithmic transformations
altogether, and fit Equations 1 and 2 by a least squares
procedure, directly minimising the sum of squared differences
between observed and predicted flows. This was done using a
combination of Gaussian and Steepest Descent methods to
minimise the objective, and the relevant statistics are shown
in Tables 1 and 2 for comparison with OLS. This procedure
will be referred to as NLLS, for Non-Linear Least Squares.
Cesario (Reference 11) has discussed a similar approach.
(The widely available TSP (Time Series Processor) package
was used for least squares procedures, running on a CDC
Cyber 73 under Scope 3.4. A listing of the TSP source
program is available from the author.)
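
The TSP program itself is not reproduced here. As a stand-in, the
sketch below fits Equation 1 by a damped Gauss-Newton iteration
(a Marquardt-style blend of Gauss-Newton and steepest descent
steps). It is an assumption that this resembles the combination of
methods used by TSP; the function names and starting values are
hypothetical.

```python
import math

def predict(pop, dist, a, b):
    """Equation 1: predicted flow P(i) * exp(a + b * D(i,j))."""
    return pop * math.exp(a + b * dist)

def sum_sq(a, b, pops, dists, flows):
    """Sum of squared differences between observed and predicted flows."""
    try:
        return sum((f - predict(p, d, a, b)) ** 2
                   for p, d, f in zip(pops, dists, flows))
    except OverflowError:      # a wild trial step: treat it as rejected
        return math.inf

def fit_nlls(pops, dists, flows, a, b, max_iter=200):
    """Damped Gauss-Newton fit of a and b, started from (a, b)."""
    damping = 1e-3
    for _ in range(max_iter):
        saa = sab = sbb = ga = gb = 0.0
        for p, d, f in zip(pops, dists, flows):
            pred = predict(p, d, a, b)
            resid = f - pred
            ja, jb = pred, pred * d        # d(pred)/da and d(pred)/db
            saa += ja * ja
            sab += ja * jb
            sbb += jb * jb
            ga += ja * resid
            gb += jb * resid
        current = sum_sq(a, b, pops, dists, flows)
        while True:
            maa, mbb = saa * (1.0 + damping), sbb * (1.0 + damping)
            det = maa * mbb - sab * sab
            if det == 0.0 or damping > 1e12:
                return a, b                # no further improvement possible
            da = (ga * mbb - gb * sab) / det
            db = (gb * maa - ga * sab) / det
            if sum_sq(a + da, b + db, pops, dists, flows) <= current:
                break
            damping *= 10.0                # worse fit: lean towards steepest descent
        a, b, damping = a + da, b + db, damping / 10.0
        if abs(da) < 1e-12 and abs(db) < 1e-12:
            break
    return a, b
```

In practice the OLS estimates of a and b are a natural starting
point, since the iteration only finds a local minimum.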

There is a marked reduction shown in both RMS and mean
absolute error (PE) and although the R² values should not be
compared to those derived from the OLS models, they can
reasonably be compared to the recomputed values based on
flows which are shown in parentheses. It is clear that the
use of logarithmic transformations can seriously reduce the
validity of gravity-type models, when validity is defined by
RMS or PE error measures.

If it is argued that the RMS or PE measures are the
most effective in the planning context, then ultimately the
most satisfactory way of fitting or calibrating a flow model
must be by direct minimisation of these criteria, rather
than the conventional sum of squared residuals. In fact for
this particular data set the degree of improvement is not
great. The minimum mean absolute percentage error resulted
when predictions were made using the equations:


$$I(i,j) = P(i)\,\exp\bigl(-0.380 - 0.0449\,D(i,j)\bigr)$$


$$I(i,j) = P(i) \cdot 30000 \cdot D(i,j)^{-3.42}$$

and while the second equation also minimised the RMS error
for the power law, the negative exponential was calibrated
with a = 0.368 and b = -0.0495 on this criterion. The
accompanying statistics are shown in Tables 1 and 2 as the
MIN PE and MIN RMS procedures. (The approach used an
interactive procedure in which the user can watch the
performance of a general objective function as each constant
is incremented over ranges prescribed by the user. The
routine was written by the author for the PDP10. Typically
15 minutes of terminal session are required to examine one
objective.)
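
The interactive routine is not available, but its logic of
stepping each constant over a prescribed range and watching the
objective can be sketched as a simple grid search. The names,
ranges and step counts below are hypothetical, and percentage
errors are taken relative to the predicted flow.

```python
import math

def mean_abs_pct_error(a, b, pops, dists, flows):
    """PE for Equation 1, with the predicted flow as denominator."""
    total = 0.0
    for p, d, f in zip(pops, dists, flows):
        pred = p * math.exp(a + b * d)
        total += abs(f - pred) / pred
    return 100.0 * total / len(flows)

def grid_search(objective, a_range, b_range, steps=40):
    """Evaluate objective(a, b) on a regular grid; return (best value, a, b)."""
    (a_lo, a_hi), (b_lo, b_hi) = a_range, b_range
    best = None
    for i in range(steps + 1):
        a = a_lo + (a_hi - a_lo) * i / steps
        for j in range(steps + 1):
            b = b_lo + (b_hi - b_lo) * j / steps
            value = objective(a, b)
            if best is None or value < best[0]:
                best = (value, a, b)
    return best
```

Any objective can be passed in, so the same search serves for the
MIN RMS criterion; narrowing the ranges around the best point and
repeating refines the estimate.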

For comparison, the above equations are plotted in
Figures 1 and 2 with the results of the earlier procedures.
They do not have the same visual impact as the OLS
equations, which pass neatly through the points, since their
objective functions and residuals are based on I(i,j) and
thus only indirectly related to the ordinate log
(I(i,j)/P(i)).


THE HETEROSCEDASTICITY PROBLEM 


In using the linear regression model, one makes the
assumption that residuals are independently distributed with
a constant variance. In terms of the interaction model
transformed by taking the logarithm of I(i,j)/P(i), each
observation is assumed to be subject to the same residual
error distribution, or in terms of I(i,j)/P(i), to the same
proportionate error. If ln (I(i,j)/P(i)) is assumed to have
normally distributed errors, so that errors in I(i,j)/P(i)
will be lognormal, or normal when expressed as proportions,
one has a model for usual linear regression.

In reality, this model is quite inappropriate to the
expected distribution of I(i,j). If the observed visitor
flow is the result of a very large number of samplings of
the origin population under a very small probability, then
I(i,j) can be expected to follow a Poisson distribution (see
TN 19). In other words, its distribution is discrete and
non-normal, with a variance which is equal to its expected
value, and thus varies from observation to observation. The
variance in ln (I(i,j)/P(i)) is compounded by the
distribution of P(i), and clearly violates the
homoscedasticity assumption of OLS.

Beaman et al. (see TN 19) have calculated the variance
in ln (I(i,j)/P(i)) as approximately proportional to
1/I(i,j) by expanding the logarithm function in a Taylor
series. (The error variance of ln (I(i,j)/P(i)) clearly
depends on the value of P(i). Beaman et al. made the
implicit assumption that P(i) is constant in their Equation
5.) Thus the high-flow observations are subject to a much
smaller error variance and merit much greater weight in the
regression. Beaman et al. used a generalised least squares
procedure (GLS) weighted by the inverse of the variance
estimates to fit the logarithmic transformation of Equation
1.
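
The Taylor-series argument can be sketched in two lines. This is
the standard first-order (delta method) approximation and is
offered as an illustration, not as a transcription of the
derivation in TN 19; the symbol m for the expected flow is
notation assumed here.

```latex
% m denotes the expected flow E[I(i,j)]; P(i) is treated as a known constant.
\begin{align*}
\ln I &\approx \ln m + \frac{I - m}{m}
  && \text{(first-order expansion about } m\text{)} \\
\operatorname{Var}\!\left[\ln\frac{I}{P}\right]
  &\approx \frac{\operatorname{Var}[I]}{m^{2}}
   = \frac{m}{m^{2}} = \frac{1}{m}
  && \text{(Poisson: } \operatorname{Var}[I] = m\text{)}
\end{align*}
```

On this approximation an origin contributing several thousand
visitors has a log-scale error variance of well under 0.001, while
an origin contributing four visitors has a variance of about 0.25,
which is why the weights differ so sharply between observations.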

A similar approach can be taken in the direct,
non-linear regression. If we assume a Poisson distribution, the
error variance of I(i,j) in Equations 1 and 2 can be taken
as I(i,j) itself. Then the appropriate weighted least
squares criteria are:


$$\text{Minimize}\quad \sum_i \frac{\bigl(I(i,j) - P(i)\exp(a + b\,D(i,j))\bigr)^{2}}{I(i,j)}$$

and

$$\text{Minimize}\quad \sum_i \frac{\bigl(I(i,j) - P(i)\,a\,D(i,j)^{b}\bigr)^{2}}{I(i,j)}$$


These criteria give the greatest weight to the smaller flow
values, since in terms of I(i,j) these are the most
reliable, whereas for the logarithmic regressions it was the
high flow values that had the least error variance and the
greatest weight. The criteria are quite similar to those for
the direct minimisation of RMS error:


$$\text{Minimize}\quad \left(\frac{1}{n}\sum_i \frac{\bigl(I(i,j) - P(i)\exp(a + b\,D(i,j))\bigr)^{2}}{\bigl(P(i)\exp(a + b\,D(i,j))\bigr)^{2}}\right)^{1/2}$$


and

$$\text{Minimize}\quad \left(\frac{1}{n}\sum_i \frac{\bigl(I(i,j) - P(i)\,a\,D(i,j)^{b}\bigr)^{2}}{\bigl(P(i)\,a\,D(i,j)^{b}\bigr)^{2}}\right)^{1/2}$$


WHERE n is the number of points


except that the denominator is formed from the predicted
rather than the observed flow.
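
The two kinds of criterion can be compared side by side for
Equation 1. This is a sketch only: the function names are
hypothetical, and skipping zero flows in the weighted criterion is
an assumption, since the text does not say how they were handled.

```python
import math

def predict(pop, dist, a, b):
    """Equation 1: predicted flow P(i) * exp(a + b * D(i,j))."""
    return pop * math.exp(a + b * dist)

def weighted_ss(a, b, pops, dists, flows):
    """Poisson-weighted sum of squares: squared error over observed flow."""
    total = 0.0
    for p, d, f in zip(pops, dists, flows):
        if f == 0:
            continue   # assumption: skip zero flows to avoid dividing by zero
        total += (f - predict(p, d, a, b)) ** 2 / f
    return total

def rms_criterion(a, b, pops, dists, flows):
    """Root mean square proportionate error, predicted flow as denominator."""
    total = 0.0
    for p, d, f in zip(pops, dists, flows):
        pred = predict(p, d, a, b)
        total += ((f - pred) / pred) ** 2
    return math.sqrt(total / len(flows))
```

Either function can be handed to a general minimiser in place of
the unweighted sum of squares.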

Again, the results are shown in Tables 1 and 2 and
Figures 1 and 2 for comparison with the earlier procedures.
Error measures were computed without observation weights.

In both cases there is a considerable deterioration in
the degree of fit reflected in the error measures. The
corresponding weighted RMS figures are 104.1 and 93.2
respectively, and give a fairer representation of the degree
of fit. But as would be expected, in neither case do the
figures approach those for a direct minimisation of RMS
error.

It is intuitively reasonable to weight observations in
inverse proportion to their error variance. Since the error
variances are known from theoretical arguments, it is
possible to make corrections for heteroscedasticity by
weighting observations. But in reality the observed error
variance is composed both of a Poisson-distributed part and
a component due to structural inadequacies in the model,
with an unknown distribution. Further, the data also
violate several other assumptions. Errors are not normally
distributed: when expressed as small visitor flows they are
Poisson, with a highly skewed discrete distribution, and
when expressed as log I(i,j)/P(i) they are compounded by the
distribution of P(i). Further, because the values of the
variables themselves are not sampled from a bivariate normal
distribution in any of the models, questions of statistical
inference and parameter estimation are discussed in the next
sections through the results of Monte Carlo simulations.


SIMULATION TECHNIQUES 


The models under examination involve three "sampled"
variables, I(i,j), P(i) and D(i,j). But of these we may
assume that D(i,j) and P(i) are known to considerable
accuracy, and that all error is concentrated in I(i,j). Thus
error will be of two types: a statistical component due to
the use of a flow value sampled over a limited time period
as an estimate of a long-term average flow; and a systematic
component due to the structural insufficiency of the model
in explaining visitor flows.

In the next two sections, the results of simulations of
the statistical error component in the Saskatchewan data set
are discussed with two objectives. First, it is possible to
simulate the sampling distributions of the important
regression parameters and so gain a more quantitative notion
of the relative importance of the statistical and structural
components. Second, simulation can establish the importance
of sample size in parameter estimation. Because of the
complexity of the error distribution, the results will be
specific to the data set and only qualitative generalities
can be expected.

The distribution of I(i,j) is assumed to be Poisson for
a given observation. I(i,j) is an unbiased, maximum
likelihood estimate of the Poisson density parameter m such
that:


$$P(r) = \frac{m^{r}\exp(-m)}{r!}$$


WHERE P(r) is the probability that a Poisson process of
density m will generate exactly r events in a trial.


To simulate the effect of statistical error, the observed
visitor flows were used as estimates of m, and for each
simulation run, each flow was independently replaced by a
random Poisson deviate of that density. Thus the range of
parameters found for models fitted to this simulated data
will represent the range of uncertainty attributable to
statistical error.

Two methods were used to generate the deviates, an
exact method for m less than 30, and an approximation above
that value. In the exact method, a single random number was
generated in the interval 0-100. Poisson probabilities
were calculated from r = 0 until the cumulative total
reached the random number, treated as a percentage, and the
corresponding value of r became the Poisson deviate.
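
The exact method amounts to inverting the cumulative Poisson
distribution. A minimal sketch, assuming a uniform random number
on 0-1 in place of the 0-100 percentage and a hypothetical
function name:

```python
import math
import random

def poisson_deviate_exact(m, rng):
    """Return a Poisson deviate of density m by cumulative summation.

    Probabilities P(0), P(1), ... are accumulated until the running
    total reaches a uniform random number; intended for m below 30.
    """
    u = rng.random()               # uniform on [0, 1)
    r = 0
    prob = math.exp(-m)            # P(0)
    cumulative = prob
    while cumulative < u and prob > 0.0:
        r += 1
        prob *= m / r              # P(r) = P(r - 1) * m / r
        cumulative += prob
    return r

rng = random.Random(1)
print([poisson_deviate_exact(4.0, rng) for _ in range(10)])
```

Each probability is obtained from the previous one by the
recurrence P(r) = P(r-1) m / r, so no factorials need be computed.
For large m the starting value exp(-m) underflows, which is one
reason for switching to an approximation above m = 30.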

The approximate method used two standard statistical
relationships. First, the probability that a Poisson process
of density m will generate a number less than r is identical
to the probability that a χ² process with 2r degrees of
freedom will generate a number greater than 2m (see
Reference 34). Second, the χ² distribution is approximated
by a unit Normal distribution for large numbers of degrees
of freedom:


$$N(0,1) = (2\chi^{2})^{1/2} - (2\nu - 1)^{1/2}$$


Normal deviates were generated by the method described
by Box and Muller (Reference 5), in which two independent
random numbers, uniformly distributed in the interval 0-1
(R1 and R2), are combined to give two independent random
numbers with an approximately unit normal distribution (N1
and N2):


$$N_1 = (-2\ln R_1)^{1/2}\,\sin(2\pi R_2)$$

$$N_2 = (-2\ln R_2)^{1/2}\,\cos(2\pi R_1)$$

Then the normal deviates were converted to a Poisson
distribution via χ².
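
The text does not spell out how a normal deviate was turned into a
Poisson deviate, so the following is only one possible reading.
Combining the two relationships gives P(X <= k) approximately
equal to the unit normal distribution function evaluated at
(4k + 3)**1/2 - 2 m**1/2, which can be inverted for k. The sketch
uses the usual Box-Muller form, in which both deviates share the
radius term computed from R1; the function names are hypothetical.

```python
import math
import random

def box_muller(rng):
    """Return two unit normal deviates from two uniform random numbers."""
    r1 = 1.0 - rng.random()        # in (0, 1], so the logarithm is defined
    r2 = rng.random()
    radius = math.sqrt(-2.0 * math.log(r1))
    angle = 2.0 * math.pi * r2
    return radius * math.cos(angle), radius * math.sin(angle)

def poisson_deviate_approx(m, rng):
    """Approximate Poisson deviate for large m via the chi-square relation.

    Finds the smallest k with z <= sqrt(4k + 3) - 2 sqrt(m), where z is a
    unit normal deviate (an assumed inversion, not taken from the text).
    """
    z, _ = box_muller(rng)
    root = z + 2.0 * math.sqrt(m)
    if root <= 0.0:
        return 0
    return max(0, math.ceil((root * root + 1.0) / 4.0) - 1)

rng = random.Random(2)
print([poisson_deviate_approx(100.0, rng) for _ in range(10)])
```

The second normal deviate is discarded here for simplicity; a
production routine would keep it for the next call.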


DISTRIBUTION PROPERTIES


Two types of simulations were made, one using real
visitor flows as the basis of simulated flows, and the other
using predicted flows from the OLS negative exponential
model as the basis. The results of the first series of
experiments are shown in Table 3. Twenty-four runs were
made, and analysed using the three models based on the
negative exponential: the Ordinary Least Squares procedure
using ln (I(i,j)/P(i)) as the dependent variable (OLS); the
direct least squares calibration of Equation 1 (NLLS); and
the latter using weights estimated from the predicted error
variances (WNLLS).


The results show that for this data set, statistical
error is very much less important than structural error. For
the OLS case, the standard error of the b coefficient in the
original calibration of the model was estimated to be
.008036. Yet successive simulations of statistical errors
produced a range of b coefficients with a measured standard
deviation of only .0005. Similarly, the variation in R² as a
result of simulation suggests that the major source of
unexplained variance is structural. Again, the regression
standard error in the a coefficient is .808, compared to a
standard error in simulation runs of .046.

Results are very similar for the non-linear regression.
The skewness in the RMS and PE distributions is largely
removed, and the consequent bias in single estimates is
reduced. Again, the estimates of a and R² are remarkably
TABLE 3


FIRST SIMULATION EXPERIMENTS


                  a        b        R2       PE
OLS
  Actual        -.488    -.0547    .755    138.4
  Simulation    -.489    -.0547    .754    140.2
  Std Error      .046     .0005    .010      5.3
NLLS
  Actual        -.812    -.0381    .905     75.0
  Simulation    -.814    -.0382    .902     75.1
  Std Error      .047     .0011    .010      2.2
WNLLS
  Actual        -.365    -.0585    .296    191.8
  Simulation    -.378    -.0585    .289    196.4
  Std Error      .054     .0005    .024     15.2


TABLE 4


SECOND SIMULATION EXPERIMENTS


                  a        b        R2       PE
OLS
  Actual        -.488    -.0547   1.000      0.0
  Simulation    -.528    -.0546    .984     20.6
  Std Error      .089     .0019    .010      5.2
NLLS
  Actual        -.488    -.0547   1.000      0.0
  Simulation    -.536    -.0538    .999     20.0
  Std Error      .049     .0010    .001      5.5
WNLLS
  Actual        -.488    -.0547   1.000      0.0
  Simulation    -.505    -.0548    .984     21.1
  Std Error      .072     .0015    .015      7.3


precise, although there is a slight increase in the standard
error for the b coefficients. This is somewhat offset by a
proportionate reduction in the standard error of PE. On the
other hand, the weighted regression results revert to the
pattern shown by OLS, giving slightly better estimates of b
and biased estimates of RMS.

The second set of simulations show the amount of error
introduced into a perfectly predictable set of data by
adding Poisson-distributed errors to visitor flows (Table
4). They show that the statistical error component has
relatively little effect on error statistics by itself, so
that the vast majority of the error observed in the original
data must be structural.

PE and RMS errors are remarkably independent of the
method of calibration when structural error is absent, while
R² is more sensitive to the statistical error component in
OLS and weighted non-linear (WNLLS) regressions. Standard
errors of the coefficients tend to be rather higher than
they were in the first set of simulations with real data,
except in the case of NLLS.

The above simulations, particularly the first set,
provide a rapid method of evaluating the relative importance
of statistical and structural error in the calibration of a
spatial interaction model. Because of the dependence of the
result on the specific configuration of the data set, and
the immense difficulties of any analytic approach, it is
suggested that such Poisson simulations be incorporated into
any calibration of this type of model, particularly in cases
where samples are small, and statistical error
correspondingly high.
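
A Poisson simulation of this kind can be sketched in a few lines
for the OLS case. The sketch makes two simplifications that are
not in the text: flows of 30 or more are perturbed with a rounded
normal deviate rather than the χ² route, and the data in the usage
example are invented. All function names are hypothetical.

```python
import math
import random
import statistics

def poisson(m, rng):
    """Poisson deviate: cumulative sum below 30, rounded normal above."""
    if m >= 30:
        return max(0, round(rng.gauss(m, math.sqrt(m))))
    r, prob, u = 0, math.exp(-m), rng.random()
    cumulative = prob
    while cumulative < u and prob > 0.0:
        r += 1
        prob *= m / r
        cumulative += prob
    return r

def ols(x, y):
    """Intercept and slope of the least-squares line y = c + d * x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    d = (sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
         / sum((xi - mean_x) ** 2 for xi in x))
    return mean_y - d * mean_x, d

def simulate_ols(flows, pops, dists, runs=24, seed=0):
    """Refit Equation 1 by OLS on Poisson-perturbed flows.

    Returns (mean a, std dev a, mean b, std dev b) over the runs.
    """
    rng = random.Random(seed)
    a_runs, b_runs = [], []
    for _ in range(runs):
        simulated = [poisson(f, rng) for f in flows]
        y = [math.log((s + 1) / p) for s, p in zip(simulated, pops)]
        a, b = ols(dists, y)
        a_runs.append(a)
        b_runs.append(b)
    return (statistics.mean(a_runs), statistics.stdev(a_runs),
            statistics.mean(b_runs), statistics.stdev(b_runs))

# Invented data: five origins at increasing distance from the site.
flows = [5000, 800, 300, 60, 10]
pops = [100000, 50000, 60000, 40000, 30000]
dists = [10.0, 30.0, 50.0, 70.0, 90.0]
print(simulate_ols(flows, pops, dists))
```

Comparing the simulated standard deviations with the regression
standard errors of the original fit indicates how much of the
unexplained variation can be attributed to sampling alone.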


AGGREGATION EFFECTS 


Sample size will affect the calibration of a spatial
interaction model in various ways. The statistical error
component has known properties, if we can continue to assume
that its distribution is Poisson, so that the error variance
in observed flows will increase in direct proportion to the
length of time sampled. The standard deviation will be
proportional to the square root of sample size, so that it
will decrease in proportion to the observed flow as sample
size increases.

While statistical error may be well understood, there
is no similar basis for assumptions about structural errors.
In one particular case, structural error might appear as a
normally distributed residual, as a result of other,
external effects on visitor flows. In another case,
structural error might be the result of a misspecification
of the functional form of the model, perhaps in the effect
of distance. It would seem reasonable on dimensional grounds
to assume that the standard deviation of the structural
error term will be in constant ratio to the observed flow,
and thus rise in direct proportion to sample size. Thus


$$I(i,j) = \{I\}(i,j) + C(1)\,\{I\}(i,j)^{1/2} + C(2)\,\{I\}(i,j)$$


WHERE {I}(i,j) is the expected flow in
the sampled interval;

C(1) {I}(i,j)**1/2 is the statistical
error component;


C(2) {I}(i,j) is the structural component.
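
The consequence of this decomposition for sample size can be
illustrated numerically. In the sketch below C(1) = 1 corresponds
to a pure Poisson standard deviation, while the value of C(2) is
invented for illustration; the function name is hypothetical.

```python
import math

def relative_errors(expected_flow, c1=1.0, c2=0.5):
    """Return (statistical, structural) error as a fraction of expected flow.

    Statistical component: c1 * sqrt(expected flow).
    Structural component:  c2 * expected flow.
    """
    statistical = c1 * math.sqrt(expected_flow)
    structural = c2 * expected_flow
    return statistical / expected_flow, structural / expected_flow

for flow in (100, 10_000):
    print(flow, relative_errors(flow))
```

A hundredfold increase in the sampled flow reduces the relative
statistical error tenfold but leaves the relative structural
error unchanged, so a longer sampling period cannot by itself
remove the structural component.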


STRUCTURAL ERROR 


The gravity model is a purely inductive device, open to
charges of curve-fitting which can only be countered by
relating the excellent degrees of fit invariably obtained.
The first part of this paper has concentrated on two
problems encountered in calibrating the model: variations
in performance between different procedures; and effective
measurement of the goodness of fit. But while it is possible
to evaluate the relative importance of statistical and
structural error, and to make allowances for the latter, it
is clear that the existence of the structural component
represents an important deficiency in the model.

Any approach to the reduction of structural error must
begin at the theoretical level with a discussion of the
basis for the model's excellent fit to reality. The
literature offers two major classes of explanation, first
through hypotheses made at the aggregate level, and second
through hypotheses about the behaviour of individuals in the
system.


AGGREGATE THEORIES 


Wilson (see, for example, Reference 63) has shown that
under certain assumptions, trips made in a completely random
way between origins and destinations will aggregate so that
the total interaction between origin i and destination j is
given by:


$$I(i,j) = k_o\,k_d\,O(i)\,D(j)\,\exp\bigl(-b\,c(i,j)\bigr) \tag{3}$$

WHERE O(i) is the total flow from i,

D(j) is the total flow to j,

c(i,j) is the cost of one trip, and

ko, kd and b are constants,

which is algebraically similar to Equation 1.

The method can be called an entropy maximising strategy
because there is a useful analogy to Boltzmann's statistical
thermodynamics. Suppose the I(i,j) are known so that a
table can be constructed to show the flows between each
origin and each destination. It is possible to calculate the
number of ways in which specific individuals can be assigned
to specific flows, given the table of totals along each
path. For example, if all individuals are in origin 1 and
travel to destination 1, then only one arrangement of
individuals is possible, while if trips occur between
multiple origins and destinations there are many ways in
which the individuals can be rearranged without changing the
totals.
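
The counting argument can be made concrete. For a table of flows
with N individuals in total, the number of distinct assignments of
individuals to paths is the multinomial coefficient N! divided by
the product of the factorials of the cell entries. The sketch
below uses this standard formula; the function name and the small
tables are hypothetical.

```python
import math

def arrangements(table):
    """Number of ways to assign distinct individuals to the flows in table.

    table[i][j] is the number of trips from origin i to destination j.
    """
    total = sum(sum(row) for row in table)
    ways = math.factorial(total)
    for row in table:
        for cell in row:
            ways //= math.factorial(cell)   # exact at every step
    return ways

# Four individuals, all travelling from origin 1 to destination 1.
print(arrangements([[4, 0], [0, 0]]))
# The same four individuals spread evenly over the four paths.
print(arrangements([[1, 1], [1, 1]]))
```

The concentrated table admits a single arrangement, while the
evenly spread table admits many; the entropy maximising argument
selects the table with the most arrangements among those that
satisfy the constraints.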

Not all patterns of flows are possible. The total flow
from origin i must equal O(i), and arrivals at j must equal
D(j). Wilson introduces a further constraint that the total
cost of all trips made in the system be C.

In a purely random world, in which no behavioural rules
exist other than the constraints, it is reasonable to assume
that all possible arrangements of trips are equally likely.
This means that it is possible to calculate the probability
of each pattern of flows from the numbers of alternative
arrangements of individuals associated with each pattern,
and to find the most likely pattern. The result is Equation
3. Provided the number of individuals in the system is
large, this most likely pattern proves to be so much more
likely than any other that it is possible to predict with
some confidence that it will be the one found in reality.

There are three major ways in which structural
inadequacies can arise in Wilson's analysis, and be
reflected in structural errors in the model. First, the
model invokes what is sometimes known as the Principle of
Insufficient Reason (see Reference 31) and assumes that all
arrangements of individuals are equally likely. This will
be untrue if, for example, certain kinds of individuals show
a preference for certain types of trips, or if a set of
individuals is associated with a particular origin and may
only make trips from that origin. Second, the constraint
that the total cost be C may not reflect behavioural
reality. It is more likely that total cost fluctuates as the
sum of a number of fluctuating individual expenditures.
Third, behavioural traits may take the form of further
constraints which limit the set of possible arrangements,
and may lead to different conclusions.


BEHAVIOURAL THEORIES 


In principle, a theoretical base could be established
for the gravity model if it could be shown that a large
number of individual actions, each made according to some
general rule, would combine to produce aggregate flows which
were in agreement with the model. Ewing (Reference 18)
discusses some of the relevant problems. In essence, in
order to aggregate behaviour in which distance appears as at
least a partial criterion, one must assume some kind of
geometrical arrangement of origins and destinations, and so
the aggregate flows will be unique to the geometry. It is
possible for behaviour according to a general principle to
combine with a specific geometry to produce a specific
aggregate pattern, and it is conceivable that a specific
behavioural principle might combine with a specific geometry
to produce a general aggregate pattern, fitting the gravity
model. But there is a logical impossibility in conceiving of
a general behavioural rule which would give a general
aggregate pattern independent of the specific geometry, and
there is no general principle of human geometry.

Niedercorn and Bechdoldt's analysis (Reference 43) has
received a great deal of attention. They consider a single
individual in origin i, and the set of trips that he or she
makes through time to various destinations j. Each trip will
have a certain utility to the individual, which is assumed
to depend on the number of purposes which are satisfied by
the trip, which in turn is assumed to depend on some
attribute of the destination, depending on the context. Each
trip has an associated cost, and the individual is assumed
to be constrained by a travel budget. Under these
conditions, the pattern of trips that maximises the total
utility to the individual is shown to be of the form of
Equation 2, under specific assumptions about the utility
function.

When aggregated, Niedercorn and Bechdoldt's analysis is
capable of predicting the relative magnitudes of flows from
a single origin. In a system of many origins and
destinations, it predicts that the flows from a single
origin will be proportioned in a way which is compatible
with the gravity model, but the relative magnitudes of flows
from different origins will depend on the geometrical
arrangement of destinations around each origin and so can
only be in accordance with the model in specific geometrical
situations.


DYNAMIC ATTRACTIVITY 


One characteristic of the model which has not so far
been important is that the independent, predictor variables
are usually assumed exogenous with constant values
determined from physically measurable parameters. The
analysis in the earlier sections was in the framework of a
single destination. The model differentiates multiple
destinations both on a geometric basis through D(i,j) and
through the parameter A(j), the factor which explains
differential flows when distances are constant. To some
extent A(j) will be exogenous, related in the recreation
context to the physical characteristics of a site and its
immediate surroundings. But in most cases A(j) will also be
related to the use that site j is actually experiencing, and
thus be at least partly endogenous. A feedback loop will
operate to make the site less attractive when the use is
heavy, and perhaps also when it is excessively light.

The existence of dynamic determinants of attractiveness
in an interaction system is not necessarily a source of
structural error. Provided A(j) is regarded not as an
exogenous variable but as a parameter that can be expected
to change whenever the set of I(i,j) changes, then no
contribution to structural error will result. But as such,
the model will have no predictive power. Structural errors
will occur under sets of hypothetical flows unless the A(j)
are themselves modelled in terms of the determining flows
and exogenous physical parameters. Cesario (Reference 11)
refers to this as Stage II of a spatial interaction
analysis.


MINIMIZING STRUCTURAL ERRORS 


Both of the above sources of structural error can be
minimised by an appropriate modelling procedure. Suppose
that it is possible to assume that any set of individuals,
when presented with a set of parks of given attractiveness
at given distances, will proportion themselves in the same
way, regardless of how many other alternative destinations
exist. In effect, this is the Luce choice axiom (see
Reference 40). Attractiveness may itself be dynamically
related to use, and the actual flows from the origin to any
site will depend on the alternatives available: the
hypothesis concerns the relative proportions only.

The hypothesis can be written as follows:


\[ \frac{P(\to j \mid i \to)}{P(\to k \mid i \to)} = \frac{f(A(j), D(i,j))}{f(A(k), D(i,k))} \]

OR

\[ P(\to j \mid i \to) = \frac{f(A(j), D(i,j))}{\sum_k f(A(k), D(i,k))} \]


WHERE P(--> j | i -->) is the probability that an individual,
having left origin i, will go to destination j, and the
summation over k is over all destinations available from
origin i. In other words, the relative proportions visiting
j and k are determined by the relative values of some
function of attractiveness and distance for each
destination.

Now the probability that an individual resident at i
will make the trip to j is:


\[ P(i \to j) = P(i \to) \, P(\to j \mid i \to) \]


WHERE P(i -->) is the probability of leaving i to go
anywhere. Thus


\[ I(i,j) = P(i) \, P(i \to j) = \frac{P(i) \, P(i \to) \, f(A(j), D(i,j))}{\sum_k f(A(k), D(i,k))} \]


Then the flow from the origin to the jth site will be of the
form


\[ I(i,j) = E(i) \, f(A(j), D(i,j)) \]


WHERE E(i) is a parameter which depends on the alternatives
available from origin i as well as on P(i). Provided E(i)
and A(j) are regarded as parameters unique to the given
flows and system geometry, and not equated with P(i)
or with observable properties of the site, then neither
source of structural error need exist. If the model remains
inadequate, it must be because incorrect assumptions are
made about the function f, or because the Luce choice axiom
does not hold.
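
The proportioning hypothesis can be illustrated with a short sketch. The function f used here (attractiveness divided by a power of distance) and all numerical values are assumptions chosen only for illustration; the text leaves the form of f open.

```python
def destination_shares(attractiveness, distances, f):
    """Shares P(--> j | i -->) for one origin under the proportioning hypothesis."""
    weights = [f(a, d) for a, d in zip(attractiveness, distances)]
    total = sum(weights)
    return [w / total for w in weights]


def gravity_f(a, d, beta=2.0):
    # Hypothetical form of f: attractiveness over distance to the power beta.
    return a / d ** beta


def flows_from_origin(population, p_leave, attractiveness, distances, f=gravity_f):
    """I(i,j) = P(i) * P(i -->) * share of destination j."""
    shares = destination_shares(attractiveness, distances, f)
    return [population * p_leave * s for s in shares]


# Illustrative values only (not taken from the study).
A = [10.0, 5.0, 20.0]
D = [4.0, 2.0, 8.0]

three_sites = destination_shares(A, D, gravity_f)
two_sites = destination_shares(A[:2], D[:2], gravity_f)

# The ratio between the first two destinations is the same whether or not
# the third destination is available.
ratio_with_third = three_sites[0] / three_sites[1]
ratio_without_third = two_sites[0] / two_sites[1]

flows = flows_from_origin(1000.0, 0.3, A, D)
```

In this sketch E(i) corresponds to the product of population and leaving probability divided by the sum of the weights, so it changes whenever a destination is added or removed even though the relative proportions do not.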


PERIPHERAL PROBLEMS 


The total use of Rowan's Ravine Provincial Park in 1969
amounted to 9,828 vehicles, or more properly their
occupants. None of the modelling procedures recognised this
as a constraint. In linear regression, it is true that the
sum of predicted values equals the sum of observed values,
but not in non-linear regression or in cases where a linear
model is fitted to transformed data. Boyet and Tolley
(Reference 6) recognised this problem, but it is difficult
to deal with in the standard methodological practice.

In Wilson's analysis (see Reference 63) there are
constraints both on the total arriving at a destination, and
on the total leaving an origin, since ko and kd are not
fitted parameters, but are solved for in the constraint
equations. Iterative methods have been developed for fitting
the model so that predicted and observed flows will obey the
same constraints.
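
One generic iterative scheme of this kind is iterative proportional fitting, sketched below. It is offered only as an illustration of how origin and destination totals can both be enforced; it is not claimed to be the procedure used in Reference 63, and the seed matrix and totals are invented.

```python
def balance_flows(seed, origin_totals, dest_totals, iterations=200):
    """Rescale a positive seed matrix until row sums match origin_totals
    and column sums match dest_totals (iterative proportional fitting)."""
    rows, cols = len(seed), len(seed[0])
    flows = [row[:] for row in seed]
    for _ in range(iterations):
        # Scale each row to its origin total.
        for i in range(rows):
            row_sum = sum(flows[i])
            if row_sum > 0:
                scale = origin_totals[i] / row_sum
                flows[i] = [v * scale for v in flows[i]]
        # Scale each column to its destination total.
        for j in range(cols):
            col_sum = sum(flows[i][j] for i in range(rows))
            if col_sum > 0:
                scale = dest_totals[j] / col_sum
                for i in range(rows):
                    flows[i][j] *= scale
    return flows


# Hypothetical unconstrained predictions and observed totals.
seed = [[1.0, 2.0], [3.0, 1.0]]
origin_totals = [30.0, 70.0]
dest_totals = [60.0, 40.0]

balanced = balance_flows(seed, origin_totals, dest_totals)
row_sums = [sum(row) for row in balanced]
col_sums = [sum(balanced[i][j] for i in range(2)) for j in range(2)]
```

The origin and destination totals must have the same grand total for both sets of constraints to be satisfied at once.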


SUMMARY 


The points raised in the paper can be divided into two
groups. First, there are statistical issues involved in the
fitting of models of the form represented by Equations 1 and
2 to spatial interaction data. The conventional method of
least squares on transformed variables does not give the
"best" fit to the model as that term would be understood by
a recreation planner; the method does not minimise
discrepancies between flows observed and those predicted by
the model. The conventional measure of success is based on
transformed variables, and shows quite different results
when recomputed for the fit between observed and predicted
flows.

Second, several points were made about the structural
validity of spatial interaction models, and the generality
of model parameters. The paper outlined the conditions
under which the model could be consistent with underlying
general principles of human spatial behaviour, and thus
regarded as a general model.


CORD STUDY TECHNICAL NOTE 36


THE MEANING OF R²
IN USING ANALYSIS OF VARIANCE
TO EXPLAIN PARTICIPATION
IN RECREATION ACTIVITIES


M.F. Goodchild


ABSTRACT 


The probability of an individual's participation in a
recreation activity has been modelled as a linear
combination of socioeconomic variables, or traits. Such
models are calibrated by using as the dependent variable a
binary observation of whether each individual did or did not
participate in the activity.

The R² statistic for these models is shown to be
dependent only on the statistical relationship between
probability and binary observation, and in no way on the
validity of the model. Estimates of coefficients are
unbiased but their standard errors are underestimated.

An alternative index is proposed and demonstrated by
simulation.


BACKGROUND 


CORD Study Technical Notes 6, 12, 20, and 29
examined the use of analysis of variance as a method of
modelling participation in recreation activities. Briefly,
it was argued that in order to project demand for a given
activity in a community, a model must incorporate shifting
socioeconomic patterns. Rising incomes and aspirations
create changes in leisure habits which in turn are reflected
in varying patterns of demand for both indoor and outdoor
activities.

The model proposed characterizes an individual by a
bundle of socioeconomic variables, and postulates that some
combination of these determines the probability of
participation in a given activity by that individual:


\[ p = a(0) + \sum_i a(i) x(i) + \varepsilon \tag{1} \]


WHERE ε is an error term, to be discussed below,
a(0) ... a(m) is a vector of constants,
x(i) is the value of the ith socioeconomic variable,
m is the number of such variables,
and the summation is over m.


In many cases the values of x(i) will be restricted to 1 or
0, corresponding to the individual's possession of the ith
socioeconomic trait, or his lack of it, respectively. (TN 12
used the terminology of the analysis of variance, since all
socioeconomic variables were considered as binary traits.
The terms and symbols used in this paper are those of
multiple regression, since many of the conclusions apply
equally to independent variables measured on continuous
scales. The Time Series Processor (TSP) was used for all
simulations, in conjunction with small FORTRAN routines for
data generation and post-TSP analysis.)

To predict and project participation levels in a
community as a whole, the model is simply summed over the
members of the community:


\[ E(n) = \sum_j p(j) = \sum_j a(0) + \sum_j \sum_i a(i) x(i,j) + \sum_j \varepsilon(j) = N a(0) + \sum_i a(i) n(i) + \varepsilon^{*} \]


WHERE E(n) is the expected number of participants,
N is the population of the community,
n(i) is the population having the ith trait, and
j represents a single individual;
the summation of p(j), a(0) and ε(j) is over N, and
the double summation of a(i)x(i,j) is over N and m.


TN 12 illustrates the calibration of the model against
the results of a survey of individual participation, and the
use of the model in projecting recreation demand. The
present paper examines a series of problems in the use of
the model, and in the interpretation of the results,
concentrating on the disappointingly low R² values, or
measures of the model's fit to reality, that are customarily
obtained.


THE BASIS OF THE PROBLEM 


In calibrating the model, the dependent variable is not
an observation of the probability p that an individual will
participate, but an observation of the individual's actual
behaviour. If the individual was observed to participate,
the dependent variable is given the value of 1, and if not,
0. The relationship between this dichotomous variable and
the probability p is statistical: the probability that a 1
will be observed in a single trial is p, and that a 0 will
be observed, 1-p.

The effect of calibrating with the dichotomous
variable, which will be referred to as y, rather than p, can
be thought of as the addition of another error term to the
model. The conventional error in p, which was introduced in
Equation 1 as ε, will be referred to as structural error,
while the second source introduced by using y as the
dependent variable will be called statistical error. While
structural error is an empirical quantity, statistical error
should be perfectly predictable.

Suppose that structural error is absent, so that the
entire error in the model is due to the statistical
component. Consider a very large sample, and let the
distribution of p values be such that the probability of
finding a case in the interval p to p + dp is f(p)dp. Since
p must lie in the range 0 to 1 it must be true that:


\[ \int_0^1 f(p) \, dp = 1 \]


The probability of observing y to be 1 is p, and 0,
1-p. With a sample of N, the expected number of observations
in the interval for which y = 1 will be Npf(p)dp, and for
y = 0, N(1-p)f(p)dp. So the observed sum of squared
deviations between y's and p's, or between observed and
predicted values of the dependent variable, will be:


\[ N p f(p) (1-p)^2 \, dp + N f(p) (1-p) p^2 \, dp = N f(p) \, p(1-p) \, dp \]


The total sum of squares for all such intervals will be:


\[ \int_0^1 N f(p) \, p(1-p) \, dp \]


which is thus the sum of squares of y's about p's.


Since y takes only the values 0 and 1, the sum of its
squares equals the number of observed 1's, and the total sum
of squares for the dependent variable y about its own mean
is:


\[ \int_0^1 N p f(p) \, dp - \frac{1}{N} \left( \int_0^1 N p f(p) \, dp \right)^2 \]


Thus the value of R² that will be observed in the absence of
structural error has an expected value of:


\[ R^2 = \frac{k_2 - k_1^2}{k_1 - k_1^2} \]


WHERE


\[ k_1 = \int_0^1 p f(p) \, dp \quad \text{and} \quad k_2 = \int_0^1 p^2 f(p) \, dp \]


So the asymptotic value of R² due to the statistical
error component can be predicted from the first and second
moments of the distribution of p. Consider a simple example.
Suppose that the model predicts only two values of p, 0 and
1. That is, under certain socioeconomic conditions,
individuals will certainly participate, and under all other
conditions will certainly not.

Suppose that the probability of either condition
occurring is 1/2. Then:


\[ k_1 = 1 \times 1/2 + 0 \times 1/2 = 1/2 \]
\[ k_2 = 1^2 \times 1/2 + 0^2 \times 1/2 = 1/2 \]
\[ R^2 = 1 \]


Since p is restricted to 0 and 1, the y's must be
respectively 0 and 1, there is no statistical error, and the
fit is perfect.

Now let the two equally likely combinations of independent
attributes give rise to p values of 1/3 and 2/3. This time
statistical error is present, and R² deteriorates
dramatically:


\[ k_1 = 1/2; \quad k_2 = 5/18; \quad R^2 = 1/9 \]


The foregoing discussion is of course limited to the
asymptotic case. In principle, it is possible to calculate
the distribution of R² for samples of limited size, but the
tractability of the problem will depend very much on the
distribution of p, and therefore on the empirical x(i) and
a(i). For this reason the developments which follow are
based on simulation of a few realistic cases, rather than on
general mathematical analysis.
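
The two worked examples above can be checked with a short sketch that evaluates the asymptotic R² from the moments k1 and k2 of a discrete distribution of p. The function name and the use of exact fractions are choices made for this illustration.

```python
from fractions import Fraction


def asymptotic_r2(p_values, weights):
    """Asymptotic R-squared when only statistical error is present.

    p_values are the distinct probabilities predicted by the model and
    weights are the proportions of cases having each value.
    """
    k1 = sum(w * p for p, w in zip(p_values, weights))
    k2 = sum(w * p * p for p, w in zip(p_values, weights))
    return (k2 - k1 ** 2) / (k1 - k1 ** 2)


half = Fraction(1, 2)

# p restricted to 0 and 1: behaviour is certain, so the fit is perfect.
certain = asymptotic_r2([Fraction(0), Fraction(1)], [half, half])

# p values of 1/3 and 2/3: statistical error dominates.
uncertain = asymptotic_r2([Fraction(1, 3), Fraction(2, 3)], [half, half])

print(certain, uncertain)  # 1 1/9
```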


NATURE OF THE STRUCTURAL ERROR TERM 


Many factors influence the likelihood of an
individual's participation in an activity besides the
socioeconomic traits x(i) included in the model. These include
the effects of varying levels of the supply of opportunities
for recreation (see TN 29), of the individual's own learning
process, and of attitudes and perceptions. Furthermore, the
model may not include interaction effects between two or
more traits. Thus, it may be assumed that education is
uniformly influential as a trait regardless of the presence
or absence of other traits. But education may take part in
interactive effects with other factors. If, for example,
university education affects a certain participation
probability only when coupled with high income, its
influence will go undetected and appear in the error term,
along with any other such interaction effects, or may
actually distort the model. The point has been dealt with in
detail in TN 20.

It is possible, then, to conceive of a variety of
models for the error term in Equation 1 (for one type of
error, see the Cicchetti and Smith article attached as an
appendix to this volume). It might be simulated as an
interaction effect by taking a certain value when both of
two interacting traits are present, and zero otherwise. The
error term due to supply and learning factors might be
appropriately modelled by the conventional regression error
term, an independent, normally distributed variate of zero
mean. The latter approach was taken in the simulations which
follow.

While the range of empirical p values is clearly
restricted to between 0 and 1, there is no explicit
requirement that the coefficients of the model be selected
so that all predicted values of p lie in that range. In
simulation experiments, it is quite likely that p will be
driven above 1 or below 0 by the addition of large error
terms.

In calibrating the model in other CORD studies this
problem has been dealt with by the use of a modified
regression procedure which restricts predicted p values to
the prescribed range. In these simulations the range has
been restricted by truncating any error which would
otherwise have driven a p value above 1 or below 0. But in
the long term, the problem would be better dealt with by a
respecification of the model. Suppose that Equation 1 were
to become:


\[ p = \frac{1}{\pi} \arctan\left( a(0) + \sum_i a(i) x(i) + \varepsilon \right) + \frac{1}{2} \]


Then the limits p = 0 and p = 1 would become asymptotes
such that even the most favourable or unfavourable
combinations of trait variables never quite result in
inevitable behaviour, and no restrictions are placed on the
values of the a(i) or ε. Calibration of the respecified model
is more difficult, because in the appropriate linearized
form all values of the dependent variable y become plus or
minus infinity. But it would be quite possible to calibrate
the model in the non-linear form given above by an
appropriate iterative procedure.
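
A minimal sketch of the arctan respecification follows. The function names and the coefficient values are assumptions made for illustration; only the functional form comes from the text.

```python
import math


def arctan_probability(a0, coefficients, traits, error=0.0):
    """p = (1/pi) * arctan(a0 + sum of a(i) * x(i) + error) + 1/2."""
    z = a0 + sum(a * x for a, x in zip(coefficients, traits)) + error
    return math.atan(z) / math.pi + 0.5


def linearized(p):
    """Inverse transform tan(pi * (p - 1/2)), which recovers the linear
    predictor and grows without bound as p approaches 0 or 1."""
    return math.tan(math.pi * (p - 0.5))


# Even very large coefficients leave p strictly inside the range 0 to 1.
extreme_high = arctan_probability(0.0, [100.0, 100.0, 100.0], [1, 1, 1])
extreme_low = arctan_probability(0.0, [-100.0, -100.0, -100.0], [1, 1, 1])

# A moderate case, and its recovery through the linearized form.
moderate = arctan_probability(0.3, [0.2], [1])
recovered = linearized(moderate)
```

Applying the linearized form to a binary observation of exactly 0 or 1 yields an enormous magnitude in floating point, which is the practical counterpart of the infinite values mentioned above.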


SIMULATIONS (1) 


The simulations were made with a set of nine
independent, binary socioeconomic trait variables. Each one
was simulated by generating a uniformly distributed random
number in the interval 0 to 1, and then rounding to an
integer, so that in each case the probability of the trait
being present was 1/2.

p values were calculated from Equation 1 with all
constants set to 0.1. The ε values were normally distributed
with mean 0 and with standard deviation σ(ε) determined for
each simulation, so that the amount of structural error
could be varied freely. The method of Box and Muller (see
Reference 5) was used to generate normally distributed
deviates from pairs of random numbers in the interval 0 to
1.

y values were generated from the p's according to the
value of a further uniformly distributed random number in
the interval 0 to 1. If this number was greater than p, y
was set to 0, and vice versa.
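
The data generation described above can be sketched as follows. This is an illustrative reconstruction in Python, not the original FORTRAN routines; the function names, the seed, and the sample size are assumptions.

```python
import math
import random


def box_muller(rng):
    """One standard normal deviate from a pair of uniform random numbers."""
    u1 = 1.0 - rng.random()  # lies in (0, 1], which avoids log(0)
    u2 = rng.random()
    return math.sqrt(-2.0 * math.log(u1)) * math.cos(2.0 * math.pi * u2)


def simulate_individual(rng, n_traits=9, constant=0.1, sigma=0.0):
    """Return (traits, p, y) for one simulated individual."""
    # Each trait is present with probability 1/2.
    traits = [1 if rng.random() >= 0.5 else 0 for _ in range(n_traits)]
    # Equation 1 with every constant a(0) ... a(9) equal to `constant`.
    p = constant + constant * sum(traits)
    if sigma > 0.0:
        p += sigma * box_muller(rng)
    # Truncate so that p stays within the range 0 to 1.
    p = min(1.0, max(0.0, p))
    # y is 0 when a further uniform number exceeds p, and 1 otherwise.
    y = 0 if rng.random() > p else 1
    return traits, p, y


rng = random.Random(1969)  # arbitrary seed, for repeatability only
sample = [simulate_individual(rng) for _ in range(2000)]
observed_rate = sum(y for _, _, y in sample) / len(sample)
```

With no structural error the observed participation rate should lie near the mean of p, which is 0.55 for these settings.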

The first set of simulations demonstrates the
dependence of R² on statistical error when structural error
is absent. The results are shown in Table 1. In each case
the fitted vector of coefficients was compared to the
original set used to calculate p values. The deviations were
expressed in standard errors, and the table shows the mean
absolute deviation and the standard deviation of the
coefficient deviation, for each run. According to standard
regression assumptions, the expected values are 0.798 and
1.0 respectively.

The distribution of p, and the moments k1 and k2, are
readily calculated, since p is the sum of the constant a(0)
and nine equally weighted binomial variates x(1) to x(9). The
probability that exactly r of these variates have the value
1 and the rest 0 is given by the binomial distribution:


\[ \binom{n}{r} P^r (1-P)^{n-r} \]


WHERE n is the number of binomial variates, and
P is the probability that any one has the value 1.


The mean of the distribution is simply nP, and the
variance nP(1-P), so that:


\[ k_1 = 9 \times 1/2 \times 0.1 + 0.1 = 0.55 \]
\[ k_2 - k_1^2 = 9 \times 1/2 \times 1/2 \times 0.1 \times 0.1 = 0.0222 \]
\[ R^2 = 0.0222 / (0.55 - 0.55^2) = 0.0898 \]


This is the asymptotic value of R², to which the values
in the table tend as sample size increases.
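
The moments can also be obtained by enumerating the binomial distribution directly, as in the sketch below. The function name and its default arguments are assumptions that mirror the simulation settings. Exact enumeration gives a variance of 0.0225 and a ratio of about 0.0909, marginally above the 0.0222 and 0.0898 quoted in the text, so the quoted figures are best read as approximate.

```python
from math import comb


def p_moments(n=9, trait_prob=0.5, constant=0.1, weight=0.1):
    """First and second moments of p = constant + weight * r,
    where r is the number of traits present out of n."""
    k1 = 0.0
    k2 = 0.0
    for r in range(n + 1):
        prob = comb(n, r) * trait_prob ** r * (1 - trait_prob) ** (n - r)
        p = constant + weight * r
        k1 += prob * p
        k2 += prob * p * p
    return k1, k2


k1, k2 = p_moments()
variance = k2 - k1 ** 2
r2 = variance / (k1 - k1 ** 2)
```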


THE STRUCTURAL COMPONENT 


The structural error component can now be reintroduced.
Each individual value of p is distorted by a random quantity
ε, which is assumed to have a Normal distribution with a
mean 0 and standard deviation σ(ε).

When structural errors are included, the sum of squares
about the regression line becomes:


\[ \sum (y-p)^2 = \int_0^1 (1-p)^2 N (p+\varepsilon) f(p) \, dp + \int_0^1 p^2 N (1-p-\varepsilon) f(p) \, dp \]
\[ = \int_0^1 N f(p) \, (p + \varepsilon - p^2 - 2p\varepsilon) \, dp \]


Since only linear terms in ε appear in the result, and ε
has a mean of zero, it follows that for moderate values of
σ(ε) the existence of structural error will have no effect
on R². The proportion of variance explained by the model
will be a function of statistical error alone, and will not
reflect any structural deficiencies in the analysis.

In the second set of simulations the statistical
influence was held constant, and the amount of structural
error varied by changing the standard deviation of ε. The
prediction that R² would remain constant for moderate values
of σ(ε) is borne out by Table 2. The two measures based on a
comparison of the fitted coefficients with the original
simulation values show a quite systematic deterioration in
the model's ability to recover coefficients as structural
error increases. However, there is no such systematic effect
on R², which remains significant at the 95% level on the
standard test and is as close to the asymptotic value of
0.0898 when σ(ε) = 1 as it is when σ(ε) = 0. The inevitable
conclusion is that low R² values are a result of the nature
of the dependent variable and do not necessarily indicate
any structural weakness in the model.

It is clear from Table 1 that the statistical error
component behaves in a similar manner to the error in a
standard regression model. The two measures of coefficient
recovery remain adequately close to their expected values.
However, structural error behaves rather differently.
Increasing values of σ(ε) cause a deterioration in the
coefficient measures, but without a corresponding change in
R². The result is that the coefficient measures show a
trend to values that are well outside the expected range
under standard regression assumptions. In short, the
standard error of a regression coefficient can be calculated
by the standard methods when statistical error is present,
but increasing structural error results in increasing
underestimation.


FURTHER ANALYSIS 


The analysis thus far has raised two major points.
First, the success of the model as expressed in Equation 1
cannot possibly be assessed by means of R². Second, measures
of the standard error in regression coefficients are biased
when structural and statistical error are both present. The
preceding analysis is therefore pursued here in more formal
terms, leading to a discussion of steps that can be taken to
rectify the two difficulties just cited.

TABLE 1


STRUCTURAL ERROR ABSENT (σ(ε) = 0)


                        Mean        Standard
                        Absolute    Deviation
Sample                  Error       of Error
Size        R²          in a        in a

 20         .4832       .943        1.048
 50         .2711       .459         .610
100         .1085       .423         .611
200         .0863      1.01         1.216
500         .1236      1.07         1.277


TABLE 2


VARYING STRUCTURAL ERROR


                                    Mean        Standard
                                    Absolute    Deviation
Sample                              Error       of Error
Size (N)    σ(ε)        R²          in a        in a

200         0           .0986       .490         .600
200          .1         .1586       .184         .971
200          .5         .0528       .927        1.147
200         1.0         .0831      1.730        2.059


The fundamental problem can be expressed quite
succinctly in terms of the ability of the y values to give
accurate estimates of the properties of the p values. Taking
the mean first, we have:


\[ \bar{y} = \int P(y=1 \mid p) P(p) \, dp = \int p P(p) \, dp = \bar{p} \]


Thus the mean of y is an unbiased estimate of the mean of p.
Now consider the second moment of y. Since y takes only the
values 0 and 1, y² = y, so:


\[ \overline{y^2} = \int P(y=1 \mid p) P(p) \, dp = \bar{p} \]


Thus the second moment of y is not an estimate of the second
moment of p. It follows that measures based on the sum of
squares of p, such as R², the standard error of regression
coefficients, or statistical tests of the significance of
R², will be distorted when y is used as the dependent
variable.
Now the crossproduct of y with an independent variable:


\[ \iint x(i) P(x(i)) \, dx(i) \, P(y=1 \mid p) P(p) \, dp = \iint x(i) P(x(i)) \, dx(i) \, p P(p) \, dp \]


which is the crossproduct of p and x(i). So the regression
coefficient estimates, which are based on the covariances
and the variances of the x(i), will be unbiased.

Stated in these terms, the problems stem from the
impossibility of estimating the variance in values of p.
When structural error is assumed absent, the variance is
simply that of the fitted p values, so no problem arises,
but actual p values may be distorted by the unknown
component ε.


A NEW MEASURE OF STRUCTURAL ERROR 


The only possible means of assessing the amount of
structural error in the model is by a comparison of the
actual and fitted p values. While the observed y values
reflect the actual, but unknown p values, they do so only in
a statistical sense. In a complex problem, each individual
probably possesses a unique combination of independent
variable values, or socioeconomic traits, and thus a unique
p value. Thus each y value represents a single trial of a
unique experiment. (See the review of this chapter for a
different comparison method than the one presented here,
based on the GLS regression methods described in the
Cicchetti and Smith appendix to this volume.)

Suppose that the fitted p values were grouped into
ranges, say 0 to 0.1, 0.1 to 0.2, etc. Then in the absence
of structural error, the proportion of those individuals
whose fitted p values lay in each range who actually
participated should be roughly equal to the central p value
of that range. Thus approximately 5% of those individuals
with a fitted p value in the first range should be observed
to participate in the activity. The precise number will be
governed by the binomial distribution, so that the
probability that precisely r individuals will participate
is:


\[ P(r) = \binom{n}{r} \, p(c)^r \, (1 - p(c))^{n-r} \]


WHERE the central p value is p(c), and
n individuals have p values in the range.
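
As a brief illustration, the binomial probability can be evaluated as below. The function name and the example of 20 individuals in the first range are assumptions for the sketch.

```python
from math import comb


def participation_probability(r, n, p_central):
    """Probability that exactly r of n individuals participate when each
    has participation probability p_central."""
    return comb(n, r) * p_central ** r * (1 - p_central) ** (n - r)


# Hypothetical range holding 20 individuals with central p value 0.05.
n = 20
p_central = 0.05
distribution = [participation_probability(r, n, p_central) for r in range(n + 1)]
prob_none = distribution[0]
```

In this hypothetical case the single most likely outcome is one participant, with no participants almost as likely.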


Unfortunately, if we assume that structural error
distorts p values in a normal distribution with a mean of 0,
then the proportion of individuals will remain roughly the
same, irrespective of the amount of structural error
present, since upward distortions in p are as likely as
downward distortions. However, it is clear that towards the
limits of 0 and 1, the normal distribution is not a
reasonable model of structural error, since near 0 upward
distortions must be more likely, and conversely near 1. This
suggests, then, that the deviations in observed proportions
from the central p values in each range, particularly near 0
and 1, may be a reasonable measure of structural error.

These ideas are now clarified with an example. Figure 1
shows the distribution of fitted p values for the simulation
run σ(ε) = 0, N = 200, that is, a sample of 200 generated
with no structural error. Figure 2 shows the distribution of
numbers of participants by fitted p value, so that each bar
represents the number of individuals who participated, and
who had fitted p values in that range. Table 3 shows the
corresponding proportions.


TABLE 3


FITTED p DISTRIBUTION


              p Value    Participant                 Standard
Central p     Count      Count         Proportion    Error

 .05            1          1           1.0           .22
 .15            8          4            .5*          .13
 .25           15          0           0             .11
 .35           31         13            .42          .09
 .45           37         16            .43          .08
 .55           42         16            .38          .08
 .65           34         28            .82          .08
 .75           23         18            .78          .09
 .85            8          6            .75          .13
 .95            1          1           1.0           .22


* Probability of observing a value at least this different
from the central p is less than 5%.


The reliability of each observed proportion depends on
the p value count in that interval. Clearly values toward
the middle of the table are based on larger samples, and
should thus be given greater weight. In view of this, a
better strategy might be to organise the distribution into
intervals of equal numbers of observations. Table 4 shows
the same distribution in the form of the ten deciles, so
that each interval contains precisely 20 observations of
fitted p values.

TABLE 4


FITTED p DISTRIBUTION BY DECILES


              p Value    Participant                 Std
Central p     Count      Count         Proportion    Error

 .151          20          8            .40*         .08
 .333          20          4            .20          .11
 .386          20          5            .25          .11
 .431          20         12            .60          .11
 .470          20          6            .30          .11
 .506          20         11            .55          .11
 .553          20         10            .50          .11
 .610          20         10            .50          .11
 .802          20         17            .85          .09


* Probability of observing a value at least this different
from the central p is less than 5%.


The proposed structural error index S² is based on
the fit of observed proportions to central p values, when
the data is arranged by deciles, weighted by the expected
reliability of each proportion in a modified R² statistic:


\[ S^2 = 1 - \frac{\sum_i (s(i) - x(i))^2 / (x(i) - x(i)^2)}{\sum_i (s(i) - 0.5)^2 / (x(i) - x(i)^2)} \]


WHERE s(i) is the observed proportion in the ith decile and
x(i) is the central p value, and the summations are from 1
to 10.
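
A minimal sketch of the index calculation follows. The function name is an assumption, the reference value of 0.5 in the denominator follows the formula as reconstructed above, and the proportions used are invented for illustration rather than taken from the tables.

```python
def structural_index(observed, central, baseline=0.5):
    """Weighted fit of observed proportions s(i) to central p values x(i).

    Each squared deviation is divided by x(i) - x(i)^2, which is
    proportional to the binomial variance of a proportion.
    """
    numerator = sum((s - x) ** 2 / (x - x * x)
                    for s, x in zip(observed, central))
    denominator = sum((s - baseline) ** 2 / (x - x * x)
                      for s, x in zip(observed, central))
    return 1.0 - numerator / denominator


# Invented central p values and two sets of observed proportions.
central = [0.2, 0.4, 0.6, 0.8]
close_fit = [0.25, 0.35, 0.6, 0.85]
poor_fit = [0.6, 0.2, 0.8, 0.4]

perfect = structural_index(central, central)
good = structural_index(close_fit, central)
bad = structural_index(poor_fit, central)
```

The index equals 1 when every observed proportion matches its central p value, and it can fall below 0 when the fit is worse than that of the constant reference value. It is undefined if every observed proportion equals the reference value exactly.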

Table 5 shows the computed values for the index for the
conditions used in Table 2. Each experiment was repeated 25
times, and the table shows the observed mean and standard
error of S².

The index shows the expected trend. The upper limit of
the scale, corresponding to σ(ε) = 0, depends on the sample
size and will approach 1 as the sample increases and
observed proportions become more accurate estimates of
central p values.


TABLE 5


EFFECT OF VARYING σ(ε) ON S² (N = 200)


σ(ε)        Mean S²     Standard Error

0            .782        .120
 .1          .766        .147
 .5          .599        .222
1.0          .616        .187
2.0          .271        .310


SUMMARY


This paper has been concerned with models which relate
an individual's socioeconomic characteristics or traits to
the probability of his participating in specific recreation
activities. Such models are calibrated by using as the
dependent variable a binary representation of the
individual's actual behaviour, which is only statistically
related to the model probability.

Under these conditions, the R² statistic is determined
by the statistical relationship between probability and
binary event, and in no way reflects the model's actual
goodness of fit, or its structural error. Estimates of
regression coefficients are unbiased, but since the binary
event variable cannot give an estimate of the variance of
the probability variable, R² and related statistics are of
questionable value.

If each individual must be assumed to have a unique
socioeconomic mix, then each event is a trial under a
different probability, and measurement of errors in those
probabilities is extremely difficult. The method suggested
is a measure of the goodness of fit between probabilities
and proportions of events when observations are grouped into
ranges of probabilities. It is argued that structural error
will produce distortions in this fit towards the ends of the
probability spectrum at 0 and 1.

Simulations were used to demonstrate the proposed
index. They show the expected behaviour under increasing
amounts of structural error. Standard errors are quite
large, and the expected values of the index depend on sample
size, on the form of structural error, and also on the
parameters of the model, so that the index should be used
only as a relative scale and not as an absolute measure. It
would be wise to simulate the expected values of the index
under the specific conditions of each application.

Should the model be applied to cases where numbers of
individuals have the same socioeconomic characteristics,
many of the problems are much less severe. Calibration can
be carried out using observed proportions as the dependent
variable rather than binary events. The reliability of each
proportion depends on the number of individuals involved, so
that the data should be weighted accordingly. The problem
has been discussed in the context of recreation flow models
in TN No. 19, and many of the conclusions of that paper can
be applied to the participation modelling problem.


GEOGRAPHIC DATA PROCESSING CONCEPTS
AND THEIR APPLICATION TO PARK AND OTHER LAND USE PLANNING


M.F. Goodchild


ABSTRACT 


The issues surrounding the automated processing of
geographic data are subtle and complex. This paper examines
the field from the particular stance of the land use
planner, on the assumption that the major purpose of
automated processing in this context is to answer questions
of a geographic nature that cannot be readily answered by
manual map analysis. In this context study areas are usually
small, and study emphasis is more on the ability to provide
rapid answers to complex questions than on the banking of
large volumes of data.

Issues are reviewed under the general headings of the
nature of geographic data, the basic pros and cons of
machine processing, the alternatives in data storage, and
the issues in the encoding process. The conclusions reached
are consistent with the designs of several operating
geographic information systems.

The PLUS system developed by the author is used to
provide two sets of illustrations in the final sections of
the paper. One is of the encoding and verification of a
typical geographic source document to produce an accurate
data base for a variety of applications. The other shows
the PLUS/2 system in operation as it responds to a set of
queries about a study area in Southeastern Manitoba from a
fifteen-map data base of physical geographical variables.


PREFACE


Early in 1973 the author was asked to prepare a paper
for the Department of Indian and Northern Affairs on the
then current status of geographic data processing and its
potential as a tool for park planning (see Reference 23).
That paper identified the major issues and reviewed the
various approaches that have been adopted to them. The
paper was discussed at a series of meetings with interested
representatives of a number of agencies and led (in the
summer of 1973) to a contract with the Lands Directorate of
Environment Canada for the development of a limited land-use
planning system to satisfy a number of objectives. First,
the system was to permit the manipulation of maps in a
variety of ways in an interactive mode. This sort of
facility is impractical in many geographic information
systems but is feasible in a system dedicated solely to
land-use planning because the accuracy requirements are less
severe. Second, the system was to make the maximum use of
existing data banks by providing interfaces to them. (The
greatest obstacle to the adoption of geographic data
processing has been the problem of data encoding, which
requires a large commitment to hardware and staff. Yet this
obstacle is slowly being eroded as more data becomes
available in agency archives.) Finally, the system was to
allow the input of raw data in limited quantities and in the
most efficient form possible.

The system became known as PLUS (see Reference 26),
organised into two sections, PLUS/1 containing the data bank
interfacing and data input, and PLUS/2 the interactive map
manipulation. A case study was used to provide test data and
to demonstrate the use of the package (see Reference 25).
Interfaces were developed with the co-operation of the
Canada Geographic Information System (Lands Directorate,
1974) and the Canada Soil Information System (Dumanski and
Kloosterman, 1973) and the system has been demonstrated in a
variety of contexts (see, for example, Reference 27). Data
can be input and output in a variety of other standard
forms, allowing direct interfacing with SYMAP, CALFORM and
POLYVRT (Harvard University Laboratory for Computer
Graphics, 1974) and US Census DIME files (US Bureau of the
Census, 1970).

The present paper reviews the concepts and important
issues underlying geographic data processing, and provides
illustrations and examples drawn from the author's work with
PLUS. Development of the system is continuing.


THE NATURE OF GEOGRAPHIC DATA 


The principal purpose of this paper is to review the
field of geographic data processing in the particular
context of land use planning. (For a more general review see
References 12, 58.) It might be as well to begin with an
attempt at a definition of the phrase "geographic data",
before discussing the problems it presents. Consider the
question "How much of the land that lies within five miles
of a certain town is under 500 feet above sea level?" If a
data base is to be capable of answering such a question, it
must contain information on height and on location. It might
for example contain the precise locations of contour lines
in the area, through the locations of a sequence of points
at which the contours change direction. Or, as another
alternative, the topography might be represented by accurate
heights at every intersection of a very fine grid mesh laid
over the area. With either method, an answer could be given
through an appropriate series of arithmetic and logical
steps.
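
For the grid alternative, the "series of arithmetic and
logical steps" might be sketched as follows. This is an
illustration only, not part of PLUS: the function name, the
row-and-column layout (row 0 at the top, origin at the top
left corner, distances in miles) and the rule of testing
each cell at its centre are all assumptions made here.

```python
import math

def area_within_radius_below(heights, cell_size, town_xy, radius, max_height):
    """Area of cells centred within `radius` of the town and lower than `max_height`.

    heights   -- list of rows of cell heights (hypothetical layout: row 0 at top)
    cell_size -- side of a square cell, in the same units as `radius`
    town_xy   -- (x, y) of the town, measured from the top left corner
    """
    tx, ty = town_xy
    count = 0
    for r, row in enumerate(heights):
        for c, h in enumerate(row):
            # Each cell is represented by its centre point (an assumption).
            cx = (c + 0.5) * cell_size
            cy = (r + 0.5) * cell_size
            if math.hypot(cx - tx, cy - ty) <= radius and h < max_height:
                count += 1
    return count * cell_size ** 2
```

The answer is only as good as the cell size allows, since a
cell is counted as wholly inside or wholly outside the
search radius.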

The two methods mentioned above are often referred to
as the polygon and grid systems of data encoding; each is
geographic because each contains explicit locational
information. As a counterexample, suppose that the
topography had been represented by the average height of
land in each Census Enumeration Area on the map. It would
then have been impossible to answer the original question
without reference to the Enumeration Area locations and
boundaries. So a geographic data base can be defined as one
possessing explicit geographic information.

The traditional method of storing, displaying and
communicating geographic information has of course been the
map, a stylised two-dimensional representation of reality.
All of the methods currently available for processing and
communicating information are one-dimensional, working with
a single linear stream of data, and geographic information
must be represented in this way if any of the current data
processing technologies are to be applied to it. This
difference in dimensionality is responsible for a
fundamental paradox in all efforts at image or picture
processing, pattern recognition, or geographic data
processing: since the eye is a two-dimensional processor, it
is often as effective as, or even more effective than, the
digital computer in certain kinds of operations. The
computer is often only a marginally beneficial processor of
geographic data.

Geographic data can be classified in a variety of ways;
in a discussion of data processing it is particularly useful
to distinguish between point, line and area data. A point
representation is often used when a map must show the
locations of objects whose size is much less than the
intervening distances between the objects. Line
representations are used for such phenomena as railways or
roads; area representations are used when characteristics
are to be ascribed to substantial tracts of land, as on soil
maps for example.

Several other rather specialised types of map can be
classified as variations on this typology. Contour maps can
be seen as (a) area maps on which the contours are area
boundaries, or (b) line maps on which heights are to be
interpolated between neighbouring lines. Points are
sometimes used to infer areas, as when the locations of
Census districts are given by centroids.

A map can display several types of information at once,
but for data processing purposes it is difficult to deal
with more than one. The term "coverage" is used in the
paper to refer to a single type of data and a defined
geographic area. So a coverage is a one-dimensional
classification of a two-dimensional segment of the earth's
surface. An area-map coverage consists of a single
partitioning of an area into non-overlapping, homogeneous
zones.

The relative importance of each type will of course
depend on the context. However, the scale of most land use
planning is sufficiently detailed that the bulk of requisite
information is areal. Much of the relevant data, on soils,
land use, vegetation, etc., is collected directly in areal
form. In addition, most socio-economic data is collected by
predetermined areal units and is therefore also areal. On
the other hand, topography, climatic variables and water
table depths are usually determined by point sampling and
represented through contoured maps. Highway rights of way
are sufficiently narrow that they can often be dealt with as
line data. In summary, while areal data predominates, all of
the standard types can be recognised as relevant to land use
planning.

Finally, accuracy is an important issue which can
easily be overlooked. The paper touches on the question at
several points, as the degree to which the results of data
processing reflect the information on the source maps. There
are complex trade-offs between the costs of processing and
the accuracy of the results. Basically, there is no value in
providing accuracy in data that will never be used in
analysis. Nor is there value in an elaborate system which
can provide precise answers if they are based on inaccurate
data. An air photo analyst may subjectively draw a line
which encloses land of a roughly homogeneous soil type; yet
the data processing system will objectively ascribe the
precise soil type to precisely all of the land within the
line.


The Case for Machine Processing 


Geographic data processing is a relatively new field.
Although it has been possible to handle geographic data
since the earliest days of electronic computers, it is only
through the enormous reduction in costs per operation over
the last 20 years, and the simultaneous increases in storage
capacity, that processing has become economically feasible
on a realistic scale. (The points made in this section are
discussed in more detail in Goodchild (1973).)


Three types of geographic data manipulation are
considered in this section. The advantages of automatic
processing are assessed in the context of mapping, then in
relation to geographic information retrieval, and finally in
the analysis of geographic relationships and model
calibration. All three areas are to some degree relevant to
present day land use planning, and are assessed in that
context. But, as well, it is important to bear in mind that
the development of geographic data processing techniques may
open new directions for land use planning in the future.


Automated Cartography 


Once geographic information has been coded, it can
readily be retrieved in graphic form as a computer-produced
map (see Reference 48). Two main methods exist, based on
the plotter or computer-driven pen, and the line printer. In
Canada, both the Canada Land Inventory, through the Canada
Geographic Information System (Lands Directorate, 1974) and
the Department of Agriculture, with the Canada Soil
Information System (Reference 15) have produced massive data
files of coded information readily usable for producing
plotted maps. Canada Land Inventory coverages are described
in Reference 15. Similar efforts in other areas can be found
in the GRDSR program of Statistics Canada (Reference 55),
the National Topographic Survey of the Department of Energy,
Mines and Resources, and other agencies, both federal and
provincial.

In the line printer case, relatively crude maps have
usually been made by overprinting symbols to create
different shadings (see Figure 4), although more recent
variants such as the electrostatic plotter produce more
acceptable results by a dense application of small dots.

At the simplest level, automatic cartography is not an
efficient way of making a map. It would clearly take at
least as long to describe the map to the machine by encoding
it as it would to plot it by hand. But there are many sets
of circumstances which may justify computerising a map.
First, once encoded the map is stored by the computer so
that the benefits of encoding extend beyond the cartographic
exercise. The data might be used to plot the same map at
different scales or to draw only part of it; or the coded
map might be used as a source of information in some
analytic problem. Second, there are cases in which the same
computer-coded base might be used to produce different maps.
For example, once the outlines of census areas are coded it
is possible to produce innumerable maps of different census
variables very cheaply. Furthermore, one may wish to process
the map between the coding and drawing stages. It might be
necessary to change projection or scale; or computer
processing might be used to make corrections and
modifications before drawing an updated final copy of a map.
And finally, a great deal of work in the printing process
(in the preparation of colour separations) can be avoided by
effective automation.

But while each of the operations mentioned may be very
relevant to the routine production of topographic, soil or
census maps by the appropriate agencies, the land use
planner is unlikely to justify encoding geographic data for
the sole purpose of automated cartography. He is more
likely concerned with a unique, limited study of a specific
area, with very little likelihood that the data will be
useful to other studies in the future, at least without
extensive updating and revision.


Information Retrieval 


The land use planner must work with answers to a wide
variety of geographic questions, ranging from the simple
"What is here?" on a single coverage to complex comparisons
of several coverages, such as "How much land of types x, y
and z on coverages A, B and C lies within N miles of
location L?", or problems involving a search over feasible
locations to find optimum sites. Some of these questions can
be answered by eye, and it is not difficult to construct a
case in which the eye can accomplish an operation
(evaluation) more rapidly than a third-generation computer,
given a clear map and a prior encoding. Even when
information must be collated from several maps, the
advantages of machine processing are marginal at best.

Other similar questions fall into the same category,
like "Where is the boundary between these two areas?". But a
second category of operation can be specified: those that
are marginally feasible by hand but for which there is a
clear advantage in machine processing. Any evaluation of
area, however simple, is a cumbersome and tedious operation
if done manually by the traditional methods of planimetry or
dot counting. It is especially so if qualifications are
placed on the evaluation, such as "How much land of type x
occurs in blocks of greater than 100 acres?" or "within five
miles of town L".

A third category of information-retrieval operations
consists of those which are virtually impossible by hand.
These are new capabilities provided by machine processing,
rather than old capabilities to be compared to manual
methods. Included in this category are many of the problems
which involve the comparison of two or more coverages, such
as "How much land is of type x on coverage A and type y on
coverage B?". The preparation of a composite map by the
arithmetic or logical combination of two or more simple
coverages is of this complexity, as are operations of the
form "Display all land on coverage A that has value y on
coverage B". Again, different types of data might be mixed,
as in the problem "How much land of type x on areal map A
lies within one mile of a railway right of way on line map
B?".
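
When two coverages are held on the same grid, a comparison
of this kind reduces to a cell-by-cell test. The sketch
below is illustrative only; the function names and the
representation of a coverage as a list of rows are
assumptions made for this example, not PLUS conventions.

```python
def overlay_area(cov_a, cov_b, type_a, type_b, cell_area):
    """Area that is `type_a` on coverage A and `type_b` on coverage B.

    Both coverages are lists of rows on an identical grid, so the
    nth cell of one corresponds to the nth cell of the other.
    """
    cells = sum(
        1
        for row_a, row_b in zip(cov_a, cov_b)
        for a, b in zip(row_a, row_b)
        if a == type_a and b == type_b
    )
    return cells * cell_area

def composite(cov_a, cov_b):
    """Composite map: each cell holds the pair of values from A and B."""
    return [list(zip(row_a, row_b)) for row_a, row_b in zip(cov_a, cov_b)]
```

With hypothetical 160-acre cells, `overlay_area(soils,
land_use, "x", "y", 160)` would give the acreage that is
type x on one map and type y on the other.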

The second and third categories of information-retrieval
operations are likely to provide the strongest
justification for a geographic information system in the
planning field, designed for and as far as possible operated
by planners themselves. But this is only one of the possible
orientations of a geographic information system; for
example, the data-acquisition and cartographic functions of
the Canada Land Inventory dictate an entirely different set
of criteria for the Canada Geographic Information System.
This paper adopts the planning orientation, and should not
be interpreted as a review or critique of the entire field
of geographic information systems.


Advanced Analytic Capabilities 


Computationally, there is no limit to the complexity of
operations that can be programmed into a geographic
information system. Thus far we have considered three broad
categories of data manipulation and information retrieval,
but the potential is obviously much greater. But there is a
basic dilemma in going much further. Since few of these
operations have been available to land use planners in the
past, the profession has had to evolve largely without them.
So understandably most planners are at a loss to suggest
additional capabilities, or even to structure their existing
procedures along appropriate lines. The precision and
objectivity of data processing are largely foreign to
methods that have evolved without mechanical aids. So while
the capabilities described above may fit directly into
planning practice, more advanced operations are likely to do
so only in the long term.

Consider the following example. A power line is planned
across productive agricultural land and must follow the
route of minimum cost, defined as the composite of
construction cost, land acquisition, transmission cost and
agricultural production loss. Potential agricultural
production can be calculated from agricultural capability,
which is available in map form, but the actual relationship
has yet to be established. Advanced geographic data
processing capabilities might be used in this context, first
to establish the relationship between potential production
and land capability (by regressing past yields against the
map of capability) and then to select the optimum site by a
minimising search algorithm. Whatever the set of operations
finally considered useful for land use planning, it is clear
that any system must have the potential for rapid and easy
addition of new capabilities as the need for them becomes
apparent.
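
One possible form of the minimising search is sketched
below, assuming the composite cost has already been
computed for every cell of a grid. The paper does not name
an algorithm; Dijkstra's shortest-path method is used here
purely as an illustration, and the function name, the
four-neighbour movement rule and the convention that a
route pays the cost of every cell it occupies are all
assumptions. The regression step is not shown.

```python
import heapq

def least_cost_route(cost, start, goal):
    """Cheapest 4-connected route between two (row, col) cells.

    cost -- list of rows of non-negative per-cell costs (hypothetical
            composite of construction, acquisition and production loss)
    Returns (total cost, list of cells from start to goal).
    """
    rows, cols = len(cost), len(cost[0])
    best = {start: cost[start[0]][start[1]]}
    prev = {}
    heap = [(best[start], start)]
    while heap:
        d, cell = heapq.heappop(heap)
        if cell == goal:
            break
        if d > best[cell]:
            continue  # stale queue entry
        r, c = cell
        for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
            if 0 <= nr < rows and 0 <= nc < cols:
                nd = d + cost[nr][nc]
                if nd < best.get((nr, nc), float("inf")):
                    best[(nr, nc)] = nd
                    prev[(nr, nc)] = cell
                    heapq.heappush(heap, (nd, (nr, nc)))
    # Walk back from the goal to recover the route.
    path = [goal]
    while path[-1] != start:
        path.append(prev[path[-1]])
    path.reverse()
    return best[goal], path
```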


THE ISSUES IN MACHINE STORAGE 


In discussing the costs and benefits of various
approaches to machine storage of geographic data sets, there
are three separate sets of criteria to be considered. First,
there are alternative methods for actually encoding data,
each with its own advantages and disadvantages. Second,
long-term storage presents its own problems of efficiency
and reliability. And third, there is a wide variety of
eventual applications of the data which must be reflected in
the methods of storage and retrieval.

The two methods of automated cartography, plotter and
printer, correspond precisely to the two major methods of
data storage, polygon and grid. Plotter maps are drawn by
driving a pen around each of a number of boundaries; polygon
data is stored as a sequence of points which, when
connected, indicate polygon boundaries. Similarly, printer
maps are created by placing a symbol in each of a finite
array of cells; correspondingly, grid data is stored as a
collection of discrete values associated with grid cells.

Within each of the two classes there are innumerable
variants. The record in a polygon data set can consist of
all that land classified by a certain code, or a complete
contiguous polygon, or a segment of a polygon boundary
between two junctions, or a single boundary point. These may
usefully be referred to as level 1, 2, 3 and 4 files
respectively. Certain variants contain more than one level.
The SYMAP structure is a pure level 2 form, whereas the
CALFORM structure combines level 4 and level 2 subfiles.
(See Reference 30.)

The grid data structure may vary according to the
convention for ordering cells, although the commonest by far
is the basic printer sequence, by rows from the top left
corner. More importantly, the structure may be compressed so
that extensive runs of the same cell type are replaced by
integral multipliers. The literature on data structures is
extensive; for reviews see References 00, O00. 
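
The compression of runs can be illustrated with a short
sketch. The function names and the (multiplier, value)
pair format are assumptions chosen for this example; the
paper does not specify a storage format.

```python
def rle_encode(row):
    """Replace each run of identical cell values by a (multiplier, value) pair."""
    runs = []
    for value in row:
        if runs and runs[-1][1] == value:
            runs[-1][0] += 1          # extend the current run
        else:
            runs.append([1, value])   # start a new run
    return [tuple(run) for run in runs]

def rle_decode(runs):
    """Expand (multiplier, value) pairs back into a full row of cells."""
    return [value for count, value in runs for _ in range(count)]
```

A row such as AAAABBC is stored as three pairs rather than
seven cells; the saving grows with the length of the runs
and disappears on maps with very fragmented patterns.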

The accuracy with which polygon data represents its
source document depends on the accuracy with which each
boundary line is encoded. If the boundaries are excessively
contorted, they may require a high density of points to
create an accurate representation, while the largely
straight lot lines of a land use map can be coded with a
much lower density. But in general a moderate number of
boundary points are sufficient to make an adequately
accurate polygon data set.

By contrast, a grid data set has a definite and
predetermined level of accuracy once the grid cell size is
set. Relationships exist between grid cell size and the
reliability of any estimate of area, or any other geographic
parameter. (See References 24, 57.) Yet in spite of the
sacrifice of precision, grid data has a very clear advantage
in any of the more complex forms of geographic data
processing. Maps can be made equally readily from either
data type, but any operation which requires the evaluation
of land type at a specific location, or the comparison of
different maps, is far more readily executed with grid data
than with polygon. With grid data, it is a simple matter to
calculate the cell in which a point lies, and thus to
retrieve its characteristics, but with polygon data it is a
substantial problem to associate a specific polygon with a
given point.
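
The contrast can be made concrete with a sketch. For a
grid, the cell is found by two divisions; for polygons,
each candidate outline must be tested in turn, here by the
standard ray-crossing test. The function names and the
grid convention (origin at the top left corner, rows
numbered downward) are assumptions for this illustration.

```python
def cell_of(x, y, x0, y0, cell_size):
    """(row, col) of the grid cell containing (x, y).

    (x0, y0) is the top left corner of the grid; rows run downward.
    """
    col = int((x - x0) // cell_size)
    row = int((y0 - y) // cell_size)
    return row, col

def point_in_polygon(x, y, ring):
    """Ray-crossing test: True if (x, y) lies inside the closed outline `ring`."""
    inside = False
    n = len(ring)
    for i in range(n):
        x1, y1 = ring[i]
        x2, y2 = ring[(i + 1) % n]
        # Does this edge cross the horizontal line through the point?
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside
```

The grid lookup costs the same however large the map is,
whereas the polygon search must visit every vertex of
every outline until the enclosing polygon is found.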

For this reason most advanced data processing of the
type outlined above has been carried out on gridded data
sets. The sacrifice of accuracy has been judged to be more
than compensated by avoiding the excessive computational
problems of information retrieval from polygon data.
Optimally, each coverage is coded by a precisely similar
grid, so that the nth cell in the first grid corresponds to
the nth in every other.

But while information retrieval and analytic operations
are best performed on grid data, there is no need to store
all data in that form, as transfer is easy between the two
types, particularly from polygon to grid. Grid data sets are
only rational once a level of accuracy and a study area have
been determined. Thus one finds that the polygon structure
has been adopted by most agencies with responsibility for
data collection, storage and cartography, while grid data is
favoured for sophisticated retrieval and analysis. Ideally,
data should be collected and stored in polygon form, and
then gridded for each specific study area and level of
accuracy. Collecting directly in grid form places too many
restrictions on the usefulness of the data, while analysis
using polygon data is bound to be inordinately expensive.


THE ISSUES IN DATA ENCODING


The encoding of point or line data presents no
particular problems. Co-ordinates can be taken directly from
source documents with readily available digitising
equipment. Grid coding of areal data is similarly
elementary, though unlikely for the reasons discussed in the
preceding section. But the encoding of areal data in polygon
form has historically been one of the main stumbling blocks
of geographic data processing. In the simplest manual system
a digitiser operator indicates a sequence of points that he
wishes to have encoded, together with the contents of each
polygon. There are many variants of the process, depending
on the degree to which computer processing is used in the
eventual editing. A few possibilities are discussed below.


The SYMAP Method: Complete Polygon Encoding 


In this system, the polygon constitutes a file record
(a level 2 file according to the above notation). The
operator codes a series of points defining each polygon
outline in clockwise order, and then codes the contents of
each polygon. See Figure 1(a). As a result every boundary is
coded twice, as part of the polygon on either side. The two
versions of each line will inevitably conflict, so the crude
image is often processed to resolve any deviations of less
than some allowable amount, say 0.05 inch, to create a clean
file.


The CALFORM Method 


The file cleaning process mentioned above is
potentially dangerous, since any real detail in the map at a
scale of less than 0.05 inch will be removed along with the
spurious detail. The CALFORM method attempts to resolve this
potential ambiguity and to economise in effort by avoiding
the double encoding of each line. Every polygon vertex on
the map is first numbered and its location encoded in a
level 4 file. Then the polygons are coded by listing the
sequence numbers of their vertices, to create a level 2
file. See Figure 1(b). Although the method becomes
unmanageable when large numbers of vertices have to be
numbered, it avoids both of the difficulties of the SYMAP
system. But a realistic encoding method must be made to deal
with thousands of polygons and hundreds of thousands of
vertices on a single map.
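
The two-file arrangement can be illustrated with a small
sketch. The data, the dictionary layout and the function
names are hypothetical; they show the idea of a numbered
vertex file referenced by polygon records, not the actual
CALFORM input format.

```python
# Level 4 file: every vertex is numbered and located exactly once.
vertices = {
    1: (0.0, 0.0), 2: (1.0, 0.0), 3: (2.0, 0.0),
    4: (2.0, 1.0), 5: (1.0, 1.0), 6: (0.0, 1.0),
}

# Level 2 file: each polygon lists its vertex numbers in clockwise order.
polygons = {"A": [1, 6, 5, 2], "B": [2, 5, 4, 3]}

def polygon_outline(name, polygons, vertices):
    """Co-ordinates of a polygon outline, looked up through the vertex file."""
    return [vertices[number] for number in polygons[name]]

def shared_vertices(a, b, polygons):
    """Vertex numbers common to two polygons, i.e. their shared boundary."""
    return sorted(set(polygons[a]) & set(polygons[b]))
```

Because vertices 2 and 5 are stored once, the boundary
between A and B cannot conflict with itself, and moving a
vertex in the level 4 file moves it in both outlines.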


PC (Segment) Methods


PC methods divide the polygon networks into sections of
boundary between junctions (a level 3 file) and so avoid the
duplication problem of the level 2 methods. See Figure 1(c).
But each junction now creates a problem, since it occurs at
the end of each of several records, and may be coded at a
slightly different point each time. So a certain amount of
processing must be used to resolve discrepancies to within
an allowable limit. A variety of methods have been devised
to resolve discrepancies in an unambiguous way. In some
cases each junction is encoded in a separate digitising
phase; in others, each segment is named according to the
polygons it bounds.

PC encoding can become a complex and confusing task for
a digitiser operator, who must continually switch between
point encoding and the identification of polygons. In some
variations of the method, polygons are identified in a
separate phase of the encoding process after the entire
point encoding is complete. This is easier for the operator,
but requires a more sophisticated processing stage to
resolve more complex ambiguities at junctions. The IC method
implemented in PLUS/1 is one example (see Reference 25). The
operator first codes the boundaries by identifying vertices,
but without necessarily beginning and ending records at each
junction. The polygons are then identified by encoding one
point within each one and giving its characteristics.
Computer software identifies all junctions, breaks the
encoded boundaries at appropriate points, and attaches each
polygon type to the segments forming each polygon boundary.


Semi-automatic Methods


The encoding methods described above can be automated
in various ways. First, the digitising process can be
accelerated if points are coded automatically, at given
intervals of time or over given distances, rather than by a
conscious act of the operator. Second, a great deal of work
has gone into making the line-following process
semi-automatic, with a limited amount of operator control to
resolve ambiguities. The map is usually scanned by a device
similar to a TV camera, moving under the control of a small
"dedicated" computer. Ambiguities are brought to the
operator's attention when they cannot be resolved
internally. (For example, Reference 8.)


Fully Automatic Coding 


The technology necessary to read an entire map and
place it in computer storage has existed for a long time.
Unfortunately, such a vast amount of data is created if the
map is scanned with any degree of precision that the cost of
sorting out polygons by computer processing is inordinately
high. In addition, most source documents must be redrawn
prior to scanning to remove any marks that do not represent
polygon boundaries, an operation that often occupies as much
human effort as manual digitising. But several trends
suggest that such systems will enjoy increasing popularity
in the future. Computer processing is becoming cheaper,
while the cost of manual operations is increasing. And with
better software, it should be possible to scan images that
are closer to the source document in quality, without
requiring expensive redrafting. But at the present time the
PC (Segment) methods discussed above are clearly the most
practical and economical for the creation of polygon data
sets except in the largest agencies.


Editing Data Sets 


There are two fundamental problems in editing or
updating areal data sets and they are largely responsible
for the difficulties which all practical systems have
experienced. First, there are internal consistencies within
any polygon data set, so that changes may have to be made at
several points in the file at once. For example, if a
boundary point location is changed in a SYMAP data set, the
change must be made to at least two of the coded points in
the file. Or if a polygon characteristic is changed in a PC
file, it must be changed in all of the records which make up
the polygon boundary. So it is essential that editing be
done using specialised software so that internal consistency
can be preserved.

Secondly, there is a fundamental difficulty in relating
the contents of a file to the appearance of a map. The
coordinates of a point cannot be associated with particular
lines on a map without the aid of a digitiser or plotting
device, so both machines are usually considered essential to
the editing process. Some editing systems use graphics
terminals to display sections of a data set, so that the
user can identify the location of errors on the original
source document. Errors can then be corrected with the
terminal cursor, which the operator can position to revise
locations or to indicate changes. This sort of man/machine
interaction appears to be the best solution to the editing
of polygon data, but suffers at the moment from the
smallness of graphics terminal screens. The ideal system
would operate at the same scale as the source document,
perhaps through a combined digitiser/plotter system with a
head that could be driven or positioned manually, in an
interactive mode.


EXAMPLES 


The Encoding Process for Polygon Data 


The first example illustrates the preparation of an
areal data set of the polygon type from the source document,
using the PLUS software package. The map chosen for encoding
is one showing the pattern of Census Enumeration Areas in
the city of London, Ontario; there are 478 separate polygons
on the map, mostly with simple rectilinear outlines. The map
was coded using a simple manual digitiser, with a cross hair
cursor connected by a rigid arm and steel cables to two
incremental counters. A digitiser operator pressed a foot
switch whenever a point was to be encoded, causing the
point's x and y co-ordinates to be recorded on a punched
card, to the nearest 1/100 inch. The overall accuracy,
expressed as the ability of the operator to find and encode
the same point at infrequent intervals, is about 0.05 inch.

The IC method discussed briefly above was used to
encode the map. In the first stage the operator coded the
entire image as a network of lines, breaking the sequence
whenever necessary to move to another section of image.
Figure 1(d) shows a typical sequence and should help to
clarify the method. In a second stage, a single point was
located arbitrarily in each polygon and coded with the
polygon's identifier, in this case its Enumeration Area
identification number. This primary coding operation
occupied the digitiser operator for ten hours, in which time
roughly 10,000 image boundary points were coded, and 500
identifiers.

As a first step in correcting the errors in this raw
data, the image and centres were plotted out at the same
scale as the source document (Figure 2). Several kinds of
error can be detected immediately, such as those which
result when the operator fails to properly indicate a break
in the encoding of the image, or when sections of image or
centres are omitted. Another two hours were spent in
modifying the data to remove such errors.

The data were then processed by a PLUS/1 programme
package (POLYSORT) designed to identify junctions, make
appropriate breaks in the line image, and attach the polygon
identification to the appropriate side of each PC record. At
the same time, any gaps at junctions caused by an undershoot
or overshoot during digitising were removed to within a
certain tolerance (0.1 inch). Nevertheless, errors could
still exist in the file at this stage. Junctions could fail
to close within the prescribed tolerance, or there could be
problems that were not visually obvious at the earlier error
correction. More seriously, the use of an error tolerance
means that it is difficult to resolve real detail at scales
approaching that tolerance without ambiguity.
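
The closing of gaps within a tolerance can be sketched as
follows. This is not the POLYSORT algorithm, whose details
are not given here; it is a simple greedy rule, assumed for
illustration, in which each segment end point is moved to
the first junction already recorded within the tolerance.
The function name and data layout are hypothetical.

```python
import math

def snap_endpoints(segments, tolerance):
    """Move segment end points that fall within `tolerance` of an
    already recorded junction onto that junction.

    segments -- list of point lists [(x, y), ...], one per boundary segment
    Returns (snapped segments, list of junction locations).
    """
    junctions = []

    def snap(point):
        for junction in junctions:
            if math.dist(point, junction) <= tolerance:
                return junction
        junctions.append(point)  # no junction nearby: record a new one
        return point

    snapped = []
    for segment in segments:
        segment = list(segment)
        segment[0] = snap(segment[0])
        segment[-1] = snap(segment[-1])
        snapped.append(segment)
    return snapped, junctions
```

The sketch also shows the ambiguity noted above: two
genuinely distinct junctions closer together than the
tolerance would be merged into one.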

Three checks for potential errors were made at this
stage, besides those already described. First, the internal
consistency of the file was checked by linking together all
of the PC's forming each polygon boundary, to ensure that no
gaps existed and that each was coded with the correct
polygon identification. Any errors were flagged by the
PLUS/1 program (EDITONE) so that they could be checked and
corrected. As a second check, each polygon identification
was verified from the original map. Any changes must
maintain the internal consistency of the file and so must be
made by an appropriate program package (EDITTWO). Finally,
the PC file was plotted at the scale of the source document
to ensure that the boundary locations were correct (Figure
3).
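
The first check can be illustrated with a sketch. It is not
the EDITONE program; it applies one simple necessary
condition, assumed here for illustration: if the segments
bounding a polygon link into closed loops, every junction
on that boundary is met an even number of times. The record
layout (start junction, end junction, polygon on the left,
polygon on the right) is hypothetical.

```python
from collections import defaultdict

def open_polygons(segments):
    """Flag polygons whose boundary segments fail to link into closed loops.

    segments -- list of (start_junction, end_junction, left_id, right_id);
                None marks the outside of the map.
    Returns {polygon id: sorted junctions where the boundary is broken}.
    """
    degree = defaultdict(lambda: defaultdict(int))
    for start, end, left, right in segments:
        for polygon in {left, right}:
            if polygon is not None:
                degree[polygon][start] += 1
                degree[polygon][end] += 1
    problems = {}
    for polygon, junctions in degree.items():
        dangling = sorted(j for j, count in junctions.items() if count % 2)
        if dangling:
            problems[polygon] = dangling
    return problems
```

A polygon that is missing a boundary segment, or whose
segment carries the wrong identification, shows up with a
pair of unmatched junctions.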

A clean polygon data set can be used in a variety of
ways. The coded map can be gridded at any scale, with any
size of grid cell (GRID). See Figure 4. It can be used to
calculate areas, perimeter lengths, centroid locations and
other geographic summaries (RPG); it can form the basis for
mapping using one of the standard polygon mapping packages
(SYMAP or CALFORM); if necessary the data structure can
be modified to any variant of the polygon form (STRUCT); and
polygon maps can be overlain (OVER) to produce composites in
which each combination of polygons becomes a new, unique
figure. (GRID, RPG, STRUCT and OVER are PLUS/1 routines
described in Reference 27.)
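
The area, perimeter and centroid summaries mentioned above can
be sketched for a single polygon as follows. This is only an
illustration using the standard "shoelace" formulas; it is not
the RPG routine, and the function name and the sample polygon
are hypothetical.

```python
from math import hypot


def polygon_summary(vertices):
    """Return (area, perimeter, centroid) of a simple polygon.

    vertices: list of (x, y) tuples in order around the boundary,
    without repeating the first point at the end.
    Assumes a non-degenerate polygon (non-zero area).
    """
    n = len(vertices)
    twice_area = 0.0
    cx = cy = 0.0
    perimeter = 0.0
    for i in range(n):
        x0, y0 = vertices[i]
        x1, y1 = vertices[(i + 1) % n]  # wrap around to close the boundary
        cross = x0 * y1 - x1 * y0
        twice_area += cross
        cx += (x0 + x1) * cross
        cy += (y0 + y1) * cross
        perimeter += hypot(x1 - x0, y1 - y0)
    area = twice_area / 2.0  # signed: positive if counter-clockwise
    centroid = (cx / (6.0 * area), cy / (6.0 * area))
    return abs(area), perimeter, centroid


# Hypothetical 4 x 2 rectangle in map units
rectangle = [(0, 0), (4, 0), (4, 2), (0, 2)]
area, perimeter, centroid = polygon_summary(rectangle)
```

The signed area is kept until the centroid has been computed, so
the result does not depend on whether the boundary was digitised
clockwise or counter-clockwise.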


Information Retrieval with Grid Data


The second example demonstrates the use of geographic
data in an information system using PLUS/2. Under this
system, answers to a variety of queries are available at a
user's terminal (a teletype or cathode ray tube device) in
response to simple commands. In the following pages results
of an actual run are photographed from a screen in precisely
the way a user would see it, with underscoring applied to
everything that would be typed by the user. The actual case
is taken from a study of the Roseau River basin in southern
Manitoba (Reference 27). Data on the area has been gridded,
based on 168 columns and 36 rows, so that each map cell
corresponds to one quarter section of land. Fifteen
coverages of the area were stored in this grid form. The
"resource value" recorded for each cell on each map coverage
is the predominant value of the appropriate variable in the
area covered by that cell.

A computerized "planning" session begins (Figure 5) by
the user asking for assistance. The list of commands
available in PLUS/2 contains several routines for the
analysis of map data, for the preparation of summaries,
display of maps, creation of new maps and the combination of
existing coverages. The command 'COVERAGES' produces a list
of maps in the system; these names, along with the size of
grid area and other basic information on the study area, are
passed from session to session; they can also be entered by
a dialogue initiated by the 'RESTART' command.

Incidentally the PLUS/2 system recognises four kinds of
coverage: Alpha, Numeric, Lines and Points. Alpha and
Numeric maps both consist of cellular arrays, but differ in
the allowed list of operations. The values on a Numeric map
are assumed to be measurements on some scale, whereas no
interval or ordinal relationship is assumed between the
alphabetic or numeric symbols on an Alpha map.

After finding out what coverages are available the
user might ask for a display of one of the maps in the
system. 'AGD' is, in reality, the primary agricultural
capability class, on a scale of 1 to 7. Because only 72
print positions are available on the terminal being used,
the system asks for a part of the map to be specified. It
has previously determined that the map must be displayed as
six 'pages', two rows and three columns, so that page 1,2
corresponds to the top centre of the map. Page 1,1 is the
top left, and page 2,3 the bottom right. (See Figure 6.)

These maps are crude images that can be produced at the
terminal in seconds. More permanent maps of publishable
quality might be produced from the same data by directing
output to a line printer. A command 'PILOT' is available in
PLUS/2 when permitted by the specific computer system in
use.


What Proportion of Land is Class 2? 


In displaying a map, the system is merely making a
crude reproduction of a source document. But the same data
can be used to answer a series of questions. Consider first
a question which can be answered through the 'TABULATE'
command. Since the AGD file being considered is Alpha, the
values on the map are reported in the order in which they
are found, and in the actual request, the output shows that
12.3% of the area is Class 2, a total of 742 cells (Figure 7).
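
A tabulation of this kind can be sketched as a simple count of
cells per symbol. The sketch below is an illustration only, not
the PLUS/2 'TABULATE' routine; the function name and the small
sample grid are hypothetical. Symbols are reported in the order
in which they are first found, as described above for Alpha
files.

```python
from collections import Counter


def tabulate(grid):
    """Count cells per symbol, in order of first appearance.

    grid: list of rows, each a list of cell symbols.
    Returns a list of (symbol, cell_count, percent_of_area).
    """
    counts = Counter()
    for row in grid:
        for value in row:
            counts[value] += 1
    total = sum(counts.values())
    return [(symbol, n, 100.0 * n / total) for symbol, n in counts.items()]


# Hypothetical 2 x 3 Alpha coverage
sample = [["3", "2", "2"],
          ["1", "3", "2"]]
table = tabulate(sample)

# Check of the figure quoted in the text: 742 cells on a 168 x 36 grid
class_2_percent = round(100.0 * 742 / (168 * 36), 1)
```

The last line reproduces the arithmetic behind the 12.3% quoted
for Class 2 land on the 168-column by 36-row grid.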


How Much Land is Class 2 
With an Average Water Table Depth 
of 12-30 Inches?


The coverage 'SLG' contains a classification of the
depth to water table, using the number 3 to indicate depths
in the range 12-30 inches. So the question above requires a
comparison of two maps and an evaluation of the area over
which two classes coincide. The PLUS/2 'CROSSTAB' command
will tabulate all coincidences between two coverages; it
shows that of the 742 cells of Class 2 agricultural
capability, 126 have the required class of water table depth
(Figure 7). This corresponds to a total area of 31.5 square
miles.
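
The coincidence count and the conversion from cells to area can
be sketched as below. This is a minimal illustration, not the
PLUS/2 'CROSSTAB' routine; the function name and the sample
grids are hypothetical. The cell area of 0.25 square mile
follows from each cell being one quarter section.

```python
from collections import Counter

CELL_AREA_SQ_MILES = 0.25  # one quarter section per cell


def crosstab(grid_a, grid_b):
    """Count cells for every pair of symbols on two aligned grids."""
    pairs = Counter()
    for row_a, row_b in zip(grid_a, grid_b):
        for a, b in zip(row_a, row_b):
            pairs[(a, b)] += 1
    return pairs


# Hypothetical capability and water-table coverages (2 x 3 cells)
agd = [["2", "2", "3"],
       ["2", "1", "2"]]
slg = [["3", "1", "3"],
       ["3", "3", "2"]]

pairs = crosstab(agd, slg)
cells = pairs[("2", "3")]          # Class 2 land with depth class 3
area = cells * CELL_AREA_SQ_MILES  # area of the coincidence

# Check of the figure quoted in the text: 126 cells
quoted_area = 126 * CELL_AREA_SQ_MILES
```

The same conversion applied to the 126 cells reported in the
text gives the 31.5 square miles quoted there.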


How Much Class 2 Land 
Lies Within 10 Miles of Stuartburn? 


This query can be answered with the 'DTAB' command,
which analyses a map according to distance from a fixed
point. The location of Stuartburn is given in terms of the
network of cells, as row 18, column 50. The user now
indicates how the map is to be analysed. Each cell in the
map will be identified in two ways, according to its type,
and to its distance from Stuartburn. The user defines ranges
of distance, by indicating the upper limit of each range,
and the symbol the range is to be given in the output table.
Distances must be given in terms of cells, bearing in mind
that each quarter-section cell is one half mile across. By
specifying limits of 20 and 200 cells, the user is in effect
indicating ranges of less than and greater than 10 miles,
since no distances of greater than 200 cells occur on the
map. From the output, Figure 7, the user infers that only 9
cells have the required characteristic, the majority of
Class 2 land lying outside the ten mile radius.
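
A distance tabulation of this kind can be sketched as follows.
This is an illustration only, not the PLUS/2 'DTAB' routine. It
assumes straight-line distance between cell centres measured in
cells and treats each upper limit as inclusive; the source does
not state either detail. The function name and the sample data
are hypothetical.

```python
from collections import Counter
from math import hypot


def dtab(grid, origin, limits):
    """Tabulate cells by symbol and by distance range from a point.

    origin: (row, column) of the fixed point, in cell units.
    limits: list of (upper_limit_in_cells, range_symbol) pairs,
            in ascending order of limit.
    """
    r0, c0 = origin
    table = Counter()
    for r, row in enumerate(grid):
        for c, value in enumerate(row):
            distance = hypot(r - r0, c - c0)
            for upper, symbol in limits:
                if distance <= upper:  # first range that contains the cell
                    table[(value, symbol)] += 1
                    break
    return table


# Hypothetical one-row coverage; the fixed point is the first cell
row_map = [["2", "2", "1", "2", "2"]]
result = dtab(row_map, origin=(0, 0), limits=[(2, "N"), (10, "F")])

# Cells are half a mile across, so a limit of 20 cells is 10 miles
limit_in_miles = 20 * 0.5
```
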
Final Plot of Example 1
FIGURE 5
FIGURE 7
FIGURE 8
FIGURE 9


[Figure: photograph of a terminal session showing the
RETYPE, REGRESS and COMBINE commands; the screen text is not
legible in this copy.]


FIGURE 10
FIGURE 11
FIGURE 12
FIGURE 13
FIGURE 14
FIGURE 15


Can Agricultural Capability
Be Predicted From Water Table Depth? 


Neither 'AGD' nor 'SLG' was initially declared as a
Numeric file, although both contain exclusively numeric
symbols, which in turn represent values on crude scales of
capability and depth respectively. So they both satisfy the
requirements of Numeric files, and their types are changed
using the 'RETYPE' command to form files 'NAG' and 'NSL'.
The 'REGRESS' command can now be used to test for a
predictive relationship (Figure 9). Any cells containing the
code '0' in either file are omitted, since the code denotes
missing data, leaving 2,756 cases to be evaluated by linear
regression. The correlation of 0.26183 indicates that a very
weak relationship is present, with good capability (low
values) corresponding to shallow water tables, but that many
other factors also affect the capability index.
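
The regression step can be sketched as an ordinary least-squares
fit over the cells that carry data on both coverages. This is an
illustration only, not the PLUS/2 'REGRESS' routine; the
function name and the sample grids are hypothetical, and the
code 0 is taken to mean missing data as described above.

```python
from math import sqrt


def regress(grid_x, grid_y, missing=0):
    """Least-squares fit of y on x over two aligned Numeric grids.

    Cells holding the missing-data code on either grid are omitted.
    Returns (cases, intercept, slope, correlation).
    """
    xs, ys = [], []
    for row_x, row_y in zip(grid_x, grid_y):
        for x, y in zip(row_x, row_y):
            if x != missing and y != missing:
                xs.append(x)
                ys.append(y)
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    correlation = sxy / sqrt(sxx * syy)
    return n, intercept, slope, correlation


# Hypothetical depth (x) and capability (y) grids; 0 marks missing data
nsl = [[1, 2, 0],
       [3, 4, 2]]
nag = [[2, 4, 5],
       [6, 8, 0]]
cases, intercept, slope, correlation = regress(nsl, nag)
```

In the sample, two of the six cells are dropped because one of
the two grids holds the missing-data code there.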


Yield Prediction 


Suppose that past analyses have indicated that hay
yields, in thousands of pounds per acre, can be predicted
from the equation


Yield = 3 + 0.5 AGD + 0.4 SLG


The 'COMBINE' function can be used to produce a new coverage
in which each cell shows the combined yield prediction from
the capability and water depth coverages. The options in the
command allow a wide variety of algebraic and logical
combinations. See Figure [number illegible].
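
A weighted combination of two coverages can be sketched cell by
cell as below. This is an illustration only, not the PLUS/2
'COMBINE' routine, whose options (added constants, weights,
powers and recoding) are not reproduced here. The coefficients
3, 0.5 and 0.4 follow one reading of the printed equation, which
is partly illegible in the source, so they should be treated as
an assumption; the function name and sample grids are
hypothetical, and missing-data codes are not handled.

```python
def combine(grid_a, grid_b, constant, weight_a, weight_b):
    """Return a new grid: constant + weight_a * a + weight_b * b per cell."""
    return [
        [constant + weight_a * a + weight_b * b for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(grid_a, grid_b)
    ]


# Hypothetical Numeric coverages: capability class and depth class
capability = [[1, 2],
              [3, 4]]
depth = [[3, 3],
         [2, 1]]

# Assumed coefficients (see the note above)
yield_map = combine(capability, depth, constant=3.0, weight_a=0.5, weight_b=0.4)
```
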


How Much Class 2 Land 
Lies in the Municipal District of Franklin? 


The outline of the District is used to make a coverage
'FRA' showing the critical area with the symbol F and the
rest of the map as 0, by invoking the 'POLYGON' command. A
crosstabulation of 'FRA' with 'AGD' then shows that 671
cells have the required capability class. This and other new
coverages created during the session now appear in the
'COVERAGES' list with appropriate types. (See Figure 11.)


What is the Largest Continuous Block 
Of Class 4 Forestry Land?


The tabulations above have paid no attention to
contiguity, so that a total of 100 cells may exist as a
continuous tract, or as 100 small fragments. The system can
produce summaries of contiguous areas through the 'CONTIG'
command. An analysis of the map of forestry capability,
'FGD', shows that [number illegible] cells, or 54.425 square
miles, are available in one unit. (See Figure 12.)
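
The search for the largest contiguous block can be sketched as a
flood fill over the grid. This is an illustration only, not the
PLUS/2 'CONTIG' routine. It assumes that two cells are
contiguous only when they share an edge; the source does not say
whether diagonal neighbours count. The function name and the
sample grid are hypothetical.

```python
from collections import deque


def largest_block(grid, target):
    """Size, in cells, of the largest edge-connected block of `target`."""
    rows, cols = len(grid), len(grid[0])
    seen = [[False] * cols for _ in range(rows)]
    best = 0
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] != target or seen[r][c]:
                continue
            # Breadth-first flood fill from an unvisited target cell
            size = 0
            queue = deque([(r, c)])
            seen[r][c] = True
            while queue:
                i, j = queue.popleft()
                size += 1
                for di, dj in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                    ni, nj = i + di, j + dj
                    if (0 <= ni < rows and 0 <= nj < cols
                            and not seen[ni][nj] and grid[ni][nj] == target):
                        seen[ni][nj] = True
                        queue.append((ni, nj))
            best = max(best, size)
    return best


# Hypothetical forestry capability coverage
fgd = [["4", "4", "1", "4"],
       ["1", "4", "1", "4"],
       ["4", "1", "4", "4"]]
block_cells = largest_block(fgd, "4")
block_area = block_cells * 0.25  # square miles, one quarter section per cell
```
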


How Much Class 2 Land
Lies Within 2.5 Miles of the CNR Right of Way?


This is answered in three steps. First, the location of
the right of way is supplied to the system through the
'LINES' command, by giving the locations of points at which
the lines in the network change direction. Then the 'STRIPS'
command creates a coverage based on the Lines file by
distinguishing the cells within a critical distance of the
network by a particular symbol. Finally, a crosstabulation
of this new coverage with 'AGD' shows that there is no Class
2 land within the critical strip. (See Figure 14.)
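
The strip coverage can be sketched by measuring the distance
from each cell to the nearest segment of the line network. This
is an illustration only, not the PLUS/2 'LINES' or 'STRIPS'
routines. It assumes distances are measured in cells from cell
centres; the function names, the symbols and the sample line are
hypothetical.

```python
from math import hypot


def point_segment_distance(p, a, b):
    """Shortest distance from point p to the segment from a to b."""
    px, py = p
    ax, ay = a
    bx, by = b
    dx, dy = bx - ax, by - ay
    length_sq = dx * dx + dy * dy
    if length_sq == 0:  # the segment is a single point
        return hypot(px - ax, py - ay)
    # Position of the nearest point along the segment, clamped to its ends
    t = ((px - ax) * dx + (py - ay) * dy) / length_sq
    t = max(0.0, min(1.0, t))
    return hypot(px - (ax + t * dx), py - (ay + t * dy))


def strips(rows, cols, vertices, width, inside="S", outside="0"):
    """Grid marking cells within `width` cells of a line (two or more vertices)."""
    segments = list(zip(vertices, vertices[1:]))
    grid = []
    for r in range(rows):
        row = []
        for c in range(cols):
            d = min(point_segment_distance((r, c), a, b) for a, b in segments)
            row.append(inside if d <= width else outside)
        grid.append(row)
    return grid


# Hypothetical line along the middle row of a 3 x 5 grid
strip = strips(3, 5, vertices=[(1, 0), (1, 4)], width=0.5)
```

The resulting coverage can then be crosstabulated with any other
coverage in the same way as the earlier examples.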


CONCLUSION AND SUMMARY 


The second example illustrated the use of a geographic
information system to answer a set of queries that would be
largely impossible to answer by the manual analysis of mapped
information. The kind of system exemplified by PLUS/2 is
capable of providing answers to complex questions both
rapidly and cheaply, using data stored in grid form. While
it is possible in principle to perform the same operations
on polygon data, the computer processing times and costs are
much greater and far outweigh the corresponding increase in
accuracy. Furthermore, it is doubtful if a user could cope
with the volume of data produced in an analysis of polygon
data sets. The size of a grid cell in a grid analysis can be
adjusted to provide the level of resolution and
generalisation appropriate to a particular study, whereas
polygon data is constrained to a constant, high level of
precision.

The major expense of the system described here, and for
that matter of all information systems, lies in the
collection, preparation and maintenance of the data base.
This paper has been written from the planning point of view,
on the assumption that the agency using such a system has no
explicit responsibility for the acquisition of any
particular kind of data, but rather is concerned with making
the best use of geographic data sets maintained by other
agencies, such as Statistics Canada or the Canada Land
Inventory. As such, the final objective of the system must
be the ability to respond to queries such as those in the
second example; data storage and cartography are not likely
to be major objectives by comparison. The selection of data
structures is thus dictated by the need for grid form of
[text missing in source] levels of accuracy can be decided
in advance. But more frequently a planning study will
require various levels of accuracy as it moves from general
studies of an area to detailed examination of critical
zones. In such cases it is appropriate that data be first
encoded in polygon form, and then overlaid with various grid
cell schemes as necessary.

These arguments can be summarised in a scenario for a
typical study. The area to be studied is first delineated,
and enquiries made to determine the amount of data already
available in various agency data banks. Such data is likely
to be of polygon form, since agencies with a responsibility
for acquiring data will usually avoid any loss of resolution
in the encoding process. Additional data will be needed,
besides that available in data banks, and must be encoded
from maps. To avoid a premature choice of a level of
resolution, such data is best encoded in polygon form, the
precise method depending on the equipment available,
following the arguments made earlier in the paper.

An initial level of resolution is now determined, and
all available data gridded at that level to form a data base
for analysis. If the accuracy must be changed later to
permit a detailed analysis of part of the study area, the
polygon data sets can be regridded with no difficulty.
Initially, more information will be needed as the study
progresses and should be acquired in grid form if needed for
one level of resolution, or in polygon form if required more
generally.

Several current trends in the computer hardware
industry are likely to affect the area of geographic
information systems in the near future, in some cases by
altering these conclusions, in others by reinforcing them.
First, the cost per operation is likely to continue to drop,
along with the cost per unit of central memory. The effect
will be to improve the feasibility of automatic polygon
data. This should encourage the maintenance of large polygon
data sets, which will relieve planning agencies of much of
the responsibility for data collection for planning studies.
Secondly, the introduction of new forms of solid state
circuitry at vastly lower cost and smaller size is leading
to the introduction of parallel-processing systems, which
can perform many similar operations simultaneously. This
trend is of particular importance to geographic problems, in
which large arrays must be processed with highly repetitive
operations. Finally, although geographic arrays are large,
processing is basically sequential in many operations, such
as those performed by PLUS/2. Many of the features of large
computer systems (such as extensive core memory and direct
access disk) tend to be unnecessary, and indeed PLUS/2 can
be operated efficiently in a small mini-computer system with
a fast central processor and a sequential access disk but
very little core.

This paper has identified the major issues in
geographic data encoding and processing from the viewpoint
of applications in the planning field. Geographic data
processing is now entering a rather lengthy phase of
demonstration and application. Planners, not computer
technicians, must be made more aware of the possibilities it
offers, through more efficient responses to geographic
questions and through the new kinds of geographic analysis
that it permits.


REVIEW


J. Beaman


In the following, one will find no explicit comments on
either TN 15 or TN 42. As indicated in the Introduction, TN
15 is provided only as an example of the application of
methodology. TN 42 is a combination of general statements
about a particular system: it does not pursue the matter of
how a system should be developed, rather it presents some
alternatives. For these reasons there is really no basis for
constructive critique or other review. However, the reader
who wants some value judgments about developing geographic
information processing systems will find these elsewhere in
this Volume, where there are some quite pointed statements
about the value of using geographic information processing
systems. The reader may wish to refer to TN 27 and the
review remarks on it.

From a global and negative perspective, it is not
unfair to suggest that TN 24 does not fill a large gap in
the CORD Study analysis. The note offers too little accuracy
assessment. This is true in terms of the scope of activities
covered and in terms of depth of analysis to give the reader
any kind of good feeling for the general accuracy of the
CORD Study National Survey data and the problems with these
data. This is not a critique of the author but rather of the
CORD Study itself and a critique that applies to many
(actually most) other studies of people's participation in
activities. Fantastic amounts of money are spent every year
collecting information about which there exist major doubts
either about the reliability of the information gathered or
its validity. And when it comes to the matter of reliability
and validity, one touches on one of the strong points of TN
24.

There are many researchers who are thrust into a
research role even though their training may be in planning
or other fields. These people, as well as some researchers
who should know better, either show no concern with accuracy
of information that is collected or, particularly in the case
of statisticians, carry out some kind of accuracy analysis
on the basis of a split sample, which gives estimates of
reliability but which does not confront the basic issue:
the reliability of the information collected. One example
serves to show the kinds of concerns that should have been
dealt with in respect of all activities on which
participation data were collected in the CORD Study. When
one uses the 1972 CORD Study data on numbers of trips to
particular sites to estimate actual 1972 use, one gets two
to four times as much use as there actually was. The
dramatic way in which this illustrates the deficiencies of
the CORD Study National Survey data removes any need for
further comment on TN 24. Through the example, both the lack
of reliability and the lack of validity of CORD Study data
are made abundantly clear. The magnitude of errors (related
possibly to poor questionnaire designs, possibly to poor
interviewing or to something else) has resulted in people's
responses to questions having almost no relation to reality.

To a researcher with a theoretical or mathematical
orientation TN 38 has to be a rather fascinating technical
note, although the reader who looks for elegance in such
treatment is probably rather disappointed in that a number
of issues seem to be left hanging, for example:


(1) Why should one use the linear exponential
distance function in such an analysis?

(2) Why not start by introducing all the price
terms that are ultimately going to be
considered so that the results to be presented
can be derived in a 1, 2, 3 fashion?

(3) What are some of the "other" problems with the
formulation alluded to, other than the
condition that suggests that an increase of
price per visit of 10 cents results in the
same change in use whether the current charge
is $10.00 or 10 cents?

(4) What problems with other methods of estimating
consumer surplus, etc. does this method
overcome, and are there reasons to suggest
that this approach gives better results than
such methods, or does it merely hide problems
in a different mathematical way than they
are hidden by different approaches?


Still other matters can be cited as showing areas that
the paper did not explore and which would have been
desirable to explore, but in all fairness the objective of
the paper was to deal with one problem. A single paper
cannot deal with all matters related to a new methodology
nor can the author of the paper be expected to enumerate all
the issues that arise. So this paper is in some ways
problematic in the way in which earlier papers on estimating
demand functions have been.

The points raised above are presented because this
reviewer feels that the method proposed in TN 38 should be
used intelligently. Also, he feels that there are many
areas of research opened up by TN 38 which cry for further
exploration and that these areas should not be ignored
because of disputes among experts about the merits of the
consumers' surplus concept or because some economists say
certain approaches to certain problems should not be used.
In fact, in a political situation, choices must be made
based on the approach that the opposition will use.
Therefore, in resource allocation disputes, high resource
values corrected for time bias, etc. should be used and
defended rather than losing some dispute with resource
developers because researchers chose to be "purists".

The material in TN 8 reflects a shift in emphasis in
developing the "loading curve" methodology that took place
from 1971 until 1975 (when the note was put in roughly its
final form). Since the author of this review is one of the
co-authors of the note he knows that, originally, the paper
was to be about decomposing all use of a park into two
functions, a uniform function and a peaked function. After
the actual estimation exercise began it was realized that
such curves should have been developed for individual
origin-destination flows. So, even though the methodology is
described as if results had been obtained for
origin-destination (city-park) pairs, estimation results are
presented for the total use of certain parks for camping.

As research on park use has continued, it has become
clear that to capture the real "structure" associated with
visiting parks, it is not enough to develop uniform
functions for different origin-destination flows: they
should be developed for different classes of visitors.
Different functions apply to day-users, long stay campers,
short stay local campers and non-local users who obviously
had other trip purposes, etc. (e.g. see proposals for
modelling in TN 40). But it was also clear that the CORD
Study Park User Survey data were so poorly documented (see
the CORD Study Data Documentation Volume) that the data set
could not be used to estimate the coefficients for the kind
of disaggregated loading curves to which reference was just
made.

This obviously biased reviewer believes that this note
gives researchers insight into questions that should have
been asked in CORD Study surveys regarding types of park
visitors. In other cases it points up technical problems
regarding the need to build models for different types of
users based on properly collected and weighted data. Also
this reviewer believes that the use of the loading curve
decomposition method does have great potential in aiding
researchers to make effective use of the rather limited
information that should be collected in most surveys.

In an organization which is supposed to be efficient in
collecting information and utilizing it, it is important
that there not be "overkill" in data collection, so if using
such an analysis approach adds to the efficiency of use of
information this is good. But, as to the implications in
the article that the effects of weather can be derived,
doing this has more doubtful merit. A practical question is:
What information about the effects of weather on park use do
researchers usually need to generate for managers or for
planners? Obviously, if one has loading curves for a park
and these are broken down by different origins and one has
an idea how weather affects weekend, week-day etc. visitors
from near and far, then one can take historic weather data
and see what effects weather is expected to have in the
future. Unfortunately a park does not staff for the
exceptional season and analyses may be of more academic
interest than of practical interest. The idea of using such
information in park design may have some merit, but in
reality the kind of weather information that researchers
should make available to planners for master planning is
down to earth description of the weather in terms that
relate to the facilities that they are considering. A
[text missing in source] weather is often bad so that some
of the kinds of users who are predicted to come to the park
would not want to be outside, the researchers may ask the
planners: Is there provision for an indoor interpretation
centre or are there some alternative activities indoors that
relate to the objectives and design of the park and such
that weather will not be a problem in encouraging use of the
park?

The previous points make it clear that much work needs
to be done before the "loading curve" methodology can either
be endorsed for some purposes or generally rejected as of
little use except for some academic research projects.

In reviewing TN 21, one could present a commentary on
what happened in CORD Study and Parks Canada Park User
Surveys from 1968 to 1974 but it would be very similar to
the "hindsight" history presented for TN 8. This is because
what is presented in the note is some information about a
methodology that evolved. However, introductory comments to
this chapter and overview comments in Chapter XI cover some
relevant background details. So some specific innovations in
survey weighting and data collection are noted both to bring
them to the reader's attention and to be able to contrast
their relevance to general needs for data collection and
analysis methodology with other methods of data collection
which this reviewer now sees as more important in meeting
most of Parks Canada's needs. Regarding data weighting, a
procedure was employed that filled the time spent on lunch
or on other breaks by interviews. One has seen that such
gaps were "filled" by data that were collected near the time
at which a break was taken. This should help to ensure that
the interviews as nearly as possible "represented" the data
that would have been collected in the period in which no
interviewing took place. In many cases all that is done in a
survey is that information is collected for a certain day on
the total number of vehicles entering a park, and the
questionnaires for that day are given a weight that
establishes what part of total traffic (how many vehicles
out of the total traffic) a certain questionnaire should
represent.
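
The simple weighting just described, in which each questionnaire
stands for a share of the counted traffic, can be sketched as
below. This is an illustration only and not the procedure
documented in TN 21: the function name, the dictionary layout
and the counts are hypothetical. The same arithmetic applies
whether a period is a whole day or a half hour. Periods with no
completed questionnaires are simply skipped here; the filling of
break periods from neighbouring data is not reproduced.

```python
def expansion_weights(vehicle_counts, questionnaire_counts):
    """Vehicles represented by each questionnaire, per period.

    vehicle_counts: {period: vehicles counted entering the park}
    questionnaire_counts: {period: questionnaires completed}
    Periods without completed questionnaires are left out.
    """
    weights = {}
    for period, vehicles in vehicle_counts.items():
        completed = questionnaire_counts.get(period, 0)
        if completed > 0:
            weights[period] = vehicles / completed
    return weights


# Hypothetical half-hour counts at one gate
vehicles = {"09:00": 40, "09:30": 12, "12:00": 25}
questionnaires = {"09:00": 10, "09:30": 12}
weights = expansion_weights(vehicles, questionnaires)
```

In the sample, each questionnaire from the busy 09:00 period
represents four vehicles, while in the quiet 09:30 period every
vehicle was interviewed and each questionnaire represents one.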

This brings up the point that the weighting for a
particular interview is much more disaggregated under the
system presented than under usual weighting procedures. The
fact that information collected in each half hour is related
to entering traffic during that half hour (and that this is
done on the basis of the breakdown of that traffic by
origin, etc.) reflects a legitimate concern with getting as
much information out of a given amount of labour input as
possible (as well as a concern with accuracy). By having
what is described in the article as a "floating sampling
rate", it is possible to have field workers work hard when
traffic is heavy (by making contact with people in as many
of the vehicles as possible that pass through a gate) and at
the same time work as hard as possible when traffic is slow
by getting interview information from people in all of the
vehicles that pass through a gate. One should note that the
survey strategy used by Parks Canada from 1971-73, which
involves getting what is called "entry record information"
from every vehicle, is important because many of
management's questions can be answered by using only entry
record information, most of which is gained by simply
observing a vehicle and its occupants. These figures can be
obtained very accurately because many vehicles can be
stopped when only one or two questions are asked (e.g. do
they plan to stop in the park). If the parties in the
vehicles do not hand back their questionnaire (and 50% do
not) this entry record information can still be used in
improving the weighting of those questionnaires that are
returned.

The preceding comments serve to acknowledge that much
of what is recorded in the TN involves justified
modifications to usual survey procedures that either serve
to improve accuracy statistically and/or increase the cost
efficiency with which a given level of accuracy is attained.
However, it must be acknowledged that development of
computer programs to process the data collected, the rather
time consuming procedures of defining appropriate weights
for different survey days for which estimates had to be
made, etc. involve much more work than occurs when a survey
is processed by simply specifying a collection of day-by-day
weights to be associated with all data collected on those
particular days. Obviously, depending on what accuracy of
results is needed, there may be justification for employing
a very simple survey strategy. The amount of time that
survey staff will be used (related to the total data to be
collected) and the improvement in accuracy attained by using
more sophisticated surveying procedures that increase
processing and editing costs, and use excessive manpower,
etc. often justify "simplistic" but well thought out survey
procedures. But no comparative accuracy figures, procedures
for determining what accuracy will be achieved one way or
another, etc. are currently available so that the best thing
to do in a given situation is often problematic.

Possibly the most important thing to realize regarding
Technical Note 21 is that it presents details on a survey
procedure which, although carried out from 1971 to 1973, was
not the best procedure for collecting the information for
which there was really a need. One can now see that manpower
was used in an inefficient way in counting the vehicles:
traffic counters that cost less than one field "surveyor's"
salary for the summer could have been used. One may also
note that traffic counters do not take afternoons off, nor
do they take coffee breaks or lunch breaks: they usually
work seven days a week for 24 hours a day with a minimum of
care.

It is not unusual to find that there are money and
manpower available to carry out a survey that are quite out
of proportion to the resources available to decide what the
objectives of the surveys are and to take care in
operationalizing these into a data collection, analysis,
report preparation effort. For example, if, as has been the
case in the past, the primary use of survey information
collected at National Parks is to develop good use estimates
for those parks, it has been recognized that there is no
need to stop vehicles at park gates. Presently a new method
of getting data for use estimates is being tested in which
entry to parks is recorded by traffic counters at all park
gates. "Survey teams" simply obtain licence plate numbers
and other data (such as the number of occupants of a
vehicle) for certain blocks of time. Similar information
collected at park facilities (campgrounds, etc.) allows the
movement of a vehicle within a park to be traced and (for
example) allows length of stay of visitors in a park to be
monitored in a truly accurate way. It is surprising how much
information can be generated in this way.

Now a reasonable question is: What is the cost of this
procedure and are there operational problems involved in
employing it that may mean that, in many cases, a survey
strategy where entering or exiting parties are actually
interviewed may produce acceptable results at a lower cost?
One should note that having an objective record of what
people did in a park is often superior to results obtained
by survey because people often do not recall what they did
or know exactly what something they did is called, so that
they often cannot answer a questionnaire adequately. One
example is that when the standard park users survey
technique reported in TN 21 was "improved" and employed in a
survey in Prince Edward Island National Park, it was found
that so few people realized where they had been in the park
that use estimates for the Park were 50 percent higher than
the known actual use. (An error of over 250,000 visitors was
made in estimating the number of out of province visitors to
the park.) This kind of error simply cannot occur with a
study in which licence plate information is recorded. In
this latter type of study one is not dependent on a person
recognizing that certain locations are park boundaries.

Thus although TN 21 has presented some useful
innovations in survey weighting, it has also indicated that
refining weighting to gain slightly more efficiency in
estimates of some use figures may miss the real point of
what needs to be done. It is impossible to stress
sufficiently the importance of careful definition of
information generation objectives. Collecting information
that is more accurate than is needed, or more extensive
than is needed, is a park user survey inefficiency that
really needs to be eliminated. However, eliminating such
inefficiencies is far from simple, and it is only as
researchers proceed to build up a catalogue of the best
current procedures for information collection to meet
specific objectives that there can be widespread use of
appropriate research tools.

The key need today in recreation research is careful
definition of the objectives of research and careful
operationalization of these objectives in as efficient a way
as possible, by a thorough description of exactly how such
and such a table or factor analysis result will be presented
to a planner or manager to answer his question. The
researcher with practical concerns working for a parks
organization or at a university cannot be expected to be
totally aware of all of the statistical "tools" that are
best to achieve certain ends for the variety of projects
that arise. But this will be less of a problem when
specialized researchers have produced (and updated on a
regular basis) a procedures manual. Such a manual must not
just be a survey procedure manual but must have statements
of objectives related to an information need. Based on this,
a discussion should present how a specific information
gathering procedure is justified by explaining exactly how
the data obtained are to be processed to produce the tables,
etc. that are to be used in achieving the project objectives
by doing (a)..., (b)..., etc. It is this kind of complete
project analysis guide that is desperately needed. This
approach to project improvement is not described in TN 21.

There is commentary elsewhere in this volume on TN 10,
specifically in relation to Technical Notes 32 and 37. (See
also various commentaries in Chapters VI, VII, IX and XI.)
Actually, Technical Notes 32 and 37, as indicated in Chapter
VI where these notes appear, deal with many of the matters
that might be taken up in review comments here, so the
reader is referred to these notes for details on the
possible applications of the methodologies described in
TN 10 and a critique of the methodologies.


One point is not sufficiently stressed elsewhere:
determining natural "classes of people" as opposed to
mathematically defined homogeneous groupings. The concern is
with a scheme that allows individuals to be defined as
members of a class to which they belong even if some of
their characteristics are at odds with some characteristics
of other individuals in the same class. For example, who
would think, seeing a tamarack without its needles, that
although most people call it an evergreen, it is in fact a
deciduous conifer? That its needles fall off in the winter
is not the important factor to consider in classing this
tree as a conifer. Similarly, in the context of the outdoor
recreation demand study data, it is not surprising that in
the middle 60's very few people on the Canadian prairies
skied (nordic) during the winter. There were no ski hills!
So in defining activity packages it is critical that a
person not be excluded from a certain activity package,
which includes people with participation in other similar
activities, simply because at a given point in time they
live in a region in which they do not have facilities for
certain activities. Such people may even substitute
snowmobiling for skiing, so that their activity package
appears to be more disparate from that of some people with
whom they should "naturally" be grouped than if they simply
did not ski.

The clustering computer program used in deriving the
clusters presented in TN 10 was not set up in such a way
that these kinds of considerations were taken into account.
It is possible to argue that by using a special input
computer program which prepares data for analysis in a
special way, one could do a screening of information so that
skiing in one area would be equivalent to snowmobiling in
another. With data transformed in this way one could argue
that more "natural" classes of people could be derived.
Still, if one uses the "monothetic-divisive" clustering
program used in TN 10, a person is excluded from a cluster
because he does not "score" on one critical variable. The
answer is not simply to go to the kind of agglomerative
programs described in the Appendix to TN 10. At least with
these programs the fact that a person does not participate
in one specific activity does not eliminate him from a
cluster. Even so, one activity may weigh heavily in terms of
a person not being included in a given cluster. No,
regardless of which kind of clustering program is used, it
is essential that much thought go into introducing
availability of supply considerations into information on
participation so that a clustering program can "evaluate" a
given level of participation in terms of the availability of
supply. If supply is available and a person does not
participate in an activity, then an important question is:
is it the availability of supply for an alternative activity
that has resulted in the lack of participation? Has
substitution taken place or is this person legitimately
different from others in terms of his preferences for
activities? If the supply configuration confronting two
people is similar and one participates in one activity and
another person in another activity, substitution is probably
not an issue. Still, even faced with common supply, one
person may participate in one activity and another in a
different one only because of the relatively equal utility
of the two activities. But when supply varies an assessment
is much more difficult to make. Given the fact that TN 24
has shown something of our limited capability to indicate
how much supply a person perceives to be available, one may
see the impossibility of making much progress in studying
substitutability or improving clustering until there is some
progress in measuring people's response to supply in making
decisions (this is taken up in some detail from a
theoretical perspective in TN 33).
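
The kind of input screening discussed above, in which skiing in one area is treated as equivalent to snowmobiling in another, might be sketched as follows. This is only an illustration under stated assumptions: the region names, the activity class "winter_sport" and the equivalence table are hypothetical, and the sketch does not reproduce any program used in TN 10.

```python
def recode_activities(region, activities, equivalents):
    """Map each reported activity to a broader class before clustering.

    `equivalents` maps a region to a dictionary of activity -> class.
    Activities with no entry for the region are passed through unchanged.
    """
    region_map = equivalents.get(region, {})
    return {region_map.get(activity, activity) for activity in activities}


# Hypothetical equivalence table; a real one would have to rest on
# supply information for each region.
EQUIVALENTS = {
    "prairies": {"snowmobiling": "winter_sport"},
    "mountains": {"skiing": "winter_sport"},
}

prairie_person = recode_activities(
    "prairies", ["snowmobiling", "fishing"], EQUIVALENTS)
mountain_person = recode_activities(
    "mountains", ["skiing", "fishing"], EQUIVALENTS)
```

After recoding, the two people have identical activity packages, so neither a divisive nor an agglomerative program would separate them on the winter activity alone. The sketch says nothing about how the equivalences themselves should be decided, which is the substantive problem raised in the text.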

Obviously, much more theoretical writing on the problem
of forming clusters in a behaviourally meaningful way must
be done in parallel with discussion of how this theory can
be operationalized in terms of using various clustering
techniques. TN 10 has only demonstrated the feasibility of
two techniques for extracting information about clusters of
people and clusters of activities. Furthermore, in the
perspective of this reviewer, the procedure of factor
analysis actually has little application because it is not
tied closely to behavioural considerations except in the
context of an aggregate theory which should be of little
relevance in recreation planning. So in the note one has
really only seen demonstrated one analysis procedure which
may eventually have a great deal of utility but now has
limited utility (see TN 32).

TN 19 is a much simpler note on which to comment than
TN 21 or TN 10. It has simple objectives that were attained
and gives a relatively straightforward way to obtain more
efficient estimates for the parameters of gravity models
than are obtained by the ordinary least squares procedures
usually employed by researchers (e.g., as used in TN 1, TN 4
and TN 18). Also it was possible to define an absolute test
of model adequacy. The importance of this finding and the
practical application (and implications) of it are
adequately stressed and explained in the note.

There is one point which is not stressed, on which some
readers may find it useful to have some comments. If a model
is not structurally adequate for the data to which it is
applied, the use of GLS estimation does not necessarily
result in improved estimation efficiency. Yet this should
not be taken as a reason to avoid using GLS estimation. At
least if GLS estimates are used, valid tests on model
adequacy can be used to indicate that a given model is
inadequate. Certainly nothing is lost by using GLS
estimation. And what is gained, as is stressed in the note,
is that one can examine the residuals giving the difference
between observed flows and their predicted flows to see how
to improve a model.

TN 36 cannot be faulted on the basis of the achievement
of the basic objective that was set for the research
reported. The author proceeded in a rather elegant way to
show the value of R² that would be expected when models like
those presented in TN 12 were developed. However, some
readers may wish that the author had used a rather more
easily understood derivation that does not involve using the
calculus. Dealing with the problem using discrete
distributions and "simple" probability theory is possible
and quite easy. It is probably also true that the reader who
has a strong enough methodological orientation to read the
article has the calculus necessary. Still, in the view of
the reviewer, the average reader has a better chance of
getting a clear idea of what has been proved, why and how if
the discussion is translated into one using summations and
"discrete" probabilities.

The substantive grounds on which this article can be
criticized obviously do not relate to the main objective of
the article. Rather, when the author proceeds to introduce a
test for the structural adequacy of a model, he does not
consider that the ordinary least squares estimation approach
on which his formulas are based gives residuals which should
not be used in the test proposed. Basically, the author (as
indicated in TN 12 and a footnote in TN 20) is working with
a model in which the assumption is that one is calculating
probabilities of people's participation. These probabilities
(the dependent variable) when estimated define a
heteroscedastic dependent variable distribution, so the
author's test for structural adequacy must be made taking
this into account. His test equation should be replaced by:

$$
\chi^2(N - df) =
\sum_{k:\; d < P(k) < 1-d} \frac{\bigl(O(k) - P(k)\bigr)^2}{P(k)\,\bigl(1 - P(k)\bigr)}
\;+\;
\sum_{k:\; P(k) > 1-d \ \text{or}\ P(k) < d} \frac{\bigl(O(k) - P(k)\bigr)^2}{d\,(1 - d)},
\qquad d > 0
$$


The reason for using this formula is indicated in the note
below. If one estimates the model parameters, considering
the heteroscedasticity of the dependent variable, then it
is not enough simply to use the equation given above. The
formula would be used after the results of estimating the
model a first time have been used in a second GLS estimation
cycle where, for example, a first estimate of $\hat{p}(i)$
is considered to have a variance
$\hat{p}(i)\,(1 - \hat{p}(i))$ if
$0 < d < \hat{p}(i) < 1 - d < 1$, and $d\,(1 - d)$
otherwise. In the computer program developed by Parks Canada
for this, d = .01.
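
The two-cycle idea described above can be sketched for the simplest case of one explanatory variable. This is an illustration only, not the Parks Canada program: the function names are hypothetical, a single regressor stands in for the full multiple regression, and only the variance rule (p(1 - p) inside the interval (d, 1 - d), d(1 - d) otherwise, with d = .01) is taken from the text.

```python
def clamped_variance(p, d=0.01):
    """Variance p(1 - p), replaced by d(1 - d) when p is outside (d, 1 - d)."""
    if d < p < 1.0 - d:
        return p * (1.0 - p)
    return d * (1.0 - d)


def weighted_line_fit(x, y, w):
    """Weighted least squares fit of y = a + b*x; returns (a, b)."""
    sw = sum(w)
    mx = sum(wi * xi for wi, xi in zip(w, x)) / sw
    my = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxy = sum(wi * (xi - mx) * (yi - my) for wi, xi, yi in zip(w, x, y))
    sxx = sum(wi * (xi - mx) ** 2 for wi, xi in zip(w, x))
    b = sxy / sxx
    return my - b * mx, b


def two_cycle_fit(x, y, d=0.01):
    """Unweighted first cycle, then one cycle weighted by 1 / variance."""
    a0, b0 = weighted_line_fit(x, y, [1.0] * len(x))
    first_estimates = [a0 + b0 * xi for xi in x]
    weights = [1.0 / clamped_variance(p, d) for p in first_estimates]
    return weighted_line_fit(x, y, weights)
```

In practice `y` would be a list of 0/1 participation observations. Clamping the variance keeps the weights finite when a first-cycle estimate falls at or beyond 0 or 1, which a linear probability model does not prevent.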


SUPPLEMENTARY NOTE 


To try to give the reader further feel for how CORD
Study research has progressed, the following note has been
added to this chapter review, rather than changing the
material on the "chi-squared test of model adequacy" just
presented. While the author of this review was working on
revisions to what had been thought to be the final version
of TN 6 and reviewing the Smith and Cicchetti article (see
Appendix A), the practical import of the following
conclusion by Smith and Cicchetti became clear:


Most survey research using dichotomous dependent
variables for the estimation of linear probability
functions utilizes fairly large data sets (in
excess of one hundred observations) for
statistical estimation. As a consequence, "probit-
like" estimators impose substantial computational
costs. Therefore, OLS or GLS procedures are
frequently chosen. While our results cannot
discern the loss in efficiency if an OLS estimator
is utilized, they do indicate that there is little
or no gain from Aitken estimation for such large
data sets. Moreover, traditional hypothesis
testing with Student's t-test for the coefficients
of such models can probably be expected to yield
reasonably powerful tests with large samples.


When the estimates of variance in predictions of the
number of male hunters in Quebec were computed, it was noted
that with regression coefficients based on 1291
observations, both unweighted and weighted regression
results gave equally accurate predictions. A question had
to be answered: Why go to the trouble and expense of
carrying out weighted regressions? In reality, expense was
the big concern because each weighting iteration requires
that data on every observation be weighted, that weighted
means be computed, etc., until finally a matrix inversion,
which may be costly, is carried out and new regression
coefficients are produced. For some CORD Study research,
getting weighted regression results based on obtaining
initial estimates and then doing a GLS estimate would have
involved reading data on 2,000 people about 100 times and
inverting a 40-by-40 matrix the same number of times.

Why the expense? At first, the obvious answer appeared
to be so that tests of model structural adequacy, such as
the one described just before this supplement, could be
performed. However, some thought made it clear that if
parameter estimates are not affected much by weighting when
sample sizes of concern are 600 to 25,000 (which are what
are being used in CORD Study analyses), one could use the U
and B(-)'s computed in the unweighted regression to
calculate:


$$
\chi^2(DF) = \sum_{i} \frac{\bigl(O(i) - P(i)\bigr)^2}{P(i)\,\bigl(1 - P(i)\bigr)}
$$

where the sum is over the i observations;

if P(i) or 1-P(i) is less than .01 then their product
is set to .01;

DF is the number of observations minus the number of
parameters estimated;

O(i) are the observations;
P(i) are the OLS predictions.
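
The calculation just defined is short enough to sketch directly. The function name and argument names below are hypothetical; the floor of .01 on the product P(i)(1 - P(i)) and the definition of DF follow the definitions given above.

```python
def chi_squared_adequacy(observed, predicted, n_params, floor=0.01):
    """Return (chi_squared, degrees_of_freedom) for 0/1 observations.

    `predicted` holds the OLS predictions P(i); `n_params` is the
    number of parameters estimated in the regression.
    """
    total = 0.0
    for o, p in zip(observed, predicted):
        if p < floor or (1.0 - p) < floor:
            variance = floor            # product set to .01, as in the text
        else:
            variance = p * (1.0 - p)
        total += (o - p) ** 2 / variance
    return total, len(observed) - n_params


# Small made-up example: four observations, one estimated parameter.
chi2, df = chi_squared_adequacy([1, 0, 1, 0], [0.5, 0.5, 0.5, 0.5], n_params=1)
```

Because the function needs only the observations and the predictions, it can be run in a single pass over the data after the regression coefficients have been written out.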


If the observations are not stored in the computer for
carrying out a regression when a regression is carried out
(and from a cost efficiency point of view they should not
be), the kind of chi-squared values suggested can be
computed by writing out the regression equations along with
dependent variable identification, so that even if 50
equations are produced based on the same explanatory
variables (or fewer dependent variables dichotomized in
different ways), one need only read the dependent variables
once and compute all the chi-squared values to test the
adequacy of, say, the 50 models.

Actually, when the chi-squared values were being
generated as suggested above, other useful statistics on the
"quality" of a given model could be generated. A problem is
that almost invariably there are P(i)'s that are less than 0
or greater than 1 and thus do not make sense as
probabilities of participating, or when otherwise
interpreted. Counting such cases based on how negative they
are and even writing out data on them for further analysis
can be very valuable. Often, seeing what characteristics
"outliers" have is important in identifying problems with a
model.
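
Counting the out-of-range predictions described above might look like the following. The bin width of .05 and the function name are assumptions made for the sketch; the text does not say how the counts were grouped.

```python
import math
from collections import Counter


def out_of_range_report(predicted, bin_width=0.05):
    """Count negative predictions by how negative they are, and list
    the positions of all predictions outside the interval [0, 1]."""
    negative_bins = Counter()
    outliers = []
    for i, p in enumerate(predicted):
        if p < 0.0:
            # Bin 0 holds values just below zero, bin 1 the next band down, etc.
            negative_bins[math.floor(-p / bin_width)] += 1
            outliers.append(i)
        elif p > 1.0:
            outliers.append(i)
    return negative_bins, outliers


# Made-up predictions for illustration.
bins, outliers = out_of_range_report([0.4, -0.03, -0.12, 1.2, -0.01])
```

The list of positions can then be used to write out the full records of the "outliers" for further study.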

In closing one should note that the Appendix to TN 29
shows why the chi-squared test of model adequacy for hunting
carried out in TN 6 did not show that the model was
inadequate. Only with the over 25,000 observations
available in the data used to compute supply or with similar
large data sets does the "supply factor" sum of squares
become large enough compared to the TSS that it would be
detected by concluding that structural error was arising for
some reason.

But as reported in TN 29 (and in TN 6 and TN 20), if
supply factors or other "effects" that should be considered
are ignored, predictions made using a model may be seriously
in error EVEN if estimates of the accuracy of the
predictions show only a few percent error is likely to
occur. Accuracy estimates, based on the method presented in
TN 6, are appropriate IF a model is structurally sound. As
stressed elsewhere in this Volume, the kinds of chi-squared
test introduced at the end of this review are weak tests. If
one rejects a model as unsound, he can be confident, but if
he accepts it as sound he cannot be confident that estimates
made using the model will be as accurate as he may estimate,
because undetected structural problems with the model may
lead to substantial biases in many estimates.

It was only while these review comments were being
prepared that it was recognized that the use of the kind of
iterative variance estimation (heteroscedasticity
correction) just described gives a test of model adequacy
similar to that developed in TN 19. When GLS (weighted)
regression was used in TN 29 to explain people's
participation in activities, after the second estimation
cycle one finds that:


Residual Sum of Squares = 31759 
Degrees of Freedom = 25219 


and because the regression has been carried out by
normalizing every dependent variable observation by its
variance, the residual sum of squares is a sum of squares of
approximately zero-one variables with 25219 degrees of
freedom if the model is structurally sound. But one can test
the value of RSS to see if it probably came from a chi-
squared distribution with 25219 degrees of freedom by
considering that, because $\mathrm{RSS} = \chi^2$,
$d = \sqrt{2\,\mathrm{RSS}} - \sqrt{2(25219)}$ is
approximately an observation of a normal zero-one variable
if the model is structurally sound. But d is approximately
18, which means that RSS is too large by so much that it
would have the same probability of occurring as an
observation of 18 or greater in sampling from a normal
zero-one variable. This will occur less than one time in
10**50 if a rough underestimate is correct. Obviously there
is room for improving the TN 22 model.
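
The normal approximation to a chi-squared value with many degrees of freedom, as used above, can be sketched as follows. The function names are hypothetical and the input values in the usage line are made up for illustration. The form coded here subtracts the square root of twice the degrees of freedom; Fisher's usual statement of the approximation subtracts the square root of (2 DF - 1), a difference that is negligible when DF is in the thousands.

```python
import math


def chi_squared_normal_deviate(chi_squared, df):
    """sqrt(2 * chi_squared) - sqrt(2 * df), roughly N(0, 1) for large df
    when the chi-squared value really has df degrees of freedom."""
    return math.sqrt(2.0 * chi_squared) - math.sqrt(2.0 * df)


def upper_tail_probability(z):
    """P(Z >= z) for a standard normal (zero-one) variable Z."""
    return 0.5 * math.erfc(z / math.sqrt(2.0))


# Hypothetical values, chosen only to show the calculation.
d = chi_squared_normal_deviate(chi_squared=120.0, df=100)
p = upper_tail_probability(d)
```

A large positive deviate means the residual sum of squares is larger than a structurally sound model would be expected to produce.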
CHAPTER VIII


ALLOCATION AND EVALUATION MODELLING 


INTRODUCTION 


The Technical Notes in this chapter logically fall into
three groups. The first group, on demand function
estimation, contains only one note and could just as well
have been associated with TN 38 in the last chapter. In it
the reader will find that Cheung and Knetsch have developed
an answer to the question: What effect does time bias have
on the value of a Canadian Park? This matter was considered
important enough by Knetsch that he and Cheung corresponded
between Canada and Malaysia in order to bring the note into
its final form. Ignoring time bias in estimating the value
of a park means underestimating that value and risking the
loss of the park (or the land that could be developed as a
park) when in fact some other use is less valuable, at least
in terms of the economic theory behind the demand model.

The first large group of notes in this chapter deals
with allocation models and allocation evaluation models. All
of the research presented arose independently of the CORD
Study. The development of the potential model originally
occurred because of a Canada Council-supported project
(under Dick Butler, Department of Geography, University of
Western Ontario) to develop social indicators as part of the
Canadian Social Indicator Project. The fortunate
happenstance is that CORD Study data were available when
Butler was to work on recreation indicators and this
facilitated the development of a methodology for indicators
which is reported on in TN 5. Butler's original project was
revised by Ross (with further input by Ewing), resulting in
the final version of the paper. This evolution of the note
from conception to completion was over a period from 1971
until 1975.

TN 17 arose because Ontario (as part of the tourism and
outdoor recreation planning project, TORPS Study) was very
explicit in stating objectives of policy, planning, etc.
When it was recognized that the large scale planning model
would not be available when they needed input for certain
decision making, the question was asked: What can we do with
that part of the work which is completed now? So a model was
developed that could be used with information on the
location and amount of supply, and location and amount of
population, along with the policy information about how
policy objectives were to be pursued, to produce an output
that showed the nature of the present allocation of supply
with respect to population. The model, which produced a
pressure (per capita amount of supply), could also be used
to indicate how a park development option to be pursued
would change pressure on facilities (hopefully in a
desirable way).

The paper on sensitivity testing of linear programming
was prepared because of CORD Study researchers' concern
about the growing use of automated evaluation methods
without the necessary awareness of the difficulties that can
be encountered when such methods are used. Work in which a
CORD Study researcher (Auger) had been involved, and which
built on a paper by Manning, who was with the Canadian
Forest Economic Research Institute, was the stimulus that
prompted Auger and Cheung to critique Auger's work by
looking at what might be wrong with a linear programming
solution to the problem that Auger solved using linear
programming. Actually, it is quite interesting that doing a
sensitivity test of the solution to a fairly arbitrarily
selected problem (which was originally designed to show the
merits of using linear programming) results in such
clear-cut evidence that using linear programming without
doing sensitivity analysis may result in making a poor
choice of projects! This is true even when one does not
consider the matter of whether a good objective function can
actually be defined.


TN 25 was prepared because planners and managers often
have little time to do research before making a decision. So
in a situation where a decision must be taken quickly by
comparing two or more alternative proposals it is important,
when operating under PPBS, to carry out as rigorous an
evaluation as time allows. This article critiques a
traditional, quick evaluation method and offers constructive
suggestions for modifications that result in defining a new
procedure which is explained and illustrated.


The methodology in TN 26 is little changed from the
form in which it was developed in 1967-68 in order to study
the recreation needs of the people in a city. It was revised
only in 1975 when it was realized that no analysis
methodology had been presented which could use CORD Study
National Survey data to evaluate current participation
levels in order to make recommendations for facility
development, for program evaluation or for program planning.
This methodology is oriented to presenting results to
planners and managers who want to compare the way in which
people in different areas who are in the same age-sex group
are being serviced. Obviously whether such a methodology
should be used depends on the objectives of the planners and
managers.

Only a few brief notes are required concerning the
concluding Technical Notes in this chapter. TN 39 was
prepared because it was recognized that a volume such as
this one should have something in it on Canadian work on The
Economic Impact of Parks. The lack of any guideline work on
preparing impact studies was so pressing in the late 60's
that Parks Canada produced a statement as part of the work
carried out in evaluating the impact of what was then the
proposed Gros Morne National Park. The note which appears
here is a collection of still relevant extracts from that
statement. TN 40 is in some respects the end product of
research that began in 1964 when, in line with good research
practice, information was collected about the area in which
a new park was to be created so that its impact could be
evaluated later. After the park was created and after the
Canadian Outdoor Recreation Demand Study had been conceived
and begun (actually in 1973), data were again collected in
the area of the park. Many of the people then interviewed
were people from whom data had been collected nine years
earlier. So, work on TN 40 proceeded based on before and
after data. Despite the fact that critical information was
not collected in 1964 it was still possible to glean
something about the major discrepancy between the original
plan and the actual situation in 1974. Thus TN 40 shows
planners and managers, from the experience of the
development of a particular park, what might have led to
clearer statements of expectations and a more reasonable
relationship between a planned product and what exists at a
subsequent time. Finally, the other note in this section was
prepared because there was a need for information on the
success or lack thereof of commercially operated
campgrounds. The original author of this article had carried
out such research in Western Canada and for the preparation
of this Note carried out similar research in Eastern Canada
so that he was in a position to make statements that
empirically relate to both Eastern and Western Canada. In
1975, others became interested in using O'Riordan's work
with respect to policy issues, and this interest is
reflected in the current version of TN 41.
CORD STUDY TECHNICAL NOTE 31


THE ECONOMIC VALUE OF RECREATION AREAS: 
THE CASE OF SASKATCHEWAN PARKS 


J. L. Knetsch and H. K. Cheung


ABSTRACT 


Contrary to what many have believed, outdoor recreation
is amenable to economic analysis and is quantifiable. The
expression of the economic value ascribed to the use of a
park is the visitors' willingness to pay for this use. More
operationally, this can be measured as the area under the
aggregate demand curve for the recreation services provided
by such use of resources, over a period of time. To yield a
net figure, the variable costs (usually of operation and
maintenance expenses associated with the use) would need to
be subtracted and any external values, such as preserving
scientifically valuable artifacts, would need to be added.
For most areas, the major problem centers on establishing a
reasonable estimate of the demand schedule.

The paper describes a method whereby a meaningful
estimate of economic value for a park can be derived. Data
from a series of parks in Saskatchewan and a hypothetical
proposed area are used to illustrate the method. The initial
procedure is to estimate a relationship between use of parks
and the various factors that influence this, such as
population proximities, attractivity of the parks, and
competitive facilities. From this use relationship a demand
schedule can be imputed for a park by hypothesizing
increasing costs to visits and estimating the consequent
expected decrease in use totals. Then, it is possible to
calculate the economic value of the park by measuring the
total area under the demand curve obtained.


INTRODUCTION 


Substantial sums are devoted to the provision of public
outdoor recreation facilities and far larger ones are in
prospect over the coming years. Presumably judgements of
comparative worth of alternative proposals or of benefits
and costs could be improved with some reasonable estimates
of expected demand for alternative sites and meaningful
measures of the value of such use of resources. This is not
to suggest that such estimates should be the sole
determinant; other factors such as the distribution of
sites, availabilities to all segments of the community, and
provision of a wide range of types of facilities should also
be considered. Predictions of the use expected at a new
facility can usefully be based on experience at similar
existing sites, by casually noting similarities and drawing
analogies or by establishing more formal statistically
derived relationships. The expression of the economic value
ascribed to the use of a park can, for the most part, be
taken to be the willingness to pay for this use on the part
of visitors to a park. More operationally this can be
measured as the area under the aggregate demand curve for
the recreation services provided by such use of resources.
To yield a net figure the variable costs, usually of
operation and maintenance expenses, associated with the use
would need to be subtracted, and any external values, such
as preserving scientifically valuable artifacts or
preserving future options for use, would need to be added.
For most areas, the major problem centers on establishing a
reasonable estimate of the demand schedule.

The intent here is to establish demand relationships
and to derive a meaningful estimate of economic value for a
park, using data from a series of parks in Saskatchewan to
illustrate the methods. Parameters estimated on the basis
of visit patterns to these parks are used to project the
demand for a hypothetical proposed area and for the economic
value of the expected recreation use. The initial procedure
is to estimate a relationship between use of parks and the
various factors that influence this, such as population
proximities, attractivity of the parks, or competitive
facilities. From this a demand schedule can be imputed by
hypothesizing increasing costs to visits and estimating the
consequent expected decrease in use totals from the visit
estimation model.


PARK VISIT MODEL 


Visitor patterns for eleven provincial parks and one
national park in Saskatchewan were studied.

Multiple regression of the visit rates from the origin 
areas to the twelve parks, and the four independent 
variables, resulted in the equation: 


$$
V(i,j) = -2.693
- \frac{663.670}{D(i,j)^{3/2}}
+ \frac{580.837\,P(i)}{D(i,j)^{3/2}}
+ \frac{1606.160\,T(j)}{D(i,j)^{3/2}}
- \frac{178.760\,P(i)\,A(i)}{D(i,j)^{3/2}}
\tag{1}
$$


where V(i,j) is the number of vehicles, in hundreds,
travelling to park j from origin i; D(i,j) the road
distance in miles between i and j; P(i) the population, in
thousands, in the origin area; A(i) the measure used to
determine the accessibility of alternative parks for people
at each of the origins; and T(j) the factor used as an index
of the attractivity of each of the parks.
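
The structure of equation (1) can be sketched as a small function. This is an illustration only: the function name is hypothetical, and the coefficients in the usage example are round, made-up numbers chosen for easy arithmetic, not the fitted values. The signs of the made-up coefficients follow the signs of the corresponding terms in equation (1).

```python
def predicted_visits(distance, population, attractivity, alternatives, coef):
    """Vehicles (in hundreds) from one origin to one park.

    distance      D(i,j), road miles
    population    P(i), thousands
    attractivity  T(j), park attractivity index
    alternatives  A(i), accessibility of alternative parks
    coef          (constant, b_distance, b_population,
                   b_attractivity, b_interaction)
    """
    b0, b_d, b_p, b_t, b_pa = coef
    decay = distance ** 1.5          # the D(i,j)**3/2 term shared by all regressors
    numerator = (b_d + b_p * population + b_t * attractivity
                 + b_pa * population * alternatives)
    return b0 + numerator / decay


# Round, purely illustrative coefficients (NOT the fitted values).
ILLUSTRATIVE_COEF = (-2.0, -600.0, 500.0, 1500.0, -150.0)
v = predicted_visits(distance=100.0, population=10.0, attractivity=2.0,
                     alternatives=1.0, coef=ILLUSTRATIVE_COEF)
```

Since every explanatory term is divided by the same power of distance, the prediction falls off quickly with distance regardless of the size of the origin population.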

Each of the coefficients was found to be significantly
different from zero at the 5 percent level, and 90 percent
of total variance among the observations was found to be
explained by these terms. While each of the four independent
variables was found to be significant, the term
$P(i)/D(i,j)^{3/2}$ alone accounted for the major proportion
of the variation, reflecting the very strong dependence of
Saskatchewan day-use recreation activity on proximity of
population centers to the areas. (A series of different
exponents on the distance term, ranging from 0.5 to 3, were
tested, with 3/2 and 2 being equally satisfactory and
superior to the others in terms of the explained
variations.) The relationship implied by the equation seemed
plausible in terms of what is known about recreation visit
patterns, especially in its accounting for the expected
interactions among the variables and the strong influence
which proximity exerts on visit totals.

The use estimating equation should provide a reasonable
basis for predicting, for example, the expected use of a new
or proposed site in the region. As such it should be a
useful tool to deal with a common planning problem. However,
beyond this it can also serve as the basis for estimating
the value of such a proposed recreation area.


VALUE ESTIMATION 


An estimate of the probable use of any proposed site in
the region, similar to the parks included in the original
surveys, and a demand schedule associated with this use, can
be derived from the recreation visit model.

Given a proposed location, values for the distance,
population and alternative attraction variables can be
calculated just as they were for the existing parks used in
the original analysis and as they were in TN 1. The number
of miles to each origin resident area and the size of the
population are either known or can be easily looked up or
measured. Similarly, the alternative factor values can be
computed for each park. The attractivity or facility index
of the proposed site must be based on an assumed development
or provision of facilities. Given the values for the
independent variables, an estimate of attendance for the new
area is obtained by substituting in the equation to obtain
the number from each origin area, and summing over all of
them to reach the total.
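
The substitution and summing step can be sketched in Python
as follows. This is only an illustration of the form of
Equation 1: the coefficient values, the Origin record and
the treatment of negative predictions as zero are
assumptions made for the example, not figures or rules taken
from the Saskatchewan analysis.

```python
from dataclasses import dataclass


@dataclass
class Origin:
    population: float    # P(i), thousands of people
    distance: float      # D(i,j), road miles to the proposed site
    alternatives: float  # A(i), accessibility of alternative parks


def visits(origin, attractivity, coef, exponent=1.5):
    """Vehicles (in hundreds) from one origin, using the form of Equation 1."""
    b0, b1, b2, b3, b4 = coef
    d = origin.distance ** exponent
    v = (b0
         - b1 / d
         + b2 * origin.population / d
         + b3 * attractivity / d
         - b4 * origin.population * origin.alternatives / d)
    # Assumption: a negative prediction is read as "no visits".
    return max(v, 0.0)


def total_visits(origins, attractivity, coef):
    """Sum the per-origin estimates to get the site total."""
    return sum(visits(o, attractivity, coef) for o in origins)


# Hypothetical coefficients and origins, for illustration only.
example_coef = (-0.5, 10.0, 50.0, 100.0, 5.0)
example_origins = [
    Origin(population=120.0, distance=105.0, alternatives=1.2),
    Origin(population=15.0, distance=40.0, alternatives=0.8),
]
print(total_visits(example_origins, attractivity=2.0, coef=example_coef))
```

The attractivity index T(j) is passed in separately because,
for a proposed site, it rests on an assumed level of
development rather than on observed data.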

For the present purpose a hypothetical proposal is used
for a park that might be located a little over 100 miles
southwest of Saskatoon and not very close to any sizeable
population centers, and having a fairly typical complement
of facilities and attractions. For this set of assumed
conditions the TN 1 model can be used to predict a yearly
total of 13,420 vehicle parties that would visit the site -
a number slightly below the average of 14,404 vehicles that
went to the twelve parks used to generate the estimating
parameters. The 13,420 estimate can be taken as one point on
the demand curve to be imputed for the purpose of value
determination: the quantity at zero price for the use of the
services of the site. Estimating other points requires
further calculations of expected use, but at what can be
taken to be various increases in the cost of entering the
park. This can be done through the use of the distance term
in the equation. Distance is in effect a proxy for price in
the sense that increases in distance from origins to parks
serve as a deterrent to visits. Initially, it can be assumed
that the cause is the increased cost of travel associated
with increasing distances. (The original suggestion for this
procedure was given in Reference 12.) As the present case
deals only with day-users, or those without expenditures
connected with overnight stays, the variable costs of
vehicle operation can by and large be taken to represent the
cost constraint. For the present (1973) purpose this was
assumed to be $0.07 per mile, or $0.14 for the two-way
distance.

Successive points on the demand curve can be imputed by
summing the estimated visits from each origin under
assumptions of successive added travel costs in the form of
added distance. For example, the visit total corresponding
to a "price" of $4.20 per vehicle is found by adding 30
miles (30 miles at $0.14 per mile is $4.20) to each of the
original mileages for each of the origin areas. An estimate
of use can then be made for each and the results summed to
give the total visits. In this instance the calculations
yield an estimate of 5,450 vehicle parties. That is, 13,420
parties would be expected at no increase in travel cost, but
this would fall to 5,450 if each were faced with an added
outlay of $4.20. Other points can be derived in analogous
manner to impute the relevant demand curve for the site, and
the area under the curve can be measured to yield an
estimate of the value. In the present case the demand curve
calculated in this manner indicates a value of about $90,000
for the season.
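
The imputation just described can be sketched in Python as
below. The $0.14 per added two-way mile is the figure given
in the text; everything else is an assumption for
illustration: the stand-in visit function, the origin data
and the use of the trapezoid rule to measure the area under
the curve.

```python
def demand_schedule(origins, visit_fn, added_miles, cost_per_added_mile=0.14):
    """Impute (price, total visits) points by adding distance to every origin.

    origins  : list of (population, distance) pairs
    visit_fn : callable(population, distance) -> estimated visits
    """
    points = []
    for extra in added_miles:
        price = extra * cost_per_added_mile
        total = sum(visit_fn(pop, dist + extra) for pop, dist in origins)
        points.append((price, total))
    return points


def area_under(points):
    """Trapezoid-rule area under a demand curve given as (price, visits)."""
    area = 0.0
    for (p0, q0), (p1, q1) in zip(points, points[1:]):
        area += (p1 - p0) * (q0 + q1) / 2.0
    return area


# Hypothetical stand-in for the fitted visit model and for the origins.
def toy_visits(population, distance):
    return 100.0 * population / distance ** 1.5


toy_origins = [(120.0, 105.0), (15.0, 40.0), (30.0, 75.0)]
curve = demand_schedule(toy_origins, toy_visits, added_miles=range(0, 210, 30))
for price, total in curve:
    print(f"${price:5.2f}  {total:8.2f}")
print("value estimate:", area_under(curve))
```

With visits expressed in vehicle parties and price in
dollars per vehicle, the area is in dollars for the period
the visit estimates cover.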


TIME BIAS 


The model is used to develop an estimate of the demand
curve based essentially on observed behaviour of visitors.
That is, the empirical relationship between distance and
visit numbers is estimated from actual use response of
visits to Saskatchewan parks. It therefore avoids one source
of arbitrariness of presumed response to proposed
circumstances. However, the resulting estimate is subject
to a serious bias. This is caused by the implicit
assumption that the only reason that visit rates vary
between origin centers of varying distances to a park is the
difference in money outlay necessary to travel the added
distance. This is certainly not likely to be the case. In
particular it would be expected that the time required to
travel greater distances would be a major reason for the
observed variation in visit rates.

Thus, when the model implies how far visits will fall
with increased money costs, the decrease is consistently
over-estimated. The visit rate for any origin will decrease,
but not to the extent indicated by the observed distance
decay function, for only money costs are assumed to have
increased, with the travel time remaining constant (when the
simple TN 1 model is used). For instance, if money costs
equal to an added distance of 30 miles were added to an
original distance of an origin from a park of 20 miles, the
visit total could not realistically be expected to be that
of an origin 50 miles from the park. The money outlay is
that of 50 miles, but the time cost is still that of the
original 20 miles. The demand curve is then conservatively
biased for all of the points except the zero "price" point.
More visits would in fact be expected for each assumed money
cost increase than would be calculated directly from the
model.


A CORRECTION 


The bias problem resulting from a lack of an accounting
of the effect of time could be overcome if it were possible
to estimate parameters for a model that would have both
travel time and money costs as separate variables. The money
costs could then be incremented for each origin with travel
time held constant in the same way that the factor measuring
alternative or substitute areas is now maintained at a
constant level for each origin. However, this is usually
impossible owing to the high correlation between travel time
and money costs among trips of varying lengths - longer
journeys cost more and take more time (see TN 33). Toll
roads, for instance, could introduce some variance but in
the main it is impossible to establish meaningful estimates
of the independent effect of each.

An alternative procedure can be used to make a
correction for the present omission. This is to imply a
trade-off between time costs and money costs by replacing
the distance term in the model with a "composite" variable
encompassing both time distance and money distance. (See
Reference 12.) Instead of the term D(i,j), a substitute
combining "money miles" (D(C)) and "time miles" (D(T)) can
be used.

The specific form of such a composite variable will
reflect the presumed shape of the money-time trade-off
expected to prevail. For example, if a linear trade-off is
judged to be appropriate and if it is assumed that the time
it takes to travel a mile is of equal importance in
restraining visits to the money necessary to travel the same
mile, then a variable (D(C) + D(T))/2 might be used. Thus
for an observation or origin having an original distance
value of say 20 miles, this would be replaced by ((20 +
20)/2), which equals the original 20. However, to calculate
the demand schedule, increments are made to the D(C) term
while holding D(T) constant. The resulting calculations
yield a correction for the time bias, in that an accounting
is made for the effect of travel time. It is, of course,
explicitly dependent on the chosen form of the variable.

An additive form will imply a linear trade-off. But
different weights can be used to reflect the relative
importance of travel time and money costs as impediments to
travelling greater distances. For example, if it is
hypothesized that the effect of the money it costs is twice
as important as time, this can be reflected by the use of a
variable (2/3 D(C) + 1/3 D(T)). Any weights can be used,
with a zero weight for time being the original formulation
which took no account of the time bias problem.
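
As a worked illustration of the linear forms, the short
Python sketch below evaluates the composite variable for the
case used earlier: an origin 20 miles from the park with 30
"money miles" added. The function name and its money_weight
parameter are hypothetical; the weights are assumed to sum
to one, as in the two variables given in the text.

```python
def composite_distance(money_miles, time_miles, money_weight=0.5):
    """Linear composite of D(C) and D(T); the two weights sum to one."""
    return money_weight * money_miles + (1.0 - money_weight) * time_miles


original = 20.0   # both D(C) and D(T) start at the observed distance
added = 30.0      # increment applied to D(C) only

for label, w in [("equal weights", 0.5),
                 ("money twice as important", 2.0 / 3.0),
                 ("no weight on time", 1.0)]:
    d = composite_distance(original + added, original, money_weight=w)
    print(f"{label:26s} composite distance = {d:.1f}")
```

Under equal weights the origin is treated as if it were 35
miles away, under the 2 to 1 weighting about 40 miles, and
with no weight on time the full 50 miles of the uncorrected
model.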

A curvilinear trade-off could be reflected in a
variable such as (D(C) D(T))**1/2, with differing shapes
requiring differing weightings. A form convex to the origin
might be reasoned on the presumption of an observed strong
tendency for the marginal effect of either an added minute
or an added dollar to have a diminishing effect on
decreasing visits the longer the trip - an added increment
has less effect on trips of 100 miles than on ones of 10
miles. (The convex notion is similar to supposing convex
indifference curves from generally observed diminishing
utilities.)
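
A small Python sketch of the curvilinear form named above,
(D(C) D(T))**1/2, is given here for comparison with the
linear case. The function name is hypothetical and the
numbers repeat the 20 mile origin used earlier; no
particular curvature is being recommended.

```python
import math


def geometric_composite(money_miles, time_miles):
    """Curvilinear composite (D(C) * D(T)) ** 1/2."""
    return math.sqrt(money_miles * time_miles)


time_miles = 20.0
for added in (0.0, 30.0, 60.0, 90.0):
    d = geometric_composite(time_miles + added, time_miles)
    print(f"added money miles {added:5.1f} -> composite {d:6.2f}")
```

Each further 30 money miles raises the composite by less
than the previous 30 did, which is the diminishing marginal
effect the convex form is meant to express.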

A more definitive choice of variable form and
weightings should in time be amenable to empirical
verification. Some evidence is provided by Mansfield which
suggests that the weight or importance of time in overcoming
greater distances to recreation sites is perhaps of greater
importance than the money costs involved (see Reference 47).
As he notes, the issue is the disutility of time traveling
to a recreation facility and not that of traveling, say,
within a park. By the same token, it might well be expected
that the average impediment of time would vary, for
example, with the nature of the scenery or local amenity.
Travel time would be less of a burden for a journey through
semi-spectacular landscapes than one through more drab
countryside, and the weights for the composite variable
could, in principle at least, reflect this. At present,
alternatives that appear to encompass reasonable assumptions
can be used.

Another attraction of this type of formulation is that
it provides a more consistent rationale for cutting off a
demand curve that is asymptotic to the vertical axis, for
purposes of counting of benefits (i.e. measuring in dollars
the area under the curve). This asymptotic property can
arise, for example, when a model is estimated in logs, or
when a constant term in other forms is positive. If it is
known from the original data that about the farthest source
of visits is, say, 200 miles, then when the combined value
of D(T) and D(C) equals 200 this can be taken as the cut off.
This is, however, quite different from having D(C) alone be
200. For example, if (D(C) + D(T))/2 is used, an observation
starting at 20 miles (i.e. both D(C) and D(T) being 20 and
with (D(C) + D(T))/2 being similarly equal to 20) can allow
D(C) to be incremented to 380 before the total value of
(D(C) + D(T))/2 reaches 200. This reflects the more
realistic possibility that it is not just the money cost of
travelling 200 miles that has cut off the visits, but a
combination of both the time and the money involved; and
consequently, if the time is less than that needed to go 200
miles, then the money outlay could probably be significantly
higher before the combination forces the visits to zero.
And, of course, the point at which this occurs is given by,
and is thereby consistent with, the 1 to 1, or 2 to 1, or
whatever combination of weights is used in the
formulation of the variable.
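
The cut-off arithmetic can be checked with a few lines of
Python. The function below simply solves the linear
composite for D(C) at the cut-off value; its name and
parameters are hypothetical, and weights summing to one are
assumed.

```python
def max_money_miles(time_miles, cutoff=200.0, money_weight=0.5):
    """Largest D(C) before w*D(C) + (1 - w)*D(T) reaches the cut-off."""
    return (cutoff - (1.0 - money_weight) * time_miles) / money_weight


# Origin 20 miles from the park, cut-off of 200 composite miles.
print(max_money_miles(20.0))                          # equal weights
print(max_money_miles(20.0, money_weight=2.0 / 3.0))  # money weighted 2 to 1
```

With equal weights this gives the 380 miles quoted in the
text; with money weighted 2 to 1 the same origin reaches the
cut-off at about 290 money miles.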


RESULTS 


Without more evidence on the shape of the trade-off
function between time and money, the more conservative
linear form was pursued here. While any number of relative
weights might be used, two were selected to carry through
the analysis. The first assumes that the effects of time and
money on visit rate decreases are equal, with the variable
(D(C) + D(T))/2 consequently used. The second assumes that
money is twice as important as time, with the variable (2/3
D(C) + 1/3 D(T)) used. As the resultant demand curve does
not reach an intersection with the price axis, a value of
200 miles for the composite variable was taken as a cut off.
(As the curve was close to the axis, the resulting value
estimate was not very sensitive to this particular choice of
200.)

The several points on the demand curves were calculated
by incrementing D(C) while holding D(T) constant for each
contributing origin area for the hypothetically proposed
park. The area under the curves might then be taken as an
expression of the present or initial year's economic worth
of the recreation services that would be afforded by the
park. In this case the estimates were approximately $159,000
and $125,000 for the equal time-money weights and the double
money weighting respectively. Some projections over time
would need to be made, the variable costs subtracted and an
allowance added for non-user benefits, to arrive at a more
complete evaluation of the total net value. However, the
method appears to yield reasonably defensible estimates.


CORD STUDY TECHNICAL NOTE 5


POTENTIAL FUNCTIONS IN EVALUATING 
THE NEED FOR RECREATION FACILITIES 


J. H. C. Ross and G. O. Ewing


ABSTRACT 


Planning the provision of recreation facilities is
currently hampered by a relative lack of preceding research
and a marked absence of established standards. To evaluate
recreation opportunities provided by sites such as
provincial and federal parks in Canada the authors suggest a
method of quantifying each opportunity, a measure they call
the Opportunity Quotient, that can be applied to any
location.

The paper discusses the philosophy underlying the
Quotient and the problems its use entails. It treats the
Quotient from the assumption that competition for recreation
resources is best measured at the recreation sites
themselves, but it also offers an alternative formulation
based on assuming that the competition for resources is
exerted at the point of residence of each individual, and
that it is this force which drives one away from home to
seek recreation opportunity.

A practical example of using the quotient to assess
opportunities is given, based on the Windsor-Quebec Urban
Corridor project of Environment Canada.


MEASURING RECREATION OPPORTUNITY 


The combination of pressures upon limited physical
resources and increased public awareness of and desire for
leisure activities throughout the 1960's and 1970's has
resulted in great concern with recreation and provision of
facilities for recreation pursuits. Individuals and
governments are now actively involved with studying and
planning in these areas. The classical planning problems of
evaluating the present situation and determining the needs
and wants of the population have assumed new importance. In
the recreation field, however, the difficulty of coping with
these problems is amplified by the relative lack of
preceding research and the marked absence of established
standards.

Evaluation of the recreation opportunities provided by
recreation sites (such as provincial and federal parks in
Canada) is of current interest to the Outdoor Recreation and
Open Space Division of the Lands Directorate, Environment
Canada. To this end, a method of quantifying the opportunity
is suggested which, when applied, yields a measure called
the Opportunity Quotient. Insofar as the quotient can in
principle be calculated for any point in the country, the
spatial variation in the level of opportunity can be mapped.
This paper focuses on rationalising the formula used to
define the quotient.


THEORY 


Opportunity for recreation is obviously dependent upon
the relationship between the supply of recreation sites and
the demand for them. In fact, the ratio between a measure of
supply and a measure of demand can reasonably be called an
index of opportunity. But a crucial and most difficult
problem in defining a particular index is establishing a
means of effectively representing both factors in such a
fashion that values of the ratio for different places really
reflect their differing levels of recreation opportunity.
(Comments in Chapter XI indicate that some hold the view
that the measure need only be relevant to policy, and may
have nothing to do with "really" reflecting different levels
of opportunity.)

In urban areas, a commonly used measure of recreation
opportunity is the ratio of total city park acreage to total
city population. Such a ratio is often compared to some
accepted planning norm of park acres per thousand people
(see TN 26). Even at the city level, it can be argued that
this ratio fails to take into account variations in access
to a park as a function of distance from home to parks.
Clearly a family within 100 yards of a city park is likely
to make more use of it than a family whose closest park is a
mile or two distant. An additional limitation of the above
approach is the concentration on park acreage as a measure
of supply, to the exclusion of any other park
characteristics, such as the nature of a park's facilities.
At the scale of regional and national parks, the same
considerations of access and park facilities remain
relevant. Therefore, an index is required which
incorporates these elements, in addition to those usually
considered in measuring the supply/demand ratio.


MEASURE DEFINITION 


To begin, consider a single location. What is sought is
an index of the opportunity of people living at a location
to use public recreation facilities, expressed in terms of
the supply as measured by the number of facilities available
to them, the characteristics of these facilities and their
accessibility from that location. The index should include
a measure of demand for these same facilities, so that the
extent of pressure on, or competition for them, is known and
used to adjust the supply measure. In this paper, the only
facilities considered are parks.

An index of location i's supply provided by a single
park j, discounted by the distance between the park and i,
can be defined as A(j)/D(i,j), where A(j) is the
attractiveness of park j and D(i,j) is the distance from i
to j. Given that ratio, an obvious way to measure origin i's
total supply is to add together all the A(j)/D(i,j) ratios
for parks accessible to that origin (within some arbitrary
distance). (On problems with the convergence of the sums of
such ratios see TN 3.)

The multiplying factor park attractiveness is used in
the formula to imply that "something" about a park really
determines the effective supply of park-based activities to
the public. The more attractive a park to potential users,
the more likely people are to use it, and therefore the more
effective it is in adding to the supply people perceive. The
determination of a park's level of attractiveness is a
considerable problem in itself which is not pursued here. A
discussion of various methods of estimating park
attractiveness is to be found in Chapter III of this volume.
Frequently, though, simple variables which are assumed to be
closely correlated with park attractiveness, such as park
acreage or number of campsites, are used as expedient
surrogates of attractiveness, for lack of estimates of
attractiveness itself (see TN 9).

However, returning to the main theme, in defining a sum
of A(j)/D(i,j) ratios, something has been implied about the
relative importance of attractiveness compared to distance
in terms of the effect of these factors on supply. Assume
for the moment that the major determinant of attractiveness
is park size. As it stands, the implication is that one park
of say 200 acres at 1 mile from i contributes no more to i's
supply than another park of 2 million acres at 10,000 miles.
Or, put another way, two origins would be implied to be
equally supplied if one had a 200 acre park a mile away and
the other had a 2 million acre park 10,000 miles away. The
questionable validity of such an assumption leads one to
consider what the weights are which should be applied to the
size and distance terms to make the relative values of
different A/D ratios more "realistic".

In the context of the example the question is more
properly defined as: How do people trade off size against
distance when evaluating their access to supply? If
empirical evidence is available as to the nature of the
trade-off between attractiveness and distance the earlier
expression may be appropriately modified to give, for
example, A**a/D**d, or some other such combination of
transformations. For example, if the appropriate function
was A/D**2, then in the above example about two parks, the
smaller at one mile would contribute 200/1**2, i.e. 200, to
one origin's supply measure whereas the large park at 10,000
miles would now contribute only 2,000,000/10,000**2, i.e.
0.02, to the other origin's supply measure. Obviously,
vastly different conclusions are reached about the relative
supply to different places, depending on the relative
"weight" given to attractiveness and distance. Since the
assumption of any one set of weights is as arbitrary as the
assumption of any other, in what follows the function that
is added up to get a supply measure is written in the
non-committal form f(A/D).
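
The two-park comparison can be reproduced with the short
Python sketch below. The function name and its exponent
parameters a and d are hypothetical labels for the weights
in A**a/D**d; the acreages and distances are those of the
example in the text.

```python
def supply_term(attractiveness, distance, a=1.0, d=1.0):
    """Contribution of one park to an origin's supply: A**a / D**d."""
    return attractiveness ** a / distance ** d


small_near = (200.0, 1.0)           # 200 acres at 1 mile
large_far = (2_000_000.0, 10_000.0)  # 2 million acres at 10,000 miles

# Unweighted ratio A/D: the two parks contribute equally.
print(supply_term(*small_near), supply_term(*large_far))

# Distance squared, A/D**2: the distant park all but vanishes.
print(supply_term(*small_near, d=2.0), supply_term(*large_far, d=2.0))
```

Changing only the distance exponent moves the distant park's
contribution from 200 to 0.02, which is the sensitivity to
the choice of weights that the text describes.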

To this point there has been no consideration of the
effect of the presence of numerous users at the same park
site and the possible reduction in the individual's
opportunity to enjoy a park's facilities as the number of
competing users at a park increases. A park surrounded by
several nearby cities is likely to be frequently crowded,
and, because of the congestion, the kinds of benefits that
such a park is thought to provide are likely to be less
available than if the same park were close to only one small
town. Thus, the potential supply offered by a park j to an
origin i, defined as f(A(j)/D(i,j)), should be discounted
when the number of users at j, U(j), is large. The nature of
the relationship of crowding and supply unfortunately
requires empirical verification. Still, it seems plausible
that disutility and use are related so that the larger the
size of the park, S(j), and its facilities, the lower the
disutility associated with a given U(j), the relationship
being a positive, monotonically increasing S-shaped curve.
Regardless, a generalised definition of the potential
supply discounted by the number of competing users is:


(1) P(i,j) = f1(A(j)/D(i,j)) / f2(U(j)/S(j))


Equation 1 defines in some way the opportunity for
people at origin i to use park j. An estimate of their
opportunity to use all parks, called their "Opportunity
Ratio", is given as:


(2) OR(i) = Σ ( f1(A(j)/D(i,j)) / f2(U(j)/S(j)) )


WHERE the summation is on j=1,NP; and NP = the number of
parks within some arbitrary distance of i (see TN 3) or the
number of parks within the jurisdictional area to which the
planning is related. This statistic gives a measure of the
effective access which residents at i have to park
recreation facilities, and can, of course, be calculated for
all origins.
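
A minimal Python sketch of Equation 2 is given below. The
paper leaves f1 and f2 unspecified, so the default forms
used here (f1 as the identity, f2 as one plus the crowding
ratio) are assumptions made only so that the example runs.
The second helper rescales a set of ratios so that their
mean takes a chosen value, the standardisation taken up in
the paragraph that follows; all names are hypothetical.

```python
def opportunity_ratio(parks, f1=lambda x: x, f2=lambda x: 1.0 + x):
    """OR(i) for one origin, following the form of Equation 2.

    parks: list of (A, D, U, S) tuples for the parks within range of i,
           i.e. attractiveness, distance from i, users and park size.
    f1, f2: assumed functional forms, not specified in the paper.
    """
    return sum(f1(a / d) / f2(u / s) for (a, d, u, s) in parks)


def opportunity_quotients(ratios, mean=100.0):
    """Rescale ratios so that their average equals `mean`."""
    average = sum(ratios) / len(ratios)
    return [mean * r / average for r in ratios]


# Hypothetical data: two origins sharing the same two parks.
origin_a = [(200.0, 5.0, 1000.0, 200.0), (5000.0, 80.0, 4000.0, 5000.0)]
origin_b = [(200.0, 60.0, 1000.0, 200.0), (5000.0, 20.0, 4000.0, 5000.0)]
ratios = [opportunity_ratio(origin_a), opportunity_ratio(origin_b)]
print(opportunity_quotients(ratios))
```

With a mean of 100, a quotient of 120 reads directly as an
origin 20 percent above the study-area average.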

Given that exact forms of the functions f1 and f2 can
be specified, numerical values of OR(i) can be calculated
and plotted on a map and used to indicate the extent of
regional and local variation in access to park facilities.
Such a map has value to planners considering locations for
new parks. However, since these scores are dimensionless
ratios, it facilitates reading of the map if these ratio
scores are standardised around a convenient value, e.g. a
mean score normalized to 10 or 100. Using the latter value
for the mean enables an origin's percentage deviation from
the study area's average value to be directly read from a
map. Here such normalized quotients are referred to as
Opportunity Quotients.


DISCUSSION


The usefulness of the quotient depends, of course, on
the function in Equation 2 resembling reality in some sense
or other, and in the final analysis, the real level of
outdoor recreation opportunity for any origin is what the
people there perceive this level to be. The statistic,
therefore, ought to have the same spatial pattern of
variation as perceived recreation opportunity. The degree
of correspondence between the two depends largely on two
factors. Firstly, the general form of the functions in
Equation 2, in particular, must be realistic, i.e. the
variables in this equation must be the relevant ones and
they must be combined in a fashion that corresponds to
perceptual reality. Secondly, the weightings involved in f1
and f2 in Equation 2 must have empirical validity. This is
not the place to enter into a discussion of how to ensure
these requirements are satisfied. Nevertheless, the validity
of the map of Opportunity Quotients depends on the above
conditions being met.


AN ALTERNATIVE FORMULATION 


The above discussion of the Opportunity Quotient has
been based on the assumption that the competition for
recreation resources is best measured at the recreation
sites themselves, and that competition is a force which can
"hold people away" from a potential site. This is the
measure that is referred to as U(j) in Equation 2.

One can make an alternative assumption which treats the
competition factor in a different sense. In this case, one
assumes that the "pull" of resources is exerted at the point
of residence of each individual, and that it is this force
which draws one away from his home to seek recreation
opportunities. Because there can be no simple measure of
this competition, one must measure it through the use of a
surrogate, here chosen to be the population potential. This
statistic, long employed by geographers, distributes the
"pressure" exerted by a population over space, usually in a
negative exponential fashion.


The measurement of the outdoor recreation supply
available at any point i is performed as indicated in
Equation 2 above, except that the surrogate of park
attractiveness, A(j), used here is the natural logarithm of
park size (ln S(j)). The justification of this
transformation is that the strength of a person's perception
of the stimulus of park size is believed to be directly
related to the natural logarithm of the park size rather
than to the park size itself. (See TN 9 for references to
the literature.) This transformation has the effect of
giving relatively more weight to the smaller parks and
playing down the importance of the larger ones.

Under these assumptions, an equation analogous to
Equation 2 is written as:


(3) OR(i) = ( Σ ln S(j) / D(i,j)**d1 ) / ( Σ P(k) / D(i,k)**d2 )


WHERE NK = the number of population centres within some
arbitrary distance of a location (not necessarily a
city - just a point in space); the sum in the numerator
is for j=1,NP and that in the denominator for k=1,NK;


d1 and d2 = two constants, usually ranging between 1.0
and 2.5 but not necessarily equal;


and other terms are as previously defined.


One should note that this equation is for computing an
OR(i) for any arbitrary location i based on the cities and
parks that are located within range of i, regardless of
whether or not anyone lives at location i.

It is not relevant to speak of this measure as
indicating the response of "the people" at i to their
facilities, but rather the ratio reflects how a person at i
would "perceive" the availability of parks around him in
comparison to the "degree" that he would recognize that
there were people around him. The measure defined by
Equation 2 has the more "straightforward" behavioural
interpretation of indicating what the "effective" combined
opportunities to participate at any location appear to be to
a person at his given location i. The difference is subtle
but important!

It should be noted that the interpoint distances
(D(i,j), D(i,k)) utilized may be actual road distances
(where available), Manhattan distances (or other Minkowski
metrics), time distances or actual geographic distances. In
this study the authors have utilized the latter. In the
absence of firm knowledge about the values of d1 and d2 in
Equation 3, values of 1.0 have been used to develop a
real map.

Obviously, the statistic defined by Equation 3 may be
scaled in the same way as that defined by Equation 2, thus
leading to a quotient representing a deviation from a known
mean value.
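
Equation 3 can be sketched in Python as follows, with d1 and
d2 defaulting to the value of 1.0 used for the map. The
function name, argument layout and sample data are
hypothetical. The sketch assumes every distance is greater
than zero and every park size greater than one unit of area,
so that the logarithm is positive.

```python
import math


def opportunity_ratio_eq3(parks, centres, d1=1.0, d2=1.0):
    """OR(i) as the ratio of log-size park supply to population potential.

    parks   : list of (S, D) = (park size, distance from location i)
    centres : list of (P, D) = (population, distance from location i)
    """
    supply = sum(math.log(size) / dist ** d1 for size, dist in parks)
    pressure = sum(pop / dist ** d2 for pop, dist in centres)
    return supply / pressure


# Hypothetical location with two parks and three population centres.
parks = [(1_900_000.0, 150.0), (640.0, 12.0)]
centres = [(2_600_000.0, 40.0), (250_000.0, 15.0), (1_200.0, 3.0)]
print(opportunity_ratio_eq3(parks, centres))
print(opportunity_ratio_eq3(parks, centres, d1=2.0, d2=2.0))
```

Because the location need not be inhabited, the same
function can be evaluated over a regular grid of points to
produce the values that are then scaled and contoured.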


A PRACTICAL EXAMPLE 


The practical example discussed below was calculated
and mapped for the Windsor-Quebec Urban Corridor project of
Environment Canada's Lands Directorate (Lands Directorate,
1974). In this example the work was performed according to
the assumptions outlined in Equation 3. The sole change was
that the points for which the Opportunity Quotient was
calculated were assigned double subscripts to facilitate
calculation and mapping.

The data utilized in the project are (1) population
statistics incorporating all communities larger than 1,000
population situated within 500 miles of each of the points
for which a particular OR(i) is computed, and (2) the CORD
Study Facility Inventory statistics on federally or
provincially operated recreation sites (for details on these
data see Volume III).

In order that the competition of American
recreationists might be allowed for, the population of
each state was assigned to the state's capital, and this
population taken into account if it met the distance
constraint used. These population masses were then
incorporated in the calculations. After calculation and
scaling, the Opportunity Quotient was isoplethically mapped
(see Figure 1).

The contours on the map join points at which OR(i)'s
are equal: points at which people would perceive a constant
ratio of the type described earlier in relation to Equation
3. Inspection of the map produced reveals two obvious
patterns. The general influence of the American population
lying to the south and west, which exerts great pressure
over the study area, is most pronounced in the
Windsor-Chatham area, and decreases in strength to the east
and north. It is also evident in the Niagara Peninsula and
south-eastern Quebec. The influence of the population
concentrations near Toronto and Montreal is revealed by the
east-west extent of the poorly served areas adjacent to
them, and by the alignment of the "ridges" of opportunity
separating them. The generally high degree of opportunity
evident across the northern portion of the map reflects both
the lower population pressure and the greater supply of
recreation land provided by such parks as Algonquin, Mont
Tremblant, Papineau and Laurentides.


CONCLUSION


The measurement of an intangible such as one's
opportunity to participate in outdoor recreation is
extremely complex and is hampered by both practical and
theoretical problems. (The problem alluded to here is the
one for which a solution is sought in TN 29 and should not
be confused with the inventory problem discussed in TN 16.)
The practical problems largely relate to data collection
costs, particularly if variables such as park attractiveness
have to be estimated using behavioral data on park choice
(see Chapter III for a discussion of various methods of
estimating park attractiveness). The more serious problems
are theoretical ones and centre around the problem of
specifying a functional form for the Opportunity Quotient
and of defining accurate estimates of its parameter values,
such that there is a close correspondence between the value
of the quotient and people's perceived recreation
opportunity.

Two examples of alternative forms of the Opportunity
Quotient based on rational argument have been suggested.
Although only one has been mapped, it seems likely that
there would be significant differences between two maps
based on different forms of the quotient. Therefore the
final selection of appropriate formulae will require them to
be validated against perceptual information on recreation
opportunities. To compound matters, serious methodological
problems exist in trying to measure such perceptions.
However, without such validation, the real meaning of any
Opportunity Quotient will remain in doubt.


CORD STUDY TECHNICAL NOTE 17


A METHOD OF ALLOCATION 
OF RECREATION SUPPLY TO URBAN CENTRES 


W. Acar
(Introduction and Concluding Sections)
J. Beaman


ABSTRACT 


In recent years, the Ontario Government has been
increasingly concerned with providing the public with
opportunities for outdoor recreation on an equitable basis.
Thus, there was a need for a simple model which could detect
the areas of potential overload in the Province's recreation
supply system, given the past trends of population and
existing and planned supply.

Because of a strong interest in short-trip recreation,
it was proposed that an allocation model be developed to
study the opportunities for day trips and weekend trips from
any of Ontario's urban centres within K-hour travel zones of
the centres, K being 2 hours for day use and 3 hours for
weekend trips.

It was decided, for evaluation purposes, that the only
supply to be considered effectively available to an
urban-centred region was the amount lying inside its K-hour
travel zone, and that the populations of the various urban
centres compete for the use of the same opportunities in
those areas where the travel zones overlap. Competition for
a particular unit of supply was hypothesized to be defined
by the pressures that centres would exert on "the supply
they potentially have" and by a function of the travel times
to the region where the supply for which these centres
compete is located.

The paper presents the assumptions just stated and
describes the computational procedures developed.

It describes the computer algorithm which was prepared
to allocate the supply of recreation opportunities in each
destination region to urban centres in a way which is
consistent with the competition and other effects
hypothesized. The algorithm allows the generation of a
measure of the supply per capita effectively available to
the residents of the different urban centres.

An example that can be computed manually is presented,
as well as the general mathematics of the simulation of the
competition that takes place.

The computer programs developed have been tested
extensively. It is equally possible to use them to study
the allocation of supply using as few as 7 origins and 12
destinations or for much larger problems (e.g., 25 by 156).
However, details on the programs are not provided. The
reader is referred to other sources for this material.


INTRODUCTION 


The model described below was developed to answer a
number of priority planning questions. These questions could
not await the completion of more sophisticated modelling
efforts in which Ontario is also involved (see References 33
and 34). For example, one planning question is the issue of
whether more publicly supplied recreation opportunities
should be placed near Toronto or whether such facilities
should be placed in other areas which appear to be deficient
in opportunities for recreation activities of a given type.

A continuing emphasis in Ontario has been the setting
of objectives for the Outdoor Recreation Program (see
Reference 30). Given the proposed objectives of the
Province, it was possible to visualize an allocation of
facilities which embodied these objectives. One possible
allocation evaluation scheme being considered by Ontario is
defined by the model presented in this paper. This scheme,
by allowing examination of the existing distribution of
user-days potentially available to participants in an
activity, is designed to reflect the level at which all
people in the Province of Ontario who desire to participate
have access to particular kinds of facilities.


Rationale 


A basic premise in the model presented is that
participation rates can be used to calculate the number of
participants in an activity that an area "should" generate.
"Should" is used here to mean that it is possible to
calculate the number of participants an area would generate
if it were supplied with facilities in a way that reflects a
standard that would be acceptable for all areas of the
Province. For example, the number of participants which
"should" be generated could be based on a standard defined
using differentials computed by the analysis of variance
(see TN 12).

No matter what formula is employed to calculate the
number of participants an area "should" generate if
appropriately supplied with opportunities, the approach
proposed is to see how the number of potential participants
in an area is related to the available supply of facilities.
Ultimately, the object of this analysis is to see what
pressure achieving a certain "standard" of participation
would put on facilities available to an area within a
particular distance of it.

Of relevance to the objective, then, is the principle
of the K-hour travel zone. Because of interest in equity of
opportunities for short-trip recreation, it is reasonable
that only supply within two hours one-way travel from an
urban centre be considered as available to that centre. In
other words, for short-trip recreation, supply is considered
here to be "available" only if a destination is within two
hours of an origin, one way.

Given that K-hour travel zones are defined, allocation
modelling would be simple if it were not for the fact that
different urban centres compete for use of the same supply.
Certain factors recognized as affecting park use are not
considered in the allocation model. However, the works of
Cheung (TN 1), Cesario (TN 4), Ross, et al. (TN 2 and TN 5),
or Ellis and Kerr (Reference 33) do consider the effects of
alternative facilities and/or attractivity of facilities on
park use (see TN 1, TN 3, TN 4, TN 11, TN 3D and sources
cited therein). Basically, in formulating an allocation
model, it was considered wise to use a simple theory that
did not attempt to consider elaborate interactions between
supply units. A more complicated model might have produced
more accurate results. But the formulation of the allocation
model had the immediate objective of supplying information
to decision makers, as well as being a contribution to a
growing literature on supply allocation models. The
immediate need for a decision-aiding model made it essential
to place a limit on theoretical embellishments to the model.


Modelling Assumptions 


One way to develop a model is to state assumptions and
then show how a model may be built on them. That is the
approach taken here.


Assumption A1: The residents of an urban centre should not
need to go beyond the K-Hour-Drive Isochrone
(i.e., outside the K-Hour Travel Zone) to
receive a given level of service for day-use
participation in a given activity.


This assumption embodies the policy that there is an
intention to provide facilities for certain activities at a
certain level of service within a certain distance of most
residents of the cities of Ontario. This assumption is an
assertion that Ontario, by using the model, is investigating
the implications of accepting the view that its citizens
should recognize no disutility in travelling up to two hours
one way and infinite disutility in travelling any further
than this basic distance. This statement, of course, only
refers to travel for day-use participation. The Province may
indicate a different number of hours of travel as allowable
for different activities and for overnight use of an area
for a given activity.


TN 17 CHAPTER VIII page 665 


Assumption A2: The population is homogeneous, across the
province, with respect to those
socio-economic variables affecting recreation
participation.


This assumption is made so that a simple approach may be
taken to determining the number of participants that "should
be" generated by an area. The analysis of variance approach
to calculating participation, alluded to earlier, allows one
to associate different "ideal" or "standard" participation
rates with different ages, sexes, etc. But examination of
participation in Saskatchewan, where a sophisticated formula
was used to take into account the variation in
socio-economic characteristics of the population in various
cities, showed that only small differences, in the order of
10 percent at the largest, resulted from using this
elaborate formulation rather than a simple participation
rate. Given that one does not expect to be within 10 percent
in this (or any) exploratory modelling effort, there is
little merit in introducing such complexity into a model.


Assumption A3: The supply of "facilities" for the activity 
being considered is homogeneous with respect 
to quality.


This assumption is also designed to facilitate
competition modelling. But it also has distinct policy
implications. It is readily recognized that there are better
and poorer hunting areas, better and poorer picnicking
areas, and that the problem of assessing quality, or what
some authors call attractivity (see TN 1, TN 2, TN 4 and TN
5), is a problem which is receiving substantial
consideration (see Reference 24).

Basically, the theoretical concern is that some
participants in an activity receive a higher benefit from
going to a good facility than others receive from going to a
poor facility. The homogeneity assumption simply says, from
a policy perspective, that all facilities will be
"recognized to be" of equal utility for the activity of
concern. Thus, if this assumption is used in a given
analysis, one is choosing not to acknowledge the difference
between facilities as long as they meet the Province's basic
standards for the given activity. At this point, Assumption
A3 is required to allow one to carry out modelling. Only
when accurate attractivity measures have been assessed and
adequately verified will it become possible to use these
factors (given that implications of their use remain
consistent with policy) in simulating the competition and
resulting allocation.


Assumption A4: The concept of a "resistance to travel" may
be used to define a function that indicates
how areas, for which different population
centres compete, should share the supply in
destination units which they have in common.


This assumption is an interim approximation to reality
accepted until some more appropriate approach can be taken.
Basically, the general statement embodied in Assumption A4
is included here to serve as an introduction to a more
detailed enunciation of a partition formula indicating how
origins share supply units for which they compete; this
enunciation follows later in the text.


Assumption A5: It is possible to treat Southern Ontario as
a limited number of supply units (destination
units; e.g., 12 or 156) and a limited number
of origin units (e.g., 7 or 25).


The computational procedures that are defined in the
discussion of the competition (and allocation) model involve
dividing southern Ontario into origin and destination units.
Supply is associated with destination units (clusters of
townships), and population is associated with origins
(cities and surrounding areas). This assumption embodies the
idea that the population of southern Ontario is so highly
concentrated in cities that for the purpose of calculating
distance (and other manipulations that are described)
populations may be considered to be concentrated at given
points. It is also assumed that all the supply in a given
unit is available at a point.

If one has dealt with problems where this kind of
approximation has been used, it will be realized that where
population units are concentrated in cities this kind of
assumption works particularly well. Where the supply tends
to be uniformly distributed within a supply unit and the
distance functions do not drop off in too curvilinear a
manner over a supply unit, it may also be expected that the
approximation does not result in problems. Thus, it may be
understood that this assumption, as with other assumptions
already stated, involves reasonable approximations to
"reality" which are being made to operationalize a model
that can be modified in later, more elaborate calculations.


Model Description 


Consider counties, c, with a supply of facilities
yielding up to A(c) opportunities. Let S(c,u,i) be the
amount of A(c) allocated to a centre, u, in stage
(iteration) i of the allocation process. Also consider
that the following two tables or matrices are specified:
D(c,u) is an element of the array that gives distances from
supply units c to a centre u. P(c,u) is an element of the
array that gives the percent of the user-days of supply in c
that is within the K-hour travel zone of u.

Now it is claimed that an allocation consistent with
the assumptions already given is achieved by the iterative
procedure which follows. For each "i" (starting at i=0), one
defines an equation for prorating supply in each c to each
u:

(1)  $$\mathrm{PRO}(c,u,i) = \frac{P(c,u)\,p(u,i)\,/\,f(D(c,u))}{\sum_{u'} P(c,u')\,p(u',i)\,/\,f(D(c,u'))}$$

The sum in the denominator only serves to normalize the
proration so that all the supply of c is associated with
some u (so the sum of PRO(c,u,i) over u equals 1 for all
c's). In the above, p(u,i) is the pressure that a population
places on the supply available to it; it is the number of
participants per unit of supply allocated, defined by:

(2)  $$p(u,i) = \frac{(\text{population of } u) \times (\text{per capita participation rate applied to all } u\text{'s})}{\sum_{c} \mathrm{PRO}(c,u,i)\,A(c)}$$

where the sum is on c.

The sum in the denominator of Equation 2 serves the same
kind of normalizing function as the denominator in
Equation 1. Since S(c,u,i) = PRO(c,u,i) A(c), Equation 2 can
be rewritten using POP(u) for the population term and PPR
for the per capita participation rate (which, as stated
under Assumption A2, may be used) as:

(3)  $$p(u,i) = \frac{\mathrm{POP}(u) \times \mathrm{PPR}}{\sum_{c} S(c,u,i)}$$


Now, Equation 1 indicates that the proportion of the
supply in c allocated to u is inversely proportional to some
function of distance (or travel time, should times be stored
in the D(c,u) matrix), so Assumption A4 has been built into
the allocation. Inclusion of P(c,u) and p(u,i) in Equation
1 indicates that supply is prorated in direct proportion to
these variables. So the supply in c is allocated to u in
proportion to the amount of that supply included in each u's
K-hour travel zone, conditional on the pressure that each
population centre places on the supply available to it.

If all u's placed the same pressure on the supply
available to them, one can see that p(u,i) would cancel in
Equation 1, resulting in:

(4)  $$\mathrm{PRO}(c,u,i) = \frac{P(c,u)\,/\,f(D(c,u))}{\sum_{u'} P(c,u')\,/\,f(D(c,u'))}$$

which indicates that each u would receive a share of the
supply in c that would reflect how much of c's supply was in
u's K-hour travel zone and how far c was located from u in
comparison to c's distance from other u's.

At this point the reader will easily recognize that the
kind of allocation being described is that discussed under
Assumptions A1 and A4. The effect of distance on the use of
facilities is recognized. But if the population per unit of
supply is high for some u, so that the pressure on its
facilities is high, this u has "priority" over low-pressure
u's in being allocated supply for which there is
competition. However, it may also be recognized that to
calculate the PRO(c,u,i)'s, it is necessary to know the
p(u,i)'s (the pressures) and vice versa. But one does not
know either. Still, one can say that an initial estimate of
the pressures can be obtained by assuming that all u's get
all supply within their K-hour travel zone. So according to
Equation 3:

(5)  $$p(u,0) = \frac{\mathrm{POP}(u) \times \mathrm{PPR}}{\sum_{c} A(c) \times P(c,u)}$$

where the PRO(c,u,i)'s have been set to 1, reflecting the
approximation described above.

Now, the zero value of the subscript i in p(u,0)
indicates that the values defined by Equation 5 are initial
estimates of the p(u,i)'s. If these are substituted into
Equation 1, one may write the equation for the first
iteration:

(6)  $$\mathrm{PRO}(c,u,1) = \frac{P(c,u) \times p(u,0)\,/\,f(D(c,u))}{\sum_{u'} P(c,u') \times p(u',0)\,/\,f(D(c,u'))}$$


Thus one gets estimates of the PRO(c,u,1) and these may
be substituted into Equation 3 to get revised estimates of
the p(u,1), which can be substituted into Equation 1 to get
revised estimates of the PRO(c,u,2). This procedure can
obviously be repeated indefinitely, with each cycle of
computation, i, being called an iteration and resulting in
values that can be used in the (i + 1)th iteration:

(7)  $$p(u,i+1) = \frac{\mathrm{POP}(u) \times \mathrm{PPR}}{\sum_{c} \mathrm{PRO}(c,u,i)\,A(c)} = \frac{\mathrm{POP}(u) \times \mathrm{PPR}}{\sum_{c} S(c,u,i)}$$

where the sums are on c.

(8)  $$\mathrm{PRO}(c,u,i+1) = \frac{P(c,u)\,p(u,i+1)\,/\,f(D(c,u))}{\sum_{u'} P(c,u')\,p(u',i+1)\,/\,f(D(c,u'))}$$


So, Equations 7 and 8 are the basic computational
equations for the allocation. The aim is, by repeated
application, to move from the approximation originally made
to the equilibrium values PRO(c,u,*)'s and p(u,*)'s that are
no longer approximations but actually give the population
pressure on facilities that results when the supply in each
c has been properly allocated. These pressures indicated by
p(u,*) then provide information to planners or managers
about the relative deprivation of some areas compared to
others. Of course, the above formulas can be written by
replacing (i) or (i + 1) by the number (*) of the
equilibrium or final iteration.
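
The iteration defined by Equations 5, 7 and 8 can be sketched in a few lines of Python. This is an illustrative sketch only, not the program referred to in this paper: the function name `allocate`, the argument layout, the default distance function f(D) = D, and the stopping rule (largest change in any pressure below `tol`) are all assumptions made for the example. P(c,u) is passed as a fraction rather than a percent, and every centre is assumed to have some supply inside its travel zone (otherwise the pressure is undefined).

```python
def allocate(A, P, D, pop, ppr=1.0, f=lambda d: d, tol=1e-9, max_iter=500):
    """Iterate Equations 7 and 8 until the pressures stop changing.

    A[c]    -- supply (e.g. user-days) in destination unit c
    P[c][u] -- fraction of A[c] inside the K-hour travel zone of centre u
    D[c][u] -- distance or travel time from c to u (must be positive)
    pop[u]  -- population of centre u
    Returns (pressures p[u], proration shares pro[c][u]).
    """
    units, centres = range(len(A)), range(len(pop))
    # Equation 5: every centre is first credited with all supply in its zone.
    p = [pop[u] * ppr / sum(A[c] * P[c][u] for c in units) for u in centres]
    pro = []
    for _ in range(max_iter):
        # Equation 8: prorate each unit by zone coverage, pressure and distance.
        pro = []
        for c in units:
            w = [P[c][u] * p[u] / f(D[c][u]) for u in centres]
            total = sum(w)
            pro.append([x / total if total > 0 else 0.0 for x in w])
        # Equation 7: recompute pressures from the supply just allocated.
        new_p = [pop[u] * ppr / sum(pro[c][u] * A[c] for c in units)
                 for u in centres]
        change = max(abs(a - b) for a, b in zip(new_p, p))
        p = new_p
        if change < tol:  # assumed convergence criterion
            break
    return p, pro
```

A different distance function can be tried by passing, for instance, `f=lambda d: d ** 2`, which is one way to examine how sensitive an allocation is to that choice.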

A manager, by looking at a series of pressure values,
say 7, 5, 4.5, 4.4, 4, 3.8, 3.7, could conclude that there
was a definite need to put in facilities to increase the
opportunities available to the area with a pressure of 7.
The manager may, however, wish to have the allocation
program run a number of times with different alternative
development plans to see which results in the most desirable
pressure values or which one results in an acceptable set of
pressures at a minimum cost. Incidentally, such runs would
be made using projected population so that the solutions
(pressures) arrived at would be appropriate to the time when
facilities were actually in place.


An Example 


Some readers may find the preceding easier to follow if
given an example. So before going on to cover some special
points, consider two urban centres A and B with populations
100,000 and 50,000 respectively. Assume that these centres
are served by three supply areas, each offering 1,000,000
user-days of activity, in the way indicated in Table 1. Also
for the sake of simplicity assume D(2,B) = 2D(2,A), and for
convenience use MUD to indicate millions of user-days
(1,500,000 user-days = 1.5 MUD).

As described earlier, to begin allocation one allocates
to both A and B all supply that they could possibly have a
claim to: area 1 is entirely allocated to A, area 3 to B,
and area 2 is allocated 100% to A and 50% to B (that is, it
is over-allocated). So, assuming PPR = 1 and expressing
supply in thousands of user-days, initial pressure estimates
are:


p(A,0) = 100,000/2000 = 50.00
p(B,0) = 50,000/1500 = 33.33


and substituting these into Equation 1 (or using Equation
8), with f(D) = D so that the term for B in area 2 is
0.5 x 33.33/2 = 8.33:


PRO(A,2,0) = 50.00/(50.00 + 8.33) = .857

PRO(B,2,0) = 8.33/(50.00 + 8.33) = .143


The other PRO( , , )'s need not be calculated because, the
way the example was set up, they are equal to one or zero
depending on which u the supply being considered has been
totally allocated to.

TABLE 1


PERCENT OF SUPPLY IN THREE REGIONS
INCLUDED IN THE K-HOUR TRAVEL ZONES
OF TWO CITIES


                 Supply Areas

                 1      2      3
Cities     A    100    100      0
           B      0     50    100


Now, proceeding to iteration one using the initial values
that have been established and using Equations 7 and 8, one
finds for iteration 1:

p(A,1) = 100,000/1000(1 + .857) = 53.85
p(B,1) = 50,000/1000(1 + .143) = 43.74

The term for B in the proration of supply area 2 is then
0.5 x 43.74/2 = 10.94, so that

PRO(A,2,1) = 53.85/(53.85 + 10.94) = .831

PRO(B,2,1) = 10.94/(53.85 + 10.94) = .169

Carrying out a second, third, etc. iteration does not
change the pressures much. About 1.83 MUD are allocated to A
and 1.17 to B, meaning, in the general sense, that the
distance-weighted pressure A exerts on the shared supply is
about 5 times as high as that exerted by B.
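
The arithmetic of this example can be traced with the short Python sketch below. It is a hand-check under stated assumptions, not the authors' program: supplies are in thousands of user-days, PPR = 1, f(D) = D with D(2,A) = 1 and D(2,B) = 2, and the names `trace`, `W_A` and `W_B` are introduced only for illustration. Under these assumptions the first pass gives A a share of about .857 of supply area 2 and the second about .831; the figure of 10.94 used in the proration corresponds to a pressure for B of about 43.7 multiplied by the weight 0.5/2.

```python
# Two-centre example; supplies are in thousands of user-days (an assumption).
POP_A, POP_B, PPR = 100_000, 50_000, 1.0
# Weight P(c,u)/f(D(c,u)) for supply area 2, assuming f(D) = D and
# D(2,B) = 2 D(2,A): A has 100% of area 2 in its zone, B has 50%.
W_A, W_B = 1.0 / 1.0, 0.5 / 2.0


def trace(iterations=20):
    """Return (share_a, supply_a, supply_b, p_a, p_b) for each iteration."""
    # Initial pressures: each centre is credited with all supply in its zone.
    p_a, p_b = POP_A * PPR / 2000.0, POP_B * PPR / 1500.0
    rows = []
    for _ in range(iterations):
        # Share of supply area 2 prorated to A (B receives the remainder).
        share_a = W_A * p_a / (W_A * p_a + W_B * p_b)
        supply_a = 1000.0 * (1.0 + share_a)          # area 1 plus share of 2
        supply_b = 1000.0 * (1.0 + (1.0 - share_a))  # area 3 plus share of 2
        p_a, p_b = POP_A * PPR / supply_a, POP_B * PPR / supply_b
        rows.append((share_a, supply_a, supply_b, p_a, p_b))
    return rows


if __name__ == "__main__":
    for row in trace(5):
        print("share_a=%.3f supply_a=%.0f supply_b=%.0f p_a=%.2f p_b=%.2f" % row)
```

Printing a few rows shows how quickly the shares settle, which is the behaviour the text describes when it says further iterations do not change the pressures much.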

When, by using the allocation approach, it has been
found that there is a 5 to 1 pressure differential, the
question is: So what? If the area with high pressure is
still adequately supplied with facilities according to some
standard then no action is required. If it is not, then as
suggested elsewhere in this paper, other supply may be
introduced into the system in any number of ways and the
resulting allocations tested. However, in this case it is
obvious that to lower the pressure at A without also
lowering B one should add to supply area 1. Still, if supply
area 1 is near urban land it may be cheaper to add sites to
area 2 (if land there is cheaper).


Technical Points About Improvements to and Possible
Modifications to the Model Defined


The preceding discussion has not pursued a number of
points of importance in actually using the model on problems
less trivial than the example. Rather than pursuing details
further here, the reader is asked to note that the
program and data listing can be obtained. Inquiries about
the listing or about making runs using the algorithm should
be directed to:


Supply Allocation Recreation Analyst,
Policy Coordination Secretariat,
Room 6508 Whitney Block,
Queen's Park,
Toronto, Ontario.


The Ontario Ministry of Natural Resources is interested in 
learning about potential users of the algorithm and in 
assisting users in solving problems associated with use of 
the model.

The kinds of issues that arise in actually using the
model are numerous. A few of the important ones which the
reader may wish to note are: (1) A "convergence criterion"
must be specified so that iterations can be stopped when a
solution is good enough. For this and other technical
matters (e.g., the necessity of intra-iterational loops),
see Reference 1. (2) By merely calculating allocation
based on the factors in Equations 1 and 3, there is no
guarantee that some areas will not be above their saturation
limit, in terms of some standard of a maximum number of
persons per unit of recreation supply. Correcting for this
involves reallocating users in a way consistent with the
general procedure. Whenever the "load" of any one c is
larger than allowed, the additional people can be
"channelled" to other c's. (For further information on this
point, see References 1, 33, and 34.) (3) A distance
function must be chosen.

Regarding the latter, it should be noted that
sensitivity to change in the distance function is relevant
in considering resistance of distance as an impediment to
travel. Runs of the allocation model have been made that use
different distance functions. These runs have produced
measures of allocation with only minor differences. Thus,
the model has been found to be only mildly sensitive to
different distance functions. This mild sensitivity can be
understood by noting the similarity of a variety of
functions of distance over a broad range of distances (see
Figure 1 of TN 14, and Reference 34).


CONCLUSION 


The method of stating assumptions that define a model
that is used to aid analysis of a planning problem has not
gained wide acceptance in planning outside of highly
quantitative planning circles. However, if it is recognized
that in this paper the object of defining a simulation model
is to use it to generate a measure of goodness of allocation
of the supply of facilities for participating in an
activity, then the merits of the exercise may be clearer.

Care should be taken not to reject this approach
because of objections to some modelling assumptions. The
assumptions postulated do not necessarily define a
behavioural system, but rather a method of measuring
allocation that is consistent with the proposed objectives
of Ontario. It must be recognized that whether or not people
behave in a certain way is not fundamentally the point at
issue; assumptions that reflect objectives or policy should
not be confused with behavioural assumptions.

A method of deriving a useful measure of pressure on
recreation supply has been defined. There is a need to be
concerned about how assumptions used in deriving the measure
may be interpreted. If this simulation of competition is an
appropriate yardstick against which Ontario may measure the
opportunities it provides to different groups of its
citizens, it is imperative that the user understand that
yardstick. Accepting the model may result in accepting a
supply of opportunities at a certain level as good, whether
or not people use these opportunities. (That the
opportunities or facilities are under- or over-used when an
allocation is good may only reflect the fact that people do
not behave as if travel up to K hours has no disutility.)

The relevance of policy and its dominance over
behavioural considerations in this formulation is brought
out clearly when one reviews the development of the model's
definition. An arbitrary time of driving, namely two or
three hours, is accepted as having no disutility. No
particular weights that degrade a visit are introduced to
suggest that current policy might value opportunities at
one-hour distance over service near the desirable limit of
travel. On the other hand, examination of the supply
framework used here (see TN 16) shows that the Province of
Ontario is seriously considering a policy that takes into
account the standard work week and degrades supply that is
primarily available only during the work week. The model is
more a policy planning model than a model for predicting.
But it is a quite acceptable policy planning tool, given the
state of the art in outdoor recreation planning and the
equalitarian objectives of Canadian governments.


CORD STUDY TECHNICAL NOTE 23


SENSITIVITY ANALYSIS
OF LAND USE ALLOCATION MODEL


H. K. Cheung and J. Auger


ABSTRACT 


It has become increasingly apparent that the
sensitivity of linear programming solutions to the
parameters defining a given allocation problem is a weakness
of the approach. This paper presents the results of a
sensitivity analysis performed on a linear programming land
use allocation model. In the problem treated, the objective
function, defined as the total profit received from single
or multiple use of four hypothetical forestry lands, is
first maximized. Then parametric programming is used to
investigate the effect of coefficient changes on the optimal
solution.

The utility of having a set of alternatives in a
planning perspective is emphasized throughout. In
particular, the example pursued points out the following
considerations about the solution that was initially found:


1. A shift from multiple use to single use results
if there is a 3 percent change in one of the
profit coefficients.


2. No great loss in total profit results if a
certain activity not in the basis is
introduced.


3. A small change in the opportunity costs used to
determine the price for using a unit of an
outdoor recreation activity causes a change of
policy.


INTRODUCTION 


Over the years a steady increase in the use of our
natural resources for recreation purposes has resulted in a
large amount of literature dealing with estimating use, and
valuing benefits, of outdoor recreation sites. Many of the
quantitative techniques used to estimate recreation use and
analyse outdoor recreation data in general have been drawn
from the realm of statistics. These include regression
analysis, factor analysis, analysis of variance, cluster
analysis, and other multivariate methods.

Relatively recently, an optimization technique, linear
programming, has been applied to analyse outdoor recreation
data. Meir (see Reference 48) developed a linear programming
model which incorporates in a mathematical framework many of
the factors relevant to the planning of recreation land
acquisition expenditures. Tadros and Kalter (see Reference
58) worked out a spatial allocation model for projecting
outdoor recreation demand which takes the form of a linear
programming solution. Manning (see Reference 46) dealt with
multiple use of forest land in a linear programming context.
Saitta and Schmedemann (see Reference 54) used linear
programming to maximize the number of new visitors to a park
on a peak weekend. However, none of these papers dealt with
a very important aspect of linear programming, which is
sensitivity analysis of the input coefficients used.

So, it is the purpose of the present paper to report on
the results of a sensitivity analysis performed on a
modification of Manning's multiple-use allocation model.

It should be recognized that once a solution to a
linear programming problem has been determined, the cost of
sensitivity analysis compared to the cost of finding the
solution is usually small.


A LAND USE ALLOCATION MODEL 


Consider four pieces of forest land, each of which can
be used for one or more different purposes: (1) timber
production, (2) agricultural production, (3) recreation, and
(4) multiple use. The assumptions concerning the use of the
lands are: (1) the lands are already in public ownership,
and (2) multiple uses (recreation and forestry use) are
compatible.

Information that is known about the lands and various
products is as follows:


(1) The production capabilities (Table 1).

(2) The resource availability for production
    (Table 2).

(3) The prices of agricultural and forestry
    products (Table 3).

(4) The prices of recreation products (Table 4),
    derived by using opportunity cost as defined
    by Equations (A.2) to (A.4) in the Appendix.

(5) The resource requirements of each activity
    (Table 5).


The objective is to determine the production levels of the
activities (uses), within the limitations of the resources,
such that total profit is a maximum.

Utilizing the information presented in Tables 2 and 5,
one can formulate the linear programming problem as follows:


\[
\text{Maximize } CX \quad \text{subject to } AX \le b, \quad X(j) \ge 0, \quad j = 1, \dots, 16
\]


WHERE C = the profit coefficient vector as defined in the
          ninth column of Table 5.

      X = the activity vector as defined in the first column
          of Table 5.

      A = the activity coefficient matrix as defined in the
          fifth and sixth columns of Table 5.

      b = the requirement vector as defined in Table 2.

The problem formulated above was solved by the Simplex
method (see Reference 26). The solution and the levels of
resources used are summarized in Table 6. The maximum total
profit obtained is $3,462 per annum. The policy that effects
this maximum total profit is to utilize all 80 acres of land
B for white spruce production and camping; all 268 acres of
land C for wheat production; 1042 acres of land D for poplar
production and hunting; and not to use land A at all.
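The reported policy can be checked by hand against the per-acre figures of Table 5. The sketch below is not the Simplex computation itself; it only evaluates total profit and labour use for the three activities in the reported basis. It assumes that the labour requirement of a multiple-use activity is the sum of its two component rows in Table 5; the dictionary layout and the function name are illustrative.

```python
# Per-acre profit ($) and labour (man-days) for the activities in the
# reported optimal basis, read from Table 5. For multiple-use activities
# the two labour rows are summed (an assumption about how the rows combine).
plan = {
    "X8":  {"profit": 1.44,  "labour": 0.01 + 1.00, "acres": 80},    # land B
    "X10": {"profit": 10.00, "labour": 0.25,        "acres": 268},   # land C
    "X16": {"profit": 0.64,  "labour": 0.03 + 0.50, "acres": 1042},  # land D
}


def totals(plan):
    """Return (total profit, total labour) for a land-use plan."""
    profit = sum(a["profit"] * a["acres"] for a in plan.values())
    labour = sum(a["labour"] * a["acres"] for a in plan.values())
    return profit, labour


total_profit, total_labour = totals(plan)
print(f"total profit = ${total_profit:,.2f} per annum")
print(f"labour used  = {total_labour:.2f} man-days")
```

Under these assumptions the profit agrees with the reported $3,462 to the nearest dollar, and labour use sits at the 700 man-day limit once the acreage of land D is rounded to 1042.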
TABLE 1


PRODUCTION CAPABILITIES PER YEAR,
PER ACRE OF FOUR HYPOTHETICAL LANDS*


Use            Product          Production of Land
                                A         B         C         D

Timber         White Spruce     0.56      0.40      0.70      -
               Poplar           -         -         -         1.00

Agriculture    Barley           7.00      -         -         -
               Potato           -         105.00    -         70.00
               Wheat            -         -         20.00     -

Recreation     Hunting          11.00     -         -         3.00
               Camping          -         60.00     -         -
               Fishing          -         -         81.00     -

Multiple-Use   White Spruce     0.56      0.12      0.70      -
               Poplar           -         -         -         1.00
               Hunting          11.00     -         -         3.00
               Camping          -         20.00     -         -
               Fishing          -         -         81.00     -


* These production capabilities were determined using the
  Canada Land Inventory method. Timber products are
  measured in cords, agricultural products in bushels
  except potato, which is measured in hundredweights, and
  recreation "products" in visitor-days.

- = zero production.


TABLE 2


RESOURCE AVAILABILITY FOR PRODUCTION
OF THE FOUR HYPOTHETICAL LANDS


Resources      Limits on Resources

Capital        $7,000.00
Labour         700.00 man-days
Land A         300.00 acres
Land B         80.00 acres
Land C         268.00 acres
Land D         1,046.00 acres

TABLE 3


MARKET PRICES OF TIMBER
AND AGRICULTURAL PRODUCTS


Product          Price

White Spruce     $5.60 / cord
Poplar           $1.34 / cord
Barley           $1.15 / bushel
Wheat            $1.25 / bushel
Potato           $1.25 / hundredweight


Sources: (1) G. Manning, op. cit. (1971).
         (2) New Brunswick Department of Agriculture and
             Rural Development, AGRICULTURAL STATISTICS, 1971,
             Fredericton, New Brunswick.



TABLE 4


OPPORTUNITY COSTS
FOR RECREATION USE OR MULTIPLE-USES
OF THE FOUR HYPOTHETICAL LANDS


Land    Use                          Opportunity
                                     Cost, $

A       Hunting                      0.47
        White spruce and hunting     0.36
B       Camping                      2.17
        White spruce and camping     1.56
C       Fishing                      0.26
        White spruce and fishing     0.06
D       Hunting                      4.95
        Poplar and hunting           4.30

TABLE 5


GROSS RETURN, TOTAL COST AND PROFIT OF PRODUCTION
PER ACTIVITY PER ACRE OF THE FOUR HYPOTHETICAL LANDS


Land   Activity,   Product            Gross           Capital
       acre                           Return(1), $    Cost(2), $

A      X1          White Spruce         3.14            3.00
       X2          Barley               8.05           12.00
       X3          Hunting              5.14            3.00
       X4          Multiple-Use:        7.14
                     White Spruce                       1.00
                     Hunting                            2.00
B      X5          White Spruce         2.24            1.00
       X6          Potato             131.25          175.00
       X7          Camping            130.20           30.00
       X8          Multiple-Use:       31.84
                     White Spruce                       0.20
                     Camping                           10.00
C      X9          White Spruce         3.92            3.00
       X10         Wheat               25.00           10.00
       X11         Fishing             21.00            1.00
       X12         Multiple-Use:        8.92
                     White Spruce                       2.00
                     Fishing                            1.00
D      X13         Poplar               1.34            0.50
       X14         Potato              87.50          175.00
       X15         Hunting             14.84            3.00
       X16         Multiple-Use:       14.34
                     Poplar                             0.10
                     Hunting                            3.00


( ... continued )


TABLE 5 (cont'd)


Land   Activity,   Labour,       Labour(2)     Total          Profit(4), $
       acre        Man-Days      Cost, $       Cost(3), $

A      X1            0.10          2.00          5.00          - 1.86
       X2            0.30          6.00         18.00          - 9.95
       X3            0.20          4.00          7.00          - 1.86
       X4                                        7.00            0.14
         White Spruce 0.10         2.00
         Hunting     0.10          2.00
B      X5            0.05          1.00          2.00            0.24
       X6            1.50         30.00        205.00          -73.75
       X7            5.00        100.00        130.00            0.20
       X8                                       30.40            1.44
         White Spruce 0.01         0.20
         Camping     1.00         20.00
C      X9            0.25          5.00          8.00          - 4.08
       X10           0.25          5.00         15.00           10.00
       X11           0.50         10.00         11.00           10.00
       X12                                       9.00          - 0.08
         White Spruce 0.10         2.00
         Fishing     0.20          4.00
D      X13           0.05          1.00          1.50          - 0.16
       X14           0.25          5.00        180.00          -92.50
       X15           0.60         12.00         15.00          - 0.16
       X16                                      13.70            0.64
         Poplar      0.03          0.60
         Hunting     0.50         10.00


(1) The gross return per activity per acre is equal to the
    unit value of the product (as given in Table 3 and Table
    4) multiplied by the quantity of the product per acre
    (as given in Table 1).

(2) The capital cost per acre and the labour cost per
    man-day are believed to be "reasonable" estimates.
    However, when a real situation is dealt with, people
    knowledgeable in these fields should be consulted in
    order to make the estimates as realistic as possible.
    The labour cost was calculated at $20 per man-day.

(3) The total cost is the sum of capital cost and labour
    cost.

(4) The profit is equal to the gross return minus the total
    cost per activity per acre.
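Notes (1) to (4) amount to a short chain of arithmetic, which can be sketched as follows for activity X10 (wheat on land C). The function name and argument names are illustrative only; the input figures are those of Tables 1, 3 and 5.

```python
LABOUR_RATE = 20.0  # $ per man-day, as stated in note (2)


def per_acre_accounts(yield_per_acre, unit_price, capital_cost, labour_days):
    """Return (gross return, total cost, profit) per acre, following notes (1)-(4)."""
    gross = yield_per_acre * unit_price                     # note (1)
    total_cost = capital_cost + labour_days * LABOUR_RATE   # notes (2) and (3)
    profit = gross - total_cost                             # note (4)
    return gross, total_cost, profit


# X10: 20 bushels of wheat per acre at $1.25, capital $10.00, 0.25 man-days.
gross, total_cost, profit = per_acre_accounts(20.0, 1.25, 10.0, 0.25)
print(gross, total_cost, profit)
```

For X10 this reproduces the Table 5 row: a gross return of $25.00, a total cost of $15.00 and a profit of $10.00 per acre.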


TABLE 6


LEVELS OF ACTIVITIES AND CONSTRAINTS
AT OPTIMAL SOLUTION


Activity                          Level          Constraint    Level

X8   White spruce production      80 acres       Capital       $6,374
     & camping, land B                           Labour        700 man-days
                                                 Land A        0 acres
X10  Wheat production,            268 acres      Land B        80 acres
     land C                                      Land C        268 acres
                                                 Land D        1042 acres
X16  Poplar production            1042 acres
     & hunting, land D


Total Profit = $3,462.00



SENSITIVITY ANALYSIS - GENERAL CONSIDERATIONS


Clearly, the optimum values of the variables obtained
using linear programming are dependent upon the coefficients
used in specifying a given problem. In general, a change in
one or more of the coefficients may affect the optimum
outcome significantly. One should note, first, that the
coefficients used to define a problem nearly always vary
over time. For instance, the price of white spruce, $5.60
per cord this year, may change significantly next year.
Secondly, the coefficients are seldom known with complete
certainty; in some cases they are only approximations of
what actually happens in the real world. An example of
imprecisely defined coefficients in the model presented in
this paper is the set of profit coefficients derived by
using opportunity cost as a surrogate price for outdoor
recreation products. They are inaccurate because they are
estimated. It should also be noted that, depending on the
type of data used, models are sometimes found to be quite
insensitive to changes in coefficients. By discovering such
insensitive regions, one can save a lot of time and expense
by using approximate figures rather than precise figures
obtained through extensive studies. Still, unless all
coefficients used in an application are known exactly, or
the sensitivity of the model to coefficient changes is known
to be small, it is important to investigate the effect on
the optimal solution of coefficients taking on values
different from those used to obtain the initial solution.


To study the effect of different coefficient changes on
the optimal solution, several variational procedures can be
used. These are: (1) varying the objective function
coefficients, also called profit coefficients in this paper,
(2) varying the requirement coefficients, (3) varying the
activity coefficients and (4) varying simultaneously the
profit and requirement coefficients. Only procedures (1) and
(2) were used in this study. The use of these two procedures
was considered sufficient for illustrative purposes.


The range of values that a profit coefficient can assume
without involving a policy change, when the other
coefficients are held constant, is presented in Table 7. One
can see that if the profit coefficient of activity X8
(multiple use of land B) drops to $1.39 per acre,
representing a change of about 3 percent, the optimum policy
would be different (X8 would leave the basis and X5, using
land B for white spruce production, would enter the basis).
If, indeed, the amount of uncertainty involved in estimating
this coefficient is more than 3 percent, the computed or
estimated optimum policy may not be the "true" optimum.

Certainly the range information provides useful data to
the decision maker. For example, it indicates whether an
existing policy should be altered if there is a change in
market price or in capital and labour costs. From a
management point of view, however, the most useful
interpretation of the range information is the clear
definition of the alternative with which acceptable profit
coefficients are associated. In particular, Krutilla (see
Reference 35), among others, has emphasized the "value" of
not cutting off future options. In this regard, one should
see which alternative is associated with a change in profit
coefficient that cuts off the fewest future options.

Along a similar line, if a linear programming solution
suggests moves from forestry to agriculture involving major
capital expenditures which are not considered in the model,
as well as a lack of flexibility in returning to forestry,
one should be wary of the optimal solution. This is
particularly true if sensitivity analysis suggests that
other alternatives may be pursued with nearly equal
profitability. Alternatively, one could introduce into the
model the costs of moving to agriculture and determine a
more appropriate optimal solution.

The capital expenditure needed to achieve a near-optimal
situation may be small, and that needed to achieve an
optimum much higher, so that the best solution in a linear
programming sense, based on a single-project perspective,
may be poor in a broader cost-benefit or PPB framework. (See
Reference 26.) To expand on this point, a project can be
considered in isolation or as a component of a system in
which there are many priorities and goals. If a goal is to
employ people, moving from an optimal linear programming
solution which is not labour-intensive to a near-optimal
solution which is labour-intensive may make excellent sense
from a systems point of view.


TABLE 7


THE RANGE OF VALUES A PROFIT COEFFICIENT CAN ASSUME,
WHILE ALL OTHER COEFFICIENTS ARE HELD CONSTANT,
WITHOUT INVOLVING A CHANGE OF BASIS


Activity,    Lower        Original     Upper
acre         Limit, $     Value, $     Limit, $

X1           -inf         - 1.86        0.12
X2           -inf         - 9.95        0.36
X3           -inf         - 1.86        0.24
X4           -inf           0.14        0.24
X5           -inf           0.24        0.28
X6           -inf         -73.75        2.03
X7           -inf           0.20        6.25
X8           1.39           1.44        infinity
X9           -inf         - 4.08       10.00
X10          9.69          10.00        infinity
X11          -inf          10.00       10.30
X12          -inf         - 0.08       10.06
X13          -inf         - 0.16        0.06
X14          -inf         -92.50        0.30
X15          -inf         - 0.16        0.72
X16          0.37           0.64        0.66


-inf = minus infinity


Basically, the discussion presented above suggests that
linear programming is only valuable when used as a
management aid (as opposed to an automated way of making
decisions). Of critical importance in using linear
programming is the ability of the manager to see that linear
programming results are interpreted in the context of (1)
alternative solutions, (2) alternative strategies that are
politically and socially acceptable in the short term, and
(3) the relative ease of moving to a given solution and how
irreversible this move is.


SENSITIVITY ANALYSIS - THE EXAMPLE


As already noted, in the sensitivity analysis carried
out, profit coefficients were allowed to vary. The lower
limits of some profit coefficients were allowed to go to
minus infinity since none of the activities corresponding to
these coefficients was in the optimum solution. On the other
hand, the fact that the upper limit of the profit
coefficient of activity X8 (multiple use of land B) is at
plus infinity indicates that no increase in this profit
coefficient will effect a shift of policy, so long as the
other coefficients are held constant. The upper limit of
activity X10 can be similarly interpreted.

Sometimes one may want to know what it would cost to
produce a unit of a non-basic activity without causing any
activity to leave the basis. For instance, using one acre of
land A for white spruce production and hunting (activity X4)
would decrease the total profit by $0.10 (Table 8). This
would also have the effect of increasing the capital used by
$1.83 and decreasing the use of land D for poplar production
and hunting (activity X16) by 0.38 acres, while activities
X8 and X10 would be unaffected. The maximum number of acres
that can be used to carry out activity X4, without causing
any activity to leave the basis, is 145. If all of these 145
acres of land A were used for activity X4, then total profit
would decrease by $14.50 and the capital slack would be used
up. Thus, in view of the small loss in total profit, it may
be a "good" policy to use land A for multiple use rather
than to leave it completely idle.
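The figures quoted for activity X4 can be reproduced by a simple substitution argument, sketched below. It assumes that labour is the only binding resource shared by X4 and X16 (land A is idle and capital has slack in Table 6), so that each acre of X4 must be "paid for" with the labour released by reducing X16. The per-acre inputs are sums of the multiple-use component rows of Table 5; the function and key names are illustrative, not part of the original study.

```python
def force_nonbasic(entering, displaced):
    """Effect of one acre of a non-basic activity that competes with a
    basic activity for labour only (an assumption of this sketch)."""
    ratio = entering["labour"] / displaced["labour"]  # acres displaced per acre entered
    return {
        "displaced_acres": -ratio,
        "profit_change": entering["profit"] - ratio * displaced["profit"],
        "capital_change": entering["capital"] - ratio * displaced["capital"],
    }


# Per-acre figures from Table 5 (component rows of multiple-use activities summed).
x4 = {"profit": 0.14, "labour": 0.10 + 0.10, "capital": 1.00 + 2.00}
x16 = {"profit": 0.64, "labour": 0.03 + 0.50, "capital": 0.10 + 3.00}

effect = force_nonbasic(x4, x16)
for name, value in effect.items():
    print(f"{name:16s} {value:+.2f}")
```

Rounded to two decimals, the sketch gives a profit change of -0.10, a capital change of +1.83 and a reduction of 0.38 acres in X16, in agreement with the text.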

Another type of very useful information that can be
gained from a post-optimal analysis of a linear programming
solution is that gained by varying those requirement
coefficients which are used at their limit levels. Recalling
that for the current optimal solution (Table 6) all 700
man-days of labour, 80 acres of land B and 268 acres of land
C were used up, it may be interesting to investigate the
effect on total profit, the basic activities and the other
constraints if these three requirement coefficients were
allowed to relax one at a time. It is seen (Table 9) that an
increase of one man-day of labour increases the total profit
by $1.20 and the capital used by $5.85. The $1.20 increase
in total profit is the result of increasing the use of land
D by 1.89 acres for poplar production and hunting (X16). It
is also seen that the increase of one man-day of labour has
no effect on activities X8 and X10. On the other hand, a
decrease of one man-day of labour would decrease total
profit by $1.20, capital used by $5.85 and land D used by
1.89 acres, while activities X8 and X10 remain unaffected.
The range of validity over which the labour constraint can
be changed without causing a change of basis is 147 to 702
man-days. Since the original allocation of labour, 700
man-days, was entirely used up in the current optimal
solution, no more than two man-days can be added if the
current policy of carrying out the activities X8, X10 and
X16 is to be kept. However, it is possible to decrease the
number of man-days by 553. Similar interpretations can be
given to a unit change of the requirement coefficients of
land B and land C.
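The marginal figures quoted above can be approximated from the Table 5 coefficients alone. The sketch below assumes, as in the reported solution, that X16 is the only basic activity free to absorb or release labour, so an extra man-day goes entirely to X16 and an extra acre of land B or land C draws its labour away from X16. Variable names are illustrative.

```python
# Per-acre figures from Table 5 (multiple-use component rows summed).
x8 = {"profit": 1.44, "labour": 0.01 + 1.00}
x10 = {"profit": 10.00, "labour": 0.25}
x16 = {"profit": 0.64, "labour": 0.03 + 0.50, "capital": 0.10 + 3.00}

# One extra man-day of labour is absorbed by X16.
extra_x16_acres = 1.0 / x16["labour"]
labour_value = x16["profit"] * extra_x16_acres
extra_capital = x16["capital"] * extra_x16_acres

# One extra acre of land B (or C) earns its own profit but takes labour from X16.
land_b_value = x8["profit"] - x8["labour"] * labour_value
land_c_value = x10["profit"] - x10["labour"] * labour_value

print(f"labour: profit {labour_value:.3f}, capital {extra_capital:.3f}, "
      f"X16 {extra_x16_acres:.3f} acres")
print(f"land B: profit {land_b_value:.3f}")
print(f"land C: profit {land_c_value:.3f}")
```

The results agree with the reported values ($1.20, $5.85 and 1.89 acres for labour; $0.22 for land B; $9.69 for land C) to within about a cent, the small differences being consistent with rounding.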

The increased (or decreased) profit resulting from
changing the coefficient of a constraint at its limit level
is called marginal value. Like reduced profit, marginal
value is valid only for changing one coefficient at a time
within a specified range while keeping the other
coefficients constant.


TABLE 8


THE EFFECT ON TOTAL PROFIT,
CONSTRAINTS AT LIMIT LEVELS,
AND THE BASIC VARIABLES
OF FORCING A UNIT OF A NON-BASIC VARIABLE
INTO THE OPTIMAL BASIS


Activity,   Profit,    Capital,    X8,     X10,    X16,     Extent of
acre        $          $           acre    acre    acre     substitution

X1          - 1.98        2.42     *       *       -0.19    110
X2          -10.31       10.25     *       *       -0.57     26
X3          - 2.10        1.83     *       *       -0.38    145
X4          - 0.10        1.83     *       *       -0.38    145
X5          - 0.04      - 3.68     -1      *       ...      ...
X6          -75.78      161.83     -1      *       ...      ...
X7          - 6.05      - 3.64     -1      *       ...      ...
X9          -14.08      - 7.00     *       -1      ...      ...
X11         - 0.30      -10.46     *       -1      ...      ...
X12         -10.14      - 7.29     *       -1      ...      ...
X13         - 0.22        0.21     *       *       ...      ...
X14         -92.80      173.54     *       *       ...      ...
X15         - 0.88      - 0.51     *       *       ...      ...


*   = no effect
... = entry not legible


TABLE 9


THE EFFECT OF A UNIT OF CHANGE
OF A CONSTRAINT AT LIMIT LEVEL ON
TOTAL PROFIT, THE POLICY VARIABLES,
AND OTHER CONSTRAINTS


                   Labour,       Land B,      Land C,
                   man-day       acre         acre

Profit, $           1.20          0.22         9.69
Capital, $          5.85          4.39         8.54
Land A, acre        *             *            *
Land D, acre        1.89         -1.90        -0.47
X8, acre            *             1.00         *
X10, acre           *             *            1.00
X16, acre           1.89         -1.90        -0.47
Range of
substitution       (147,702)     (77,140)     (259,299)


* = No effect


The variational procedures discussed so far involved
changing one coefficient at a time. However, the change of
one coefficient need not cause a change of basis.
Furthermore, one can also gain insight into a linear
programming solution by varying several coefficients
simultaneously. Such a technique is called parametric
programming. (For a practical application see Reference 26.)


TABLE 10


THE CHANGE OF BASIS CORRESPONDING TO A SIMULTANEOUS CHANGE
OF THE OPPORTUNITY COSTS ASSOCIATED WITH
RECREATION AND MULTIPLE-USE ACTIVITIES


Change in          Resulting              Total
Opportunity        Optimal                Profit,
Costs, %           Basis*                 $

 1.88              (X8, X11, X16)          3,742.70
59.83              (X8, X11, X15)         14,881.60
-4.60              (X5, X10)               2,699.20


* The original optimal basis was (X8, X10, X16).


Of particular interest is the simultaneous variation of
those profit coefficients determined by using opportunity
cost (see Equation A.1 in the Appendix). Here one can
ascertain the stability of this formula and thus gain an
indication of how essential accuracy is in estimating the
different coefficients of production. Since the basis
changed as the result of a small change of the profit
coefficients of X3, X4, X7, X8, X11, X12, X15 and X16 (Table
10), it is obvious that the model is sensitive to small
changes in opportunity costs. This suggests that the
opportunity cost formula should be used with extreme
caution. From Table 10, one also sees that when the profit
coefficients change simultaneously by a mere 1.88 percent,
it becomes more profitable to use land C for fishing than
for the production of wheat. The new policy resulting from
this change of the profit coefficients is to use land B for
white spruce and camping, land C for fishing, and land D for
poplar and hunting. The increase in profit under the new
policy is $280.70. This policy remains "optimal" until the
same profit coefficients change by 59.83 percent, at which
point it becomes more profitable to use land D for hunting
alone than for multiple use. The result of a decrease in the
profit coefficients determined by using opportunity cost can
be similarly interpreted.


CONCLUSION 


In this paper linear programming was applied to the
optimal allocation of publicly owned forestry land among
different uses, some of which, such as recreation, do not
have an established market value. The determination of a
price that should be charged the users of an outdoor
recreation facility was achieved through the use of the
opportunity cost concept (see the Appendix).

However, the main objective of the study was to stress
the importance and usefulness of carrying out a sensitivity
analysis on the input coefficients after a linear
programming solution is obtained. It was emphasized that
sensitivity analysis is an essential part of a linear
programming application for technical reasons such as: (1)
the input coefficients specified in a given problem often
change, (2) the input coefficients are not always exactly
known, and (3) resources, such as time and money, can be
saved by discovering insensitive coefficients.

Reference was also made to the specific procedures that
one can employ to perform a sensitivity analysis on the
input coefficients. Sensitivity analysis can provide very
useful data to a decision maker on what alternative courses
of action are viable and possibly better than the optimum
linear programming solution in terms of (a) preserving
future options; (b) making good sense in terms of social
objectives; (c) being a better solution from a systems
perspective; and so on.
APPENDIX


OPPORTUNITY COST


The opportunity cost method was used to determine the
price that should be charged users for the use of the land
for recreation. The opportunity cost used is the value of
the alternative product which would yield the maximum
profit.


\[
P(2) = \frac{A(1)\,Y(1)\,P(1) - A(2)\,Y'(1)\,P(1) - A(3)\,C(1) + C(2)}{Y(2)} \tag{A.1}
\]


WHERE P(2)  = the price, in dollars, of one unit of a
              recreational product.

      P(1)  = the price, in dollars, of one unit of a
              forestry or agricultural product.

      Y(2)  = the total number of units of a recreation
              product.

      Y(1)  = the total number of units of a forestry or
              agricultural product.

      Y'(1) = the number of units of a forestry product in
              multiple use.

      C(1)  = maintenance and upkeep cost of forestry or
              agricultural production per acre.

      C(2)  = maintenance and upkeep cost of recreation per
              acre.

      A(1), A(2), A(3) = constants having the value of zero
              or unity.


Equation A.1 is a general equation. Depending upon
whether a piece of land is used for a unique product or
multiple use, three cases arise which are described below:


(a) Recreation use alone (A(1) = A(3) = 1, A(2) = 0).


In this case a piece of land is used entirely for
recreation and Equation A.1 becomes:


\[
P(2) = \frac{Y(1)\,P(1) - C(1) + C(2)}{Y(2)} \tag{A.2}
\]


Basically, Equation A.2 shows that the opportunity cost of
using a piece of land for one unit of recreation is equal to
the sum of the profit that could have been obtained if the
land had been used for agricultural or forest products plus
the cost of maintaining the land for recreation use, all
divided by the quantity of the recreation product.

(b) Recreation use in part (A(1) = A(2) = A(3) = 1).


In this case, a part of a piece of land is used for
recreation, the other part for forest production. Thus,
Equation A.1 becomes:


\[
P(2) = \frac{Y(1)\,P(1) - Y'(1)\,P(1) - C(1) + C(2)}{Y(2)} \tag{A.3}
\]


The expression Y(1)P(1) - Y'(1)P(1) in Equation A.3
represents that portion of the revenue foregone when only
part of a piece of land is used for recreation.


(c) Double use (A(1) = A(2) = A(3) = 0).


Finally, a piece of land is utilized for recreation and
forestry at the same time. Nothing is really foregone here,
for the land is used for one production as much as for the
other. The price of recreation is assumed to be equal (at
least) to the cost of upkeep of the land for recreation.
Thus, one obtains:


\[
P(2) = \frac{C(2)}{Y(2)} \tag{A.4}
\]


Equation A.4 implies that in multiple use involving hunting
or fishing, forest productivity is not diminished. This
notion has also been expressed by Pichette and Potvin (see
Reference 52) who, in one of their studies concerning
multiple use, suggested: "It is not unreasonable to say that
good forest exploitation aimed at ensuring a maximum
sustained production, abundant regeneration and improved
planning also promotes increased production of fauna by
appropriate environmental conditions".

The use of Equations A.2, A.3 and A.4 undoubtedly
requires a good knowledge of the physical capacity,
productivity and cost of production of a piece of land for
every possible use.
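As an illustration of how Equation A.1 and its special cases operate, the sketch below evaluates two of the land A entries of Table 4 from the figures in Tables 1, 3 and 5. It assumes that C(1) and C(2) are the per-acre total costs (capital plus labour) listed in Table 5, and that white spruce is the most profitable alternative use of land A; both assumptions are inferred from the tables rather than stated in the text, and the function name is illustrative.

```python
def opportunity_cost(y1, p1, y1_multi, c1, c2, y2, a1, a2, a3):
    """Equation A.1: price per unit of recreation product, P(2).

    a1, a2, a3 are the zero-or-one switches A(1), A(2), A(3).
    """
    return (a1 * y1 * p1 - a2 * y1_multi * p1 - a3 * c1 + c2) / y2


# Case (a), recreation alone: hunting on land A (activity X3).
# Alternative use: white spruce, 0.56 cords at $5.60, total cost $5.00.
# Hunting: total cost $7.00 per acre, 11 visitor-days per acre.
hunting_alone = opportunity_cost(
    y1=0.56, p1=5.60, y1_multi=0.0, c1=5.00, c2=7.00, y2=11.0,
    a1=1, a2=0, a3=1,
)

# Case (c), double use: hunting component of activity X4 on land A.
# Hunting component cost: $2.00 capital + $2.00 labour = $4.00 per acre.
hunting_multi = opportunity_cost(
    y1=0.56, p1=5.60, y1_multi=0.56, c1=5.00, c2=4.00, y2=11.0,
    a1=0, a2=0, a3=0,
)

print(f"hunting alone, land A: ${hunting_alone:.2f} per visitor-day")
print(f"hunting in multiple use, land A: ${hunting_multi:.2f} per visitor-day")
```

Under these assumptions the two values round to $0.47 and $0.36, the first two entries of Table 4.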

There are advantages as well as disadvantages in using
opportunity cost to determine the price of outdoor
recreation products. The advantages are that one would be
working with timber or agricultural products for which
market values are generally well established. Also, the
method is relatively simple to use. An additional advantage
is that it enables a decision maker to see if the cost of
providing certain outdoor recreation facilities is
excessively high. A disadvantage is that it only permits the
definition of a minimum price for outdoor recreation
products. It does not tell us anything about the value of
the recreation product as perceived by the consumer. (See TN
31 and TN 38, plus references cited there.)
CORD STUDY TECHNICAL NOTE 25


AN INTEGRATED APPROACH TO
MULTI-DIMENSIONAL EVALUATION
AND COST-EFFECTIVENESS ANALYSIS


Based on a paper by
W. Acar(1)


ABSTRACT 


This paper shows how to integrate cost-effectiveness
analysis with multi-dimensional evaluation techniques for
use at the operational level in government agencies. It is
concerned with a procedure that can be utilized when only
one of several projects is to be the recipient of funds.

Emphasis is on making effective use of the decision
maker's intuition to disaggregate decisions. The aim is to
consider all the different factors that should influence a
decision while utilizing a framework that helps to ensure
that different factors affecting the final decision are
given the "right weight". The framework for project
evaluation is a collection of rules that guide one in
defining scores, selecting factors for consideration, etc.
by using intuition more effectively than is possible when
traditional scoring procedures are used.

The method proposed offers a means through which the
decision maker can control the amount of influence that
intangibles have on a decision when compared to particular
economically measurable factors.

An example is given to show how the integrated approach
can be used.


INTRODUCTION 


If people are going to make judgements, it is
imperative to take all possible steps to see that insight
and intuition are used effectively.

Yet, given the topic of the article, it is easy for an
academic to see little value in it. He might criticize the
"mathematics" or the psychological conceptualization as
naive. But the real question is: Does the article present an
approach that allows a manager to make better decisions?


1. This version of TN 25 is based on an original paper
   prepared for the 1973 meeting of the Canadian
   Association of Administrative Sciences (to be published
   in ... FINANCE). This edited text for Volume II was
   revised by J. Beaman and L. Belfry.
Suggestions that propose that one should do research for
five years to make a decision that must be made this day or
this week are not acceptable. Suggestions of spending $5,000
on studies to select alternative ways to spend $10,000 would
also be unacceptable. These points deserve serious
consideration before this article is criticized. Actually,
the best constructive critique of this paper would be
another paper offering a simpler, better method of using
judgemental assessments than the method described here.
Regardless, the paper's prime purpose is to propose an
evaluation technique that can be used when the staff of a
business or government agency have the opportunity to decide
how a program's objectives should be achieved when there are
several projects that can be carried out and no "perfectly
rational" quantitative method is available to select a best
alternative. Such a situation arises with respect to
recreation land acquisition and development, partly because
of the difficulty of dealing with intangibles (see TN 34 on
demand for recreation and price, and Coomber and Biswas and
sources cited there on intangibles).

In the P.P.B. system or similar styles of government or
business management practiced in North America, individual
projects are seen as embedded components in a hierarchy of
activities. The intended outputs of larger and encompassing
programmes are decided on at the upper decision-making
levels. At these levels apples and oranges are compared and
decisions about how many (much) to consume (buy, provide)
are made on political, ideological or traditional grounds.
However, decisions to select individual projects to achieve
a specific objective are made at the lower operational
levels. Here the main concern is with the comparison of
truly comparable alternatives and usually comparison is on
the basis of cost-efficiency of different projects, or
efficiency in some broader sense. One approach to efficient
management has involved the use of project scoring to select
the "best" project from among several that will achieve
program objectives.


GOVERNMENTAL PROJECT EVALUATION AND 
MULTI-DIMENSIONAL SCORING 


Often, when it is necessary in project evaluation to
consider those so-called intangible factors that cannot be
evaluated in economic terms, the various intangible factors
considered relevant are assigned scores. The scores assigned
to the different factors are added up and the projects
evaluated on the basis of some total score. This process of
scoring has often been referred to as the multi-dimensional
scoring of projects.

Such multi-dimensional scoring practices are in wide
use in academic, managerial and military spheres. Whether
the alternatives to be ranked are students' papers, business
projects, or prospective employees, the procedure followed
is essentially the same: one conjures up the factors or
"dimensions" of an evaluation, ranking is done on each
dimension, and then by some process a decision is made. But
if there are two projects P(1) and P(2) to be ranked along
two criterion dimensions A and B, there are four possible
cases. They are:


a. P(1) is better than P(2) according to both A
   and B.

b. P(2) is better than P(1) according to both A
   and B.

c. P(1) is better than P(2) according to A but
   P(2) is better than P(1) according to B.

d. P(2) is better than P(1) according to A but
   P(1) is better than P(2) according to B.


In the first two cases it is readily apparent which
project would be preferred globally. In the last two cases a
judgement is not readily possible because one must recognize
that a project is the "best" because of only one criterion
while acknowledging that another project is better in terms
of a different criterion dimension. Consequently, the
decision maker may arbitrarily elect to treat the two
criteria as equally important (even though one has
questionable validity) and make a judgement in terms of some
total score for the two factors.


A CARDINALIZATION TECHNIQUE FOR 
IMPROVING MULTI-DIMENSIONAL SCORING 


As just implied, one reason most ranking procedures use
numbers that can be added, subtracted or otherwise compared
is the intuitive appeal of avoiding those situations in
which a decision cannot logically be made. Whether the given
quantifications are valid or not is another issue. For
example, consider the cardinalization of the previously
mentioned rankings by means of assigning score-values to
each project proposal along the two dimensions. If case d.
is under consideration, one could (for example) be
considering the situation shown in Table 1.

If the criteria are equally important, the respective
"values" of the two projects can be reflected in their
aggregated scores, which are 15 and 10. But even if these
really are the scores and the aggregate scores do not fall
too close to each other, it still need not be clear which
project truly ranks better overall. Yet this type of scoring
aggregation is widely used with various degrees of care.

The author claims that in order for the aggregate score 
values to be meaningful, a number of conditions have to be 
met. These include: 
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TABLE 1 


PROJECT SCORES FOR TWO PROJECTS 
ON TWO CRITERION DIMENSIONS 


                        Project Number 

                          1       2 
Criterion A               7       8 
Criterion B               8       2 
Aggregate Scores         15      10 

TABLE 2 


DETAILS FOR CALCULATING PROJECT SCORES 


Project                   1       2       3       4 


A. PROJECT RANKS 

Dimension A               1       4       2       3 
Dimension B               4       2       3       1 
Dimension C               3       2       4       1 

B. PROJECT SCORES 

Dimension A             100      25      75      50 
Dimension B              10      70      40     100 
Dimension C              40      70      10     100 

C. "NORMALIZED" 
   PROJECT SCORES 

Dimension A           100.0    25.0    75.0    50.0 
Dimension B             7.5    52.5    30.0    75.0 
Dimension C            20.0    35.0     5.0    50.0 

NORMALIZED 
PROJECT SCORE 
TOTALS                127.5   112.5   110.0   175.0 

TOTAL "NORMALIZED" 
PROJECT SCORE = 127.5 + 112.5 + 110.0 + 175.0 = 525.0 

Condition 1. The criteria dimensions along which scores are 
taken must be "relatively" independent of each 
other (to avoid redundancy of dimensions); 


Condition 2. The dimensions affecting the choice to be made 
should be relevant to the evaluation of 
alternatives being considered to achieve the 
objective of the program. 


Condition 1 implies that the criteria dimensions used 
should not involve a number of variables that are merely 
different names for the same phenomenon. In applied terms, 
an individual count of each bacterial species in a water 
sample to determine the degree of pollution is redundant 
when an aggregate count of certain types of bacteria 
describes it sufficiently. Each species count in some sense 
describes the same thing. Similarly, from a parks management 
perspective, if a score is to reflect whether there is or is 
not historic development and to reflect the degree of that 
development from a check list of kinds of historic 
developments, then it is more appropriate to include a 
single variable (possibly a Guttman scale value) than to 
include a number of variables. Usually if, say, an historic 
building exists, artifacts, signs, habitation and other 
conditions "automatically" exist, so it is redundant to 
register their existence by having a great "checklist" of 
variables to use in "scoring" the site. The aggregate score 
obtained by summing the scores on the individual variables 
will be a poor site selection criterion and a poor general 
indicator of the nature of the site. 

Pursuing the earlier example, the existence of certain 
bacteria in an area designated for swimming is only relevant 
if these bacteria represent a hazard or are present when 
some hazard is likely. The total aggregate count of harmful 
bacteria may indicate the level of pollution to a manager of 
the area. One count, rather than counts of a variety of 
species, tells the planner whether or not the water is 
sanitary enough for swimming. 

Turning to the actual operation of scoring, it is usual 
that for each specific criterion dimension the most 
favourable alternative is given the score of 100. Similarly 
the least favourable alternative is given a score of 0 or 1; 
the remaining projects or alternatives are ranked between 
the two extremes. This effort is referred to as the "within 
variables scaling". 

But the problem with "within variables scaling" is 
that, with respect to each variable, the best alternative is 
not generally 100 times better than the lowest ranking 
project. Realistically, this magnitude of difference in 
ranking does not occur very often. In their 1968 proposal to 
the California State Department of Parks and Recreation, 
Alfred Baxter and Associates showed how simply this problem 
could be eliminated (see Reference 3). Their solution was to 
give the least favourable project a score that reflected its 
"status" in relation to the alternative that ranked 100 
along the same dimension. 

This scoring method encourages one to use his insight 
about a relation to express the fact that, for example, 
along dimension i, the best alternative is "4 times" more 
preferable than the worst alternative. Such an insight would 
be expressed by giving the poor project a score of 100/4 = 
25. Now if on criterion j, the preferred alternative is 20 
times better than the least preferred, the least 
preferred would receive a score of 5. (Because it is then 
assumed that the other projects take values at equal 
intervals between the best and the worst, the inclusion of a 
very bad project in the ranking will automatically reduce 
the value of intermediate projects relative to the best. An 
exceptionally good project will have the opposite effect. As 
a partial solution to this bias, not only the worst, but the 
median project (and even the 25th and 75th percentile 
projects) should be compared to the best, with intermediate 
projects taking values at equal intervals between.) 
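
The basic equal-interval version of this scaling can be 
sketched as follows. The function name and the ranks are 
hypothetical, ranks are assumed to be the distinct integers 
1 to n with 1 the best, and the refinement using the median 
or quartile projects is not included. 

```python
def within_variable_scores(ranks, best_to_worst_ratio, best_score=100.0):
    """Convert ranks (1 = best) into scores at equal intervals.

    The best-ranked alternative gets best_score, the worst-ranked gets
    best_score / best_to_worst_ratio, and the remaining alternatives are
    spaced evenly between the two. Assumes at least two alternatives.
    """
    n = len(ranks)
    worst_score = best_score / best_to_worst_ratio
    step = (best_score - worst_score) / (n - 1)
    return [best_score - (rank - 1) * step for rank in ranks]


# Best judged "4 times" preferable to the worst: worst scores 100/4 = 25.
print(within_variable_scores([1, 4, 2, 3], 4))
# Best judged 20 times preferable to the worst: worst scores 100/20 = 5.
print(within_variable_scores([4, 2, 3, 1], 20))
```

Changing only the ratio moves every intermediate score, which 
is the bias the parenthetical remark above describes. 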

The flexibility introduced by not setting the lowest 
score at zero means in some way that "within variables 
scaling" is considered independently of the weighting of the 
variables. When "within variables scaling" has been 
accomplished, there remains the problem of assigning 
relative importance to each criterion dimension. Because 
each criterion dimension has a highest score of 100, 
assigning importance to the dimensions is independent of 
scaling within the dimension. What must be done is to 
compare the criteria that score 100, regardless of the 
project to which they apply, and evaluate how variable X 
should relate to variable Y, variable Y to variable Z, etc. 
until one has, say, X/Y = 4, Y/Z = 2, Z/W = 1, W/M = 2. The 
series just given suggests relative weights of 8, 2, 1, 1, 
.5 for X, Y, Z, W, M respectively. 
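
As a rough sketch, a chain of pairwise importance ratios can 
be turned into relative weights by working backwards from the 
last variable. The function name is hypothetical, and the 
ratios 4, 2, 1 and 2 are an assumed reading chosen to agree 
with the weights 8, 2, 1, 1 and .5 quoted above. 

```python
def weights_from_ratios(ratios, last_weight=1.0):
    """Turn successive importance ratios into relative weights.

    ratios[i] is the importance of variable i divided by that of
    variable i + 1; the final variable is assigned last_weight and
    the others are obtained by multiplying back up the chain.
    """
    weights = [last_weight]
    for ratio in reversed(ratios):
        weights.insert(0, weights[0] * ratio)
    return weights


# X/Y = 4, Y/Z = 2, Z/W = 1, W/M = 2, with M fixed at 0.5.
print(weights_from_ratios([4, 2, 1, 2], last_weight=0.5))
```

Only the ratios matter: a different choice of last_weight 
rescales every weight by the same factor. 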

The Alfred Baxter and Associates (see Reference 3) 
criterion dimensions for a cost-benefit scaling procedure to 
evaluate land for acquisition as a park can be used to 
illustrate in applied terms the import of the concepts 
reviewed to this point. Their criteria are: 


1. Suitability for state acquisition; 


2. "Ability" of the site to generate demand under 
a given development option; 


3. Endangered future availability; 


4. Quality and balance of the recreation 
experience provided; 


5. Economic contribution to the surrounding 
community; 


6. Open-space values; 


7. Appropriateness in meeting deficit in 
recreation facilities. 


These criteria were given the following set of weights: 
c1 = 1.00, c2 = .67, c3 = .51, c4 = .85, c5 = .33, c6 = .35, 
c7 = .62. Now, the merit of these consultants' handling of 
"within variables scaling" has already been acknowledged. 
But the adequacy or appropriateness of the above criterion 
dimensions with reference to conditions 1 and 2 cited 
earlier is dubious. The Baxter and Associates' criteria 3, 
4, and 6 are meaningful. However, criterion 7 appears to be 
misplaced: there is no need for any project if there is no 
present or future deficit in recreation facilities 
recognized. As to criteria 2 and 5, the degree of relevance 
of these criteria should be dealt with by some type of 
economic analysis (as suggested later). Arbitrarily 
assigning them weights ignores the fact that they are two 
interrelated factors, which should be considered as part of 
a rigorous, totally quantitative cost-benefit analysis. 
Finally, suitability (stated as criterion 1) should not be 
used as a criterion because it refers either to ability to 
generate demand under some development option (criterion 2), 
or to such factors as land quality for development 
(criterion 4) or economic suitability (criterion 5). Its 
inclusion involves either redundancy or irrelevance and may 
even be a guess at the scale to be derived as part of the 
scale. In the latter case this "cooking" factor, which has 
the highest weight of all criterion dimensions, is simply 
used to bias the final score. 


INTEGRATING MULTI-DIMENSIONAL SCORING 
WITH COST EFFECTIVENESS ANALYSIS 


Thus far, a clear distinction between how the variables 
that can be measured economically should be treated and how 
intangible variables should be treated has not been made. 
But, as implied in the last section, this is where the 
Baxter and Associates' scoring method has shortcomings. 


a. There is a great temptation to over-weigh the 
mass of non-output-oriented (intangible) 
criteria with respect to "production 
efficiency". 


b. An adequate link between "suitability" and the 
classical cost-effectiveness, "C-E", approach 
is not specified: there is a need to tie the 
suitability of a project more directly to its 
output to cost or quantity to cost ratio, 
Q/C. 


It is important to account for the non-quantifiable 
aspects by a scoring system, but it should not be at the 
expense of the quantifiable ones (i.e. those directly 
expressible as output or cost figures, or expectations of 
output or cost). 

In short, the manager must realize that the ultimate 
aim of project evaluation is to produce output and not 
abstract score figures. Accordingly, the role of scoring 
should be limited to the production of some index of 
"aggregate suitability". This "aggregate suitability" may 
modify the output/cost ratio but it should not be possible 
for it to dominate an evaluation unless this is a clear and 
conscious choice of the manager responsible for project 
selection. 

The preceding discussion gives the rationale for the 
third condition for scaling in project selection: 


Condition 3. Overall suitability of a project should be 
determined by using a multi-dimensional scoring 
method on "intangibles" to define a score that 
can be used to modify a C-E evaluation (1) to a 
fixed degree, and (2) in a well defined and 
understandable way. 


To achieve "true" integration of the two kinds of results, 
the manager may, for example, calibrate his aggregate 
"intangible" score in such a way that for an average 
suitability score, the output to cost ratio, Q/C, becomes 
the only consideration. 


Consider that a recreation facility or service 
manager's intuition can be used to decide what percentage of 
influence, "d", intangible factors should have on the 
tangible output/cost ratio for particular projects being 
evaluated. Then it is an easy matter to recalibrate the 
total scores to get an index, IS, that will have a value of 
1 - d for the project with the lowest intangible total score 
and will have a value of 1 + d for the highest score. If d = 
.25 then IS will be .75 for the lowest score project and 
1.25 for the highest scoring project. The transformed values 
are merely calculated using the following equation which is 
written in a way that makes it easy to see the rationale on 
which the equation is based. The equation to compute 
IS(j), the index for project j, is: 


$$IS(j) = 1.0 + d \cdot \frac{\text{Score for project } j - (\text{Best Score} + \text{Poorest Score})/2}{(\text{Best Score} - \text{Poorest Score})/2}$$


To emphasize why IS(j) was calculated it seems reasonable to 
call it the index of intangible project suitability. 
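
A minimal sketch of this recalibration follows. The function 
name and the three scores are hypothetical; the formula is 
the one given above, with d as the permitted degree of 
influence of the intangibles. 

```python
def intangible_index(score, best, poorest, d):
    """Rescale an aggregate intangible score to the range 1 - d to 1 + d.

    The midpoint of the best and poorest scores maps to 1.0, the
    poorest score to 1 - d and the best score to 1 + d.
    """
    midpoint = (best + poorest) / 2
    half_range = (best - poorest) / 2
    return 1.0 + d * (score - midpoint) / half_range


# Hypothetical aggregate scores of 40, 70 and 100 with d = 0.25.
for s in (40, 70, 100):
    print(s, intangible_index(s, best=100, poorest=40, d=0.25))
```

A project whose score sits exactly midway between the best 
and the poorest receives an index of 1.0, so its Q/C ratio is 
left unchanged. 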

Now, if one has the results of a rigorous cost 
efficiency analysis, then for each project, j, Q/C(j) is 
available. The crux of what is proposed here is that IS(j) 
can and should be used as a multiplier of Q/C(j) to produce 
a modified cost-efficiency ratio (to become the 
"cost-effectiveness ratio"): 

M Q/C(j) = IS(j) (Q/C(j)). 


Given that IS(j) has been calculated using the scaling 
approach suggested and that relevancy and redundancy of the 
variables used in calculating IS(j) have been appropriately 
considered, the manager ends up with a M Q/C(j) that is 
influenced by at most a known amount by intangibles. 
Assuming that Q/C(j) was calculated correctly, decision 
making should benefit from the very rational use of 
information on intangibles endorsed here. 


APPLICATIONS OF THE METHOD OF CALCULATING THE INDEX OF 
INTANGIBLE SUITABILITY TO LAND ACQUISITION DECISION MAKING 


Assume that four pieces of land are proposed for 
acquisition under, for instance, a Province's recreation 
program. Assume that three dimensions of intangible factors 
were felt to be relevant, and the rankings of the 4 
developments along them are given in Table 2 (Part A). 
Assume further that, along dimension A, the best proposal is 
deemed, by some evaluation committee, four times preferable 
to the worst and that this ratio is equal to ten for both 
dimensions B and C. This would lead to the three 
cardinalizations shown in Part B of Table 2 which are the 
"within variables scoring". To compare across criteria 
assume that the evaluation committee assigned the relative 
weights 1.00, .75 and .50 respectively to dimensions A, B 
and C. Applying these weights to transform the scores in 
Part B of Table 2 one gets the results shown in Part C of 
Table 2. The project scores are then easily derived by 
totaling across criteria in Part C of Table 2. The results 
are shown at the bottom of Table 2. As we can see, the 
spread of "suitability" values is from 110.0 to 175.0. 
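
The arithmetic behind Table 2 can be retraced with a short 
script. This is a sketch only: the variable and function 
names are hypothetical, and the ranks are a reading of Part A 
of Table 2 (the rank of project 3 on dimension C is taken to 
be 4). 

```python
# Ranks of projects 1..4 on each dimension (1 = best), as read from Table 2.
ranks = {"A": [1, 4, 2, 3], "B": [4, 2, 3, 1], "C": [3, 2, 4, 1]}
# How many times preferable the best proposal is to the worst.
ratios = {"A": 4, "B": 10, "C": 10}
# Relative weights assigned to the dimensions.
weights = {"A": 1.00, "B": 0.75, "C": 0.50}


def scores_from_ranks(rank_list, ratio, best=100.0):
    """Equal-interval scores between best and best / ratio."""
    worst = best / ratio
    step = (best - worst) / (len(rank_list) - 1)
    return [best - (r - 1) * step for r in rank_list]


totals = [0.0] * 4
for dim in ranks:
    part_b = scores_from_ranks(ranks[dim], ratios[dim])   # Part B row
    part_c = [weights[dim] * s for s in part_b]           # Part C row
    print(dim, part_b, part_c)
    totals = [t + s for t, s in zip(totals, part_c)]

print("totals:", totals)
```

The lowest and highest totals give the spread of suitability 
values quoted in the text. 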

Assuming a spread of 10 percent is all that top 
management is willing to concede to considerations other 
than the project efficiency, the aggregate score must be 
recalibrated according to the procedure introduced earlier 
to yield an index IS that ranges in value from .9 to 1.1 and 
which is centred on the midpoint between the highest and 
lowest intangible scores for the projects considered. 

Using the formula to calculate IS(j) with d = .1 
results in: 


$$IS(j) = 1 + \frac{2(.1)\,\bigl(\text{score for } j - (175 + 110)/2\bigr)}{175 - 110}$$

and IS(j) = 1 + .00308 (score for j - 142.5). 

So the index values are .95, .91, .9 and 1.1 for 
projects 1 to 4 respectively. Then it is only a matter of 
multiplying and dividing to get the columns in Table 3 to 
have the modified project evaluation information shown in 
that table. 
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TABLE 3 


DATA AND FINAL COMPUTATION OF RESULTS 
TO OBTAIN THE MODIFIED COST-EFFECTIVENESS RATIO 
FOR FOUR PROJECTS 


Project                      1       2       3       4 

Total Output* 
(physical units)            15      30      10      20 

Total Cost* 
($,000's)                   20      30      10      15 

Output-to-cost 
ratio Q/C                 0.75    1.00    1.00    1.33 

Modified Cost- 
Effectiveness 
Ratio                     0.71    0.91    0.90    1.46 


* Outputs and costs are to be discounted to 
their present values. 


Actually, in this example, project 4 would have been 
selected even if the IS(j)'s had all been equal (it is not 
being pushed upwards by the non-quantifiable aspects A, B 
and C of the situation). However, if the realization of 
project 4 was barred because of some rigid constraint (e.g. 
political infeasibility) the method permits identifying the 
next alternative which would be project 2 in this 
case. 
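
The whole worked example can be retraced in a few lines. 
This is a sketch under stated assumptions: the variable names 
are hypothetical, and the scores, outputs and costs are a 
reading of Tables 2 and 3. Because the script does not round 
intermediate values, its results may differ from the printed 
table in the second decimal place. 

```python
totals = [127.5, 112.5, 110.0, 175.0]   # aggregate intangible scores
output = [15, 30, 10, 20]               # total output, physical units
cost = [20, 30, 10, 15]                 # total cost, thousands of dollars
d = 0.1                                 # influence conceded to intangibles

best, poorest = max(totals), min(totals)
midpoint = (best + poorest) / 2
half_range = (best - poorest) / 2

index = [1 + d * (t - midpoint) / half_range for t in totals]   # IS(j)
ratio = [q / c for q, c in zip(output, cost)]                   # Q/C(j)
modified = [i * r for i, r in zip(index, ratio)]                # M Q/C(j)

# Project numbers ordered from most to least preferred.
ranking = sorted(range(1, 5), key=lambda j: modified[j - 1], reverse=True)

for j in range(4):
    print(j + 1, round(index[j], 3), round(ratio[j], 3), round(modified[j], 3))
print("ranking:", ranking)
```

Removing a barred project from the ranking list leaves the 
next alternative at the head of the list. 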


SUMMARY AND CONCLUSIONS 


As pointed out, ranking alternatives according to 
several criterion dimensions by means of an aggregate 
score is widely practised. Given the faults which have been 
indicated in the procedures usually used, it is particularly 
unfortunate that the minor modifications suggested here are 
not regularly used. When it is recognized that the primary 
objective of an individual project is to produce tangible 
output rather than an intangible score, the importance of 
separating the intangible analysis from the tangible 
analysis is not hard to recognize. In fact, since the author 
suggested the use of a scaling technique suggested by 
others, the only original contributions of the article are 
the suggestion that tangibles be treated separately and the 
presentation of a way to do this. The correct role of the 
index of intangible suitability, too commonly used as the 
sole ranking criterion, is to serve as a modifier of the Q/C 
ratio. 

In a practical vein, if the reader basically accepts 
the approach suggested he may find it useful to have Steps 1 
to 6 as a guide in doing his own analyses. The actions 
suggested follow logically, but without some thought or 
experience with the technique suggested, one might end up 
doing computations or taking other actions that the 
following indicate as unnecessary. 


Step 1 Work out the Q/C ratios and, if there are 
no intangible factors to consider, the 
analysis is complete. The project with the 
best Q/C ratio is selected to be carried 
out. 


Step 2 Ask management to decide on the d value for 
the particular situation being considered. 
If the best project's Q/C ratio is more 
than (1 + d)/(1 - d) times the second best 
there is no need to go any further in order 
to find the most suitable alternative. The 
higher Q/C ratio cannot be "pulled" down to 
make it of a lower rank by any 
non-quantifiable influences being taken into 
account. 


Step 3 If criteria other than mere efficiency are 
deemed relevant, then list the 
non-quantifiable factors that are deemed 
relevant. Make sure these are truly 
choice-dimensions (i.e. pure criteria). Remove 
those not obviously called for (irrelevant 
or redundant) and those better expressed as 
constraints. 


Step 4 Establish the "within variables scoring" 
rationale. 


Step 5 Establish the between dimension weights and 
use these and the special formula to get 
the IS(j)'s. 


Step 6 Rank projects according to their modified 
cost-benefit ratios. 
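
The shortcut in Step 2 can be sketched as a simple test. The 
function name is hypothetical, and the Q/C ratios used in the 
example are illustrative values. 

```python
def intangibles_can_change_winner(qc_ratios, d):
    """Step 2 check on the two largest Q/C ratios.

    The best ratio can fall to best * (1 - d) at worst, while the
    second best can rise to second * (1 + d) at most. If the best
    still exceeds the second after these adjustments, no intangible
    scoring can displace it and the analysis can stop.
    """
    ordered = sorted(qc_ratios, reverse=True)
    best, second = ordered[0], ordered[1]
    return best * (1 - d) <= second * (1 + d)


# Illustrative Q/C ratios for four projects, with d = 0.1.
print(intangibles_can_change_winner([0.75, 1.00, 1.00, 1.33], d=0.1))
# Two projects whose ratios are close enough for intangibles to matter.
print(intangibles_can_change_winner([1.00, 1.10], d=0.1))
```

When the function returns False, Steps 3 to 6 are needed only 
if a ranking of the remaining alternatives is also wanted. 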


This will establish the ranking of the alternatives 
according to all criteria dimensions, output-quantifiable or 
not. But one consideration not raised previously is that, in 
practice, situations might arise in which the more 
cost-effective projects are not the largest ones. It may 
occur, for example, that in order to supply all user-days of 
recreation or in order to exhaust the budget, some of the 
lower ranking projects may have to be selected along with a 
best project. But this is a problem of budget allocation 
and the issues that arise are too numerous to consider here. 

In conclusion it seems important to reiterate that when 
decisions are to be made concerning the choice of projects 
for the allocation of funds, decision makers should examine 
all the factors involved, whether they are quantifiable or 
non-quantifiable. At the same time the major objective of 
developing a facility must not be lost; output is what an 
agency wants. The effectiveness of the agency is evaluated 
by just that. 

In addition, a certain amount of intuition about how 
the intangibles and non-quantifiable variables influence the 
output is desirable. In order to use this intuition 
effectively, a systematic procedure is needed. This paper 
has demonstrated one way in which a systematic integration 
of the multi-dimensional scoring method into the common 
cost/effectiveness analysis can be made. 

It is the last point that best typifies the type of 
analyses being suggested throughout. It is not really that 
analysis is never-ending, but rather that different kinds and 
depths of analyses may be used depending upon the problems 
being confronted. And, of course, initial analyses will 
suggest further questions. While unexplained variance, or 
more euphemistically, unknowns, will persist and call for 
judgement based on the experience of planners, it is certain 
that helpful guidance will stem from such quantitative 
study. 
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CORD STUDY TECHNICAL NOTE 26 


MEASURES FOR ASSESSING THE SIGNIFICANCE 
OF OBSERVED PARTICIPATION IN RECREATION ACTIVITIES 


J. Beaman and J.B. Leicester 


ABSTRACT 


At present, standards such as "x facilities per 
thousand people" are used as the basis for policy, and 
facility and program planning. However, these standards do 
not take into account such population characteristics as age 
and sex which directly influence the type of recreation 
programs required as well as facility design. Consequently, 
the reliability of the currently used standard or "general 
standard" is questionable. This paper presents an evaluation 
technique that considers the age-sex variable and postulates 
"Internal Standards" as a means of assessing the 
significance of participating in recreation activities. 

The Internal Standards are derived from recreation 
participation records available for existing recreation 
facilities and programs. Population characteristics, age 
and sex, as well as the general class of recreation activity 
(i.e. athletic, social, etc.) information must be available 
before rates of participation can be determined. It is 
claimed that the minimum level of data required for making 
decisions in a large-scale operation is often the kind 
generated by the procedure described. 


OBJECTIVE 


The objective of this paper is to present a means of 
assessing the significance of participation and frequency of 
participation in recreation activities in relation to the 
planning and management processes for recreation areas and 
facilities. 


INTRODUCTION 


The current application of recreation facility 
standards such as "x" space or facilities per thousand 
people, which is accepted and used as a basis for policy, 
facility, program and area planning, leaves much to be 
desired. This is so because the utilization of these 
standards fails to take into account such factors as a 
population's age and sex characteristics, programs 
available, and geographic accessibility of facilities, all of 
which have a marked effect on the type of recreation 
facilities and programs that will be used. Because of the 
abundance of work showing the importance of age, sex and 
other factors in explaining participation patterns no 
attempt is made to review this literature (probably the best 
known works are Volumes 19 and 20 of the O.R.R.R.C. Report). 

The general widespread use of the type of recreation 
facility standards referred to above can be partially 
explained by their availability as in the U.S. Department of 
the Interior's text, Outdoor Recreation Space Standards (see 
Reference 21). This and similar books are devoted to the 
listing of standards for recreation areas and facilities for 
a comprehensive assortment of outdoor activities. In such 
volumes, there is a mechanistic matching of facilities need 
to numbers of people. Of course for planning and management 
purposes one could go to the other extreme of elaborate data 
collections and analysis involving the use of factorial and 
cluster programs. However, interpretation of the results of 
such sophisticated and costly programs is generally beyond 
the means of the majority of recreation administrators and 
planners, and so the results go unused. 

What is needed is a relatively simple methodology to 
facilitate evaluation and planning decisions, and to present 
such is the business of this article. It is important to 
note that although this paper deals with "national" 
recreation management and planning, the approach can also be 
used in neighborhood, municipal, regional or provincial 
contexts. Actually, the data used in this presentation were 
taken from the Canadian Outdoor Recreation Demand Study and 
detailed documentation on them is provided in the CORD Study 
Data Documentation Volume. The data used were collected in 
1972 from each of 4000 individuals, ten years of age and 
over, on their participation in eighteen outdoor recreation 
activities. But the method of analysis presented here was 
developed by the authors in 1967 while conducting a study of 
participation at community centres in the City of Winnipeg. 
Rather than conducting a survey of Winnipeg's residents, 
existing participation data from community centres were used 
in generating assessment measures. 


COMPUTATIONS 


The rationale behind the assessment procedure presented 
is simple. Once the rudimentary procedure is understood, the 
relevance of a myriad of variations of the basic procedure 
to planning or to management problems becomes apparent. 
Also, how the appropriateness of different assessment 
measures depends upon the objectives of a given analysis can 
be easily explained when the mechanics of computing 
assessment measures can be ignored and the focus of 
discussion can be the "meaning" of the numbers computed. It 
is for these reasons that this section is devoted to 
introducing the computational procedure. 

Before presenting some information on obtaining the 
numbers in Table 1, it is convenient to give interpretations 
of some of these numbers both to aid one in reading Table 1 
and the discussions regarding it. So to proceed, one should 
look in Table 1 and find the following weighted survey 
results which relate to people's participation in outdoor 
recreation between 1 November 1971 and 31 October 1972: 


(1) 19 = Number of male Historic Site users 
age 10 to 19 for the Atlantic 
Provinces 

(2) 91 = Total number of visits to Historic 
Sites by 30 to 49 year old Western 
Canadian females interviewed 
(amount of use) 

(3) 59 = Number of 10 to 19 year old Atlantic 
Provinces males interviewed 

(4) 46 = Number of 30 to 49 year old Western 
Provinces females interviewed 


(5) 2.96 = Average number of Historic Site 
visits by 10 to 19 year old Atlantic 
Provinces males (amount/number of 
participants) 


(6) 5.05 = Average number of activities 
participated in by 30 to 49 year old 
Western Canadian females who 
participated in any of the 18 
activities considered (amount/number 
of participants) 


(7) 3S = Number of 30 to 49 year Ontario 
females who participated in at least 
one of 18 activities. 


(8) 259 = Total number of the 18 activities 
participated in by Quebec males 50 or 
more years of age. 


Reference to weighted survey results was made above 
because in the national survey from which data were obtained 
it was necessary to give different weights to different 
individuals to reflect the importance the individual should 
have. For details on the survey design and weighting one may 
see the CORD Study Data Documentation, specifically the 
chapter on national surveys. 

To understand what information on total numbers of 
participants and amount of participation means in Table 1 is 
simple when one considers historic site use or fishing. In 
the case of these two activities the survey data include the 
answer to the question "How many times did you participate 
in the activity X?" (where X is one of a list of 18 
activities). If a person indicated some participation, he is 
counted as a participant, and to obtain the number of 
participants one simply adds up the weights for all of the 
participants. The total amount of 
participation for people in a certain category (e.g. age-sex 
group) is obtained by multiplying the number of times that 
each individual indicated he participated by the weight that 
that individual has and adding up these products for all 
individuals in a certain category. For those who are not 
familiar with weighting of data, it is easiest to think of 
each individual as having a weight of 1 so that what is 
being added up is the number of times that the individual 
says they participated. 
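
A minimal sketch of these weighted totals follows. The 
function name and the sample records are hypothetical; each 
record is taken to hold an individual's survey weight and 
the number of times that individual reported participating. 

```python
def weighted_totals(records):
    """Weighted participant count, amount and amount per participant.

    records is an iterable of (weight, times_participated) pairs for
    the individuals in one category (e.g. one age-sex-region group).
    """
    records = list(records)
    # Number of participants: sum of weights of those who participated.
    participants = sum(w for w, times in records if times > 0)
    # Amount of participation: weight times reported number of times.
    amount = sum(w * times for w, times in records)
    average = amount / participants if participants else 0.0
    return participants, amount, average


# Hypothetical respondents: (weight, times participated).
sample = [(1.0, 3), (2.0, 0), (0.5, 4), (1.5, 2)]
print(weighted_totals(sample))
```

With every weight set to 1 the first two results reduce to a 
head count of participants and a plain sum of reported 
occasions. 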
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TABLE 1 


INFORMATION ON NUMBERS OF PEOPLE INTERVIEWED, 
NUMBERS PARTICIPATING IN THREE CLASSES OF ACTIVITIES 
AND AVERAGE AMOUNT OF PARTICIPATION PER PARTICIPANT 

( AMOUNT/ PARTICIPATION ) 


           Number of Participants        Amount of Participation 

Age              Region                         Region 
Group Sex  Atl. Que. Ont. West B.C.      Atl. Que. Ont. West B.C. 
HISTORIC SITE USE 
M 19 62 99 27 45 S67 erst 9313) 9.2325 e129 
F 19 46 109 19 35 169 124 393 67 107 
FISHING 
10-19 M A232. 1A0Pe1152 29 70 857 1003 2428 398 888 
F 10 60 68 20 31 65 428 794 103 138 
18 ACTIVITIES GROUP 
M 56 179 216 49 108 402 1309 1746 396 8:15 
F 42 171 189 52 101 250 1088 1340 410 704 
HISTORIC SITE USE 
M 10 33 65 11 33 54 401 328 54 134 
F 15 40 54 16 29 72° 185 %°230 48 181 
FISHING 
20-29 M 16 57 60 22 33 116 494 763 152 271 
F af 21 37 8 24 32 79 323 20 141 
18 ACTIVITIES GROUP 
M 50°, 101 si24 33 69 159 628 784 172 417 
F 34.'107' 125 25 67 157 506 762 151 326 
HISTORIC SITE USE 
M 16 47 71 19 25 78116" 322 “£50 121 
F 10 54 716 19 39 37 208 277 91 150 
FISHING 
30-49 M 30 56 83 23 39 318 426 615 267 493 
F 11 29 40 7 20 143 195 261 43 179 
18 ACTIVITIES GROUP 
M 39 121 166 46 74 183 567 825 235 383 
F 32 157 166 39 83 118 601 785 197 383 
HISTORIC SITE USE 
M 7 20 49 17 19 20 56 306 162 SS 
F 16 30 45 13 28 325¢219 44199 55 123 
FISHING 
50+ M 8 22 50 16 30° "129" 1 46"GS6S422Y Sie 
F 0 9 8 iy 7 8) 70 92 109 40 
18 ACTIVITIES GROUP 
M 25 82 131 38 68 82 252 435 147 291 
OF 31 95 114 36 62 89 277 351 12119 216 
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TABLE 1 (contd) 


           Amount/Participation          Number Interviewed 

Age              Region                         Region 
Group Sex  Atl. Que. Ont. West B.C.      Atl. Que. Ont. West B.C. 


HISTORIC SITE USE 

H 2696 2444 3616 8659 286 

S 8-89 89 3060 3653 3-05 

FISHING 

10-19 F 2004 9.9 15.697 13.72 12-68 59 191 210 49 110 

605 Tell 11¢67 5015 448 43 177 196 53 101 
18 ACTIVITIES GROUP 

1 °$67618 74687 23608, 78.0877\7-45 

i 5e95 6624 4025 7e89 697 
HISTORIC SITE USE 

H Sel 12615 5-04 491 4206 

S 468 40.62 42e25 3-0 6624 

FISHING 

20-29 F 10637 859 12e71 6690 8621 32 110 129 33 71 

| 4657 §3676 °° 8072° 25 5e89 39 113 140 26 69 
18 ACTIVITIES GROUP 

1 4663 6¢e21 632 5e21 6-04 

Gi 4¢€59 1446 6609 6204 5-91 
HISTORIC SITE USE 

H 4c69 2634 4653 7-89 4-84 

S §367 3084 3664 4.79 3-85 

FISHING 

30-49 F 10-6 726 7039 11-61 12264 44 144 180 48 $0 

13-9 4096 6e52 6014 82695 44 177 230 46 100 
18 ACTIVITIES GROUP 

1 40669 4ce68 4-696 50e11 5017 

i 3669 3-79 4.673 5-05 4.61 
HISTORIC SITE USE 

H 2e85 28 6024 92653 3011 

S +220 Tel 4442°"4.23" +4.39 


FISHING 
50+ F 15e¢33 6663 70612 13281 17227 44 1U1lt 177 49 87 
0 TETIVLUSS. *L5eS7) *Se TF *. 46°*192°*195 43 85 


18 ACTIVITIES GROUP 
1 3024 3016 3Be32 BeBT 4024 
i 2087 2-91 3-07 3e31 3-44 


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Age 
Group 


10-19 


20-29 


30-49 


50+ 
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<x 


i 


TABLE 2 


PER CAPITA PARTICIPATION AND AMOUNT RATES, 
AND NORMALIZED PER CAPITA PARTICIPATION AND AMOUNT RATES 
(BASED ON DATA IN TABLE 1) 


Per capita Number of              Per capita 
Participants                      Participation 
Region                            Region 
Atl. Que. Ont. West B.C.          Atl. Que. Ont. 


032 
0 44 


ell 
023 


095 
098 


eJl 
e 38 


e50 
018 


094 
e 87 


036 
e23 


e68 
e25 


089 
073 


e16 
e35 


HISTORIC SITE USE 


e32 0647 4.55 «41 e95 
026 e56 e36 e335) —3e93 
FISHING 
e053 e722 0659 264 14653 
e394 2035 e338 31 1-51 
18 ACTIVITIES GROUP 
094 1203 1-00 98 6281 
e397 =¢9606«6298 14200 «5681 
HISTORIC SITE USE 
e030 «50 0¢33 «46 1469 
e035 e39 e662 642 1485 
FISHING 
e552 e47 ce67 246 3e63 
e19 226 31 «35 e 82 
18 ACTIVITIES GROUP 
e92 096 1e00 ¢«S7 4297 
e95 e689 «96 1-00 4203 
HISTORIC SITE USE 
e933 =8063D 66400 le 31S 1270 
e331 e377 «41 «39 e 84 
FISHING 
e399 8046 2648 049 7e23 
e16 e20 0615 e¢20 3e25 
18 ACTIVITIES GROUP 
e84 .92 .96 eS3 4-16 
e8S e882 085 e&3 2-68 
HISTORIC SITE USE 
iS e20,F 3990 e222 e 45 
e23 026 82630) 239 270 
FISHING 
e20 028 e333 034) 2280 
007 «05 616 .08 0O 
18 ACTIVITIES GROUP 
e774 74 678 «78 1286 
eJ2 e65 e884 ¢73 1493 


CHAPTER; VIIi 


0J9 
270 


West BeCe 
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Normalized Normalized 


           Per capita Number of          Per capita 
           Participants                  Participation 

Age              Region                         Region 
Group Sex  Atl. Que. Ont. West B.C.      Atl. Que. Ont. West B.C. 


HISTORIC SITE USE 
M 1585 w5S 1485) 41400;0675. 0162096447! 9431945 0000625 
F Dia) +46 1.001) .64 92:62 4400) 448 74519432) ) 227 
FISHING 
10-19 M 298 . teffGe'1s003,. @S2 94°88!) 44:00") 436. 14279505560 "45S 
F 060 «89 .92 1-00 «81 637 «60 1-00 .48 .34 
18 ACTIVITIES GROUP
M 292)? O91) 2000: 299 @295 4/282) 2:8201.'1001') 16 979).%89 
F 2982.97 3 @~D65698891.00) (275'> 79 @©88 44.00i v.90 
HISTORIC SITE USE 
M 062 0e60 1-00 66 2982 0461-00 .69 «45 52 
F 61. 056 063. 1000 ..¢667 «TD, 0-62 062, eT 1.00 
FISHING 
20-283 M e753 =e 77——i ce 70: 1000 «826 68) «=e 610 0S ee 76 10200 «22 78 65 
F WS5ioomS4 274 > WSS) 4600: w35: G0! 1-00: -33)¥ «88 
18 ACTIVITIES GROUP 
M 094 082 0961-00 297 82 094 1200 86 .96 
F 087 e895 e89 096 1600 069 e077 «94 1-00 98 
HISTORIC SITE USE 
M e980 «e882 06987 1600 0677 0654 024 55 1-00 48 
F 056i ae 7614 290 2000 6 60S, ©4259 %59%5468 45:00, 2076 
FISHING 
30-49 MALL eo OO Ae De. «O72 70 wI2 dkew00 wAlgi ith 77. w 85 
r 1.00 nie64. BOG «60 ~80i 1-00) | 2 347440) 1 28) 1455 
18 ACTIVITIES GROUP 
M e930 e 87 «  w96 (4200! 297 «85 9 @ 8011693 4200." .98 
F a LeDO eee” 695) eS. we GS. elo aieh0L aeDD oie 80 
HISTORIC SITE USE 
M 246. eSteeheO? 6002 «63... 013. p85. 4052. 4,00' 12.20 
F 1.00. 2661) 6 1400686 @94 043 12.00) lw7t) FQ’. 90 


FISHING 
SO+ M 053 «60¢58) «60820 9T 1200 8206470 86220 6 0S4 6676 1-00 
F O 043 «31 1-00 -50 O 0o21 621 1-00 18 


18 ACTIVITIES GROUP 
e£5 095 1200 12-00 2-55 «70 e74 «90 1-00 
80 e866 0771200 «87 e¢70 «76 073 1-00 92 


A problem in interpreting the information in Table 1
has to do with the "18 activities group". It is important to
know what is meant by this variable. This variable was
created by determining how many of the 18 activities each
individual participated in at least once. Therefore, a
variable was created that has values between 0 and 18, with
0 indicating no participation in any of the 18 activities
considered and 18 indicating participation in all of the
activities considered. In the case of this variable,
participation information indicates the proportion of the
population that participated in at least one of the 18
activities (had a non-zero value for this particular
variable). The amount information for this variable was
generated by multiplying the weight for an individual by the
number of activities that the individual actually
participated in at least once. So the grand total amount is
the weighted total number of activities in which all
individuals participated. This explains the interpretation
given to items six, seven and eight of the definitions for
Table 1 which were given earlier.
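The construction just described can be sketched as follows. This
is only a minimal illustration: the record layout (a weight plus
the set of activities done at least once), the names Respondent
and group_summary, and the three-person sample are assumptions
made for the example and are not taken from the CORD data files.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Respondent:
    weight: float          # expansion weight of the interview (assumed field)
    activities: frozenset  # activities done at least once in the survey year


def activity_count(person: Respondent) -> int:
    """Value of the '18 activities group' variable for one person (0 to 18)."""
    return len(person.activities)


def group_summary(sample: list) -> dict:
    """Weighted participation share and weighted amount for the variable."""
    total_weight = sum(p.weight for p in sample)
    if total_weight <= 0:
        raise ValueError("sample must carry a positive total weight")
    # Participation: weight of everyone with a non-zero activity count.
    participants = sum(p.weight for p in sample if activity_count(p) > 0)
    # Amount: each person's weight times the number of activities done.
    amount = sum(p.weight * activity_count(p) for p in sample)
    return {
        "participation_share": participants / total_weight,
        "weighted_amount": amount,
    }


# Hypothetical three-person sample, for illustration only.
sample = [
    Respondent(2.0, frozenset({"Fishing", "Hunting"})),
    Respondent(1.0, frozenset()),
    Respondent(1.0, frozenset({"Fishing"})),
]
summary = group_summary(sample)
```

In this hypothetical sample three of the four weighted persons
have a non-zero count, and the weighted amount is the sum of
weight times activity count over all persons.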

Two technical points deserve mention for those who may
choose to work with the data used to perform this analysis.
First, one will find that some people did not respond
properly to the questions asked and consequently there are
codes that do not correspond exactly to a certain number of
times that a person participated in activity X in the year
before the survey. These codes indicate responses such as "I
participated several times" or "I participated monthly".
Such codes were translated in a systematic way that is
documented in the document section of the SPSS file for the
data. The file is available from the LEISURE STUDIES DATA
BANK, Waterloo Research Institute. The other point is that,
for the purposes of the record, the 18 activities on which
the 18 activities group variable was based are:


 1. Tent Camping        10. Sightseeing - private vehicle
 2. Trailer Camping     11. Snow Skiing
 3. Pickup Camping      12. Snowmobiling
 4. Hunting             13. Picnicking/Cooking Out
 5. Power Boating       14. Walking/Hiking
 6. Canoeing            15. Ice Skating
 7. Sailing             16. Horseback Riding
 8. Bicycling           17. Visiting Historic Sites
 9. Fishing             18. Driving For Pleasure


In themselves the numbers in Table 1 mean little since
they depend on the size of the sample. For example, numbers
like (3) and (4) defined earlier are numbers to which a
total number participating must be related. To correct the
"totals" so that they reflect the distribution of
population, a second group of numbers must be calculated.
Here these rates are called per capita attendance
coefficients (PAC) and are computed by dividing numbers in
the upper part of Table 1 by corresponding numbers
interviewed from the lower right section of Table 1. One can
see that the numbers in the upper part of Table 2 (such as
the Per capita Frequency for Historic Site Use by western
females 20-29) are obtained by dividing 48 from the upper
right of Table 1 into 26 from the lower right.

The coefficient produced in this way is referred to as
a per capita participation rate if it is based on
participation figures, while if it is based on amount of
participation it is referred to as a per capita amount rate.
In terms of the numbers presented earlier, one should not be
left with the impression that a per capita rate of
participation reflects the behaviour of an "average"
individual. The participation information shows that
varying proportions of the population in different age and
sex groups in different regions participate. Average amount
numbers like (5) and (6) show that there is often a high
level of participation by the participants in an activity,
five or more times per year, while one knows from the other
information that less than half the population participates
in that activity. Obviously, an average number of
participations per person in the population gives a very
deceptive picture of how participation is distributed over
the population. There are obviously enthusiasts for
particular activities whose 100 participations per year can
become confused in an average with one participation by each
of a hundred individuals or with two participations by 50
individuals, etc. One may note that one Canadian Outdoor
Recreation Demand Study project carried out by Romsa (see TN
10, Table G3) indicates that many Canadians are
"non-participants" in outdoor activities while many others
do little else than drive for pleasure. The 18 activities
variable conveys similar information by showing that a very
large part of the Canadian population participated in only
one of the 18 activities considered during the year
preceding the survey.
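The two per capita rates, and the way an average can hide how
participation is distributed, can be illustrated with a short
sketch. The function name per_capita_rate and all of the counts
below are hypothetical; they are chosen for easy arithmetic and
are not drawn from Table 1.

```python
def per_capita_rate(total: float, interviewed: float) -> float:
    """Divide a Table 1 style total by the number of people interviewed."""
    if interviewed <= 0:
        raise ValueError("number interviewed must be positive")
    return total / interviewed


# Hypothetical cell for one age-sex group in one region.
interviewed = 60    # people interviewed in the cell
participants = 30   # people who took part at least once
occasions = 120     # total number of participations reported

participation_rate = per_capita_rate(participants, interviewed)  # 0.5
amount_rate = per_capita_rate(occasions, interviewed)            # 2.0

# The per capita amount rate averages over everyone in the cell;
# the people who actually take part do so more often than that.
occasions_per_participant = occasions / participants             # 4.0
```

Here the per capita amount rate of 2.0 describes nobody in
particular: half of the cell did not participate at all, and
those who did averaged four occasions each.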

One can see from the top part of Table 2 that in terms
of participation in an activity the degree of service to
each age-sex group in each region varies greatly. The
distance participants must travel to historic sites, for
example, varies in different areas of Canada as does access
to fishing areas or the availability of many of the 18
activities considered. Socio-economic characteristics are
also different in the various regions of Canada and these
differences may affect interest or financial ability to
participate in certain activities. These and other factors
lead one to expect and accept the variation in per capita
rates from age-group to age-group and from region to region.
It is these kinds of "internal" factors that are of interest
to recreation planners who need a standard against which to
plan. Also, the information conveyed by the variance in per
capita coefficients is information that may be used by
managers in assessing the effectiveness of a program or the
viability of a proposed plan.

However, the per capita figures are not the most
convenient figures to use in making comparisons which often
should be made in planning. By looking at the top part of
Table 2 and the bottom part of Table 2 one will see that the
simple operation of normalizing the per capita coefficient
to the best coefficient for a particular age-sex group makes
it much easier to tell at a glance what is going on in a
given region. For example, one can see that Ontario or
Western Canada consistently set the standard for
participation (have a coefficient of 1.00) for males and
females for the different age groups, but Quebec and the
Maritimes sometimes set the standard in terms of frequency.
Of course, Quebec City and Montreal residents visit a
historic site as a regular part of a visit to downtown.

Regardless, the lower part of Table 2 is the focus of
attention here for reasons that are made clear subsequently,
so the ratio of main concern in this paper is the relative
attendance coefficient, RAC. As just noted, to get the RAC
the PAC coefficients are normalized by dividing the best
participation rate for a given age-sex activity group into
the similar coefficients for other regions. Consequently, a
group from a particular geographic area is compared to a
standard that is set by people elsewhere (or themselves)
with similar age-sex characteristics; thus an "Internal
Standard" is defined. The "Internal Standard Approach" is in
fact the name given to the methodology described here when
first developed by the authors.
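The normalization that turns PAC values into RAC values can be
sketched as below. The function name relative_attendance and the
PAC values are hypothetical; only the operation itself (dividing
each region's coefficient by the best coefficient for the same
age-sex activity group) follows the description above.

```python
def relative_attendance(pac_by_region: dict) -> dict:
    """Normalize the PACs of one age-sex activity group to the best region."""
    best = max(pac_by_region.values())
    if best <= 0:
        # Nobody participated anywhere, so no region sets a standard.
        return {region: 0.0 for region in pac_by_region}
    return {region: pac / best for region, pac in pac_by_region.items()}


# Hypothetical PACs for a single age-sex group and activity.
pac = {"Atl.": 0.30, "Que.": 0.45, "Ont.": 0.60, "West": 0.60, "B.C.": 0.15}
rac = relative_attendance(pac)
```

The region (or regions) with the best PAC receives an RAC of
1.00 and so sets the internal standard; every other region is
expressed as a fraction of that standard.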


FURTHER DEFINITION OF ASSESSMENT MEASURES 
AND RELATED CONSIDERATIONS IN USING THE MEASURES 


The preceding section of this paper has presented a
methodology for computing measures based both on
participation in activities and amount of participation. One
may wonder why both of these should be considered.
Certainly, it cannot, in general, be concluded that
considering both is redundant, as already illustrated when
regions that set a participation standard do not set the
frequency standard. If the objective of an analysis is to
understand the loading of facilities and there is not any
particular concern with the equity (access by all the people
in an age group) with which this loading takes place, then
information on the amount of participation is certainly
adequate for analysis.

However, consider for example that the participation
information really amounts to presenting information on what
percentage of the people in a given age-sex group, in a
given area, participate at all in an activity (or in a
collection of activities). Now, if 10% of the people
participate in an activity, even if this results in one
participation per capita (based on the amount figures),
there is not equity in the sense that there is a broad base
of participation in the activity or class of activities (see
TN 32 for another approach to considering equity). To cite
one further example, just because the number of visits to
National Parks in a year is in the millions does not mean
that a high percentage of Canadians are using national
parks. Because the number of park visits by residents of the
province of Ontario is very large does not mean that a high
percentage of Ontario residents are using Ontario parks,
etc. This is particularly so when such factors as the repeat
use of facilities by scouts, church groups, etc. result in
some people having very high participation levels, some
people having nominal participation levels, some
participating only a little and some not at all. Information
on amount of participation can be extremely misleading (as
in fact can information on participation at all).

The point that should be clear to the reader now is
that in any analysis that is carried out the questions that
must be asked are: What is to be evaluated? What is an
appropriate way to evaluate it? Should it be evaluated by
determining what percentage of the population participates
in an activity? Is it more important for the objective of a
given analysis to know about the amount of participation?
Are both important, and if they are, how will results on
these two factors be combined?

It is only through asking these kinds of questions that
one will avoid the ad hoc use of an evaluation methodology
possibly resulting in answering the right question with the
wrong information, or answering a variety of questions with
information that is quite irrelevant to what should really
be done. As is indicated subsequently, there are cases when
the RAC coefficient should not be used, but rather one
should consider how participation is changing with age.
Obviously when RACs are computed age effects are eliminated
from consideration. So, if understanding age effects is the
objective, one should not use the kind of analysis endorsed
here and conclude because the RACs do not vary with age that
everything is fine as one moves from one age group to
another. The method described has been chosen so that
differences in participation by the different age groups can
be ignored. This was done because such a policy orientation
is relevant in a broad class of problems. The policy or
planning matter of concern in such cases is whether people
in the same age group in different areas are being served in
an equitable way.


TRANSLATING TO PLANNING SCORES 


As implied in the last section, recommendations to
promote action in the programming or development of
facilities can be made for age-sex groups in a specific
locale by using the RAC. However, the difference between an
RAC of .7 and one of .6 may be of no practical significance
given the reliability of RAC coefficients, the accuracy of
planning, the accuracy of policy formulation or for other
reasons. The point here is that information should not be
presented in more detail than is necessary or useful. Table
3 was prepared to reproduce the bottom part of Table 2 in as
much detail as the authors thought was necessary in most
planning. They believe there should rarely be the need for a
final display of RAC for planners or management to involve
more than a 3 or 4 point scale (A, B, C or A, B, C, D). To
present RAC in two significant figures, even if they are
accurate, is presenting more information than is useful in a
given program or facility planning exercise or in most
program assessment and proposal evaluation exercises.

It should be clear that the RAC values defining the A,
B, C, and D standards must depend on the particular
situation being considered as well as the statistical
concern that allows for the range in the RAC values.
Clearly, there should be a wide leeway in how these values
are defined. The planner may wish to express the fact that a
general low level of cultural activity is deserving of
priority consideration over actions in relation to another
activity.

In terms of the results presented in Table 3, the
following points should give the reader a practical insight
into what was being alluded to above.

The scores in the lower part of Table 2 have been
translated into A, B, C and D ratings in a way consistent to
some extent with their variation so that at a glance one can
see what is occurring. The scale for transformation is given
in Table 3. The reason that a special scale is used for the
eighteen activity group participation results is to make it
clear that in this case an A score is considered to be any
score above .8: variation between about .8 and 1.00 is not
seen as being very significant. No behavioural basis really
determined the choice of 100 to 70 as an A score for
participation, nor was 100 to 75 chosen as an A score for
frequency for any behavioural reason. However, the
difference between the participation scale and the frequency
scale was chosen intentionally so that with the frequency
information one would be aware of the very broad range of
variation of the frequency coefficient.
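The translation from RAC values to letter scores can be sketched
as a simple lookup. The band boundaries below are the lower
limits given in the scale of Table 3; the function name score,
the constant names, and the choice to round an RAC to a whole
percentage before looking it up are assumptions made for this
illustration.

```python
# Lower bound (in percent) of each letter band, highest band first.
PARTICIPATION_SCALE = [(70, "A"), (40, "B"), (0, "C")]        # historic sites, fishing
GROUP_PARTICIPATION_SCALE = [(80, "A"), (0, "B")]             # 18 activities group
FREQUENCY_SCALE = [(75, "A"), (50, "B"), (25, "C"), (0, "D")]  # all three measures


def score(rac: float, scale: list) -> str:
    """Translate an RAC between 0 and 1 into a letter score."""
    percent = round(rac * 100)  # assumed: RACs are quoted to two decimals
    for lower_bound, letter in scale:
        if percent >= lower_bound:
            return letter
    raise ValueError("RAC must not be negative")


# Hypothetical RAC values, for illustration only.
examples = [
    score(0.72, PARTICIPATION_SCALE),
    score(0.45, PARTICIPATION_SCALE),
    score(0.79, GROUP_PARTICIPATION_SCALE),
    score(0.20, FREQUENCY_SCALE),
]
```

Keeping each scale as a small table of lower bounds makes it
easy for a planner to substitute other band limits, which is in
keeping with the wide leeway the authors allow in defining them.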

No more effort was spent on defining these translated
scores so they would have meaning because the high level of
geographic aggregation means that the coefficients do not
mean too much in terms of behaviour. Obviously people's
access to historic sites varies drastically within western
Canada, the Atlantic provinces, in fact all the regions
considered. The same is true of people's access to
opportunities to fish. A level of aggregation which focused
attention on areas within which travel is reasonably
feasible would result in coefficients which could be
considered much more meaningful because then people would be
related to the supply accessible to them. The material
presented here was for illustration purposes, not as input
to policy makers or as an example of the "level" at which
analysis should be carried out.


STATISTICAL AND DATA GATHERING CONSIDERATIONS 


In discussing the introduction of A, B, C, D scores it
was mentioned that one reason for using such scores was not
TABLE 3


NORMALIZED PER CAPITA PARTICIPATION
AND AMOUNT RATES (OF TABLE 2)
TRANSLATED TO "NEED FOR ACTION SCORES"


             "Translated" Normalized    "Translated" Normalized
             Per capita                 Per capita
             Participation Rates        Amount Rates
Age          Region                     Region
Group  Sex   Atl. Que. Ont. West B.C.   Atl. Que. Ont. West B.C.

HISTORIC SITE USE
       M     B B A A A D D C A C
       F     A B AD B B A D B C C
FISHING
10-19  M     A A A A A A C A B B
       F     B A A A A C B A C C
18 ACTIVITIES GROUP
       M     A A A A A A A A A A
       F     A A A A A
HISTORIC SITE USE
       M     E B A B A C A B C B
       F     B B B A B B B B B A
FISHING
20-29  M     A A A A B B A A A B
       F     B B A A A C A C A
18 ACTIVITIES GROUP
       M     A A A A A A A A A A
       F     A B A A A A
HISTORIC SITE USE
       M     A A A A A B D B A C
       F     B A A A A C B B A A
FISHING
30-49  M     A B B A A A G C A A
       F     A B A B A A C C C B
18 ACTIVITIES GROUP
       M     A A A A A A A A A A
       F     A B A A A A
HISTORIC SITE USE
       M     B B A A B D D B A D
       F     A B A A A C A B A A
FISHING
50+    M     B B A A A C D C A A
       F     C B C A B D D D A D
18 ACTIVITIES GROUP
       M     B A A A A B B B A A
       F     A A B A A B A B A A
TABLE 3 (contd)


The following scores define A, B, C, D for participation:

(1) Historic Site Use; and
(2) Fishing:

    100 - 70 = A
     69 - 40 = B
     39 -  0 = C

(3) "18 Activities Group":

    100 - 80 = A
     79 -  0 = B

For frequency:

(1) Historic Site Use;
(2) Fishing; and
(3) "18 Activities Group":

    100 - 75 = A
     74 - 50 = B
     49 - 25 = C
     24 -  0 = D


to exaggerate the apparent accuracy of the coefficients that
were calculated. Actually, in raising the issue of
statistical accuracy two matters are important. For the
moment these may be identified as:


(1) What cost is acceptable in carrying out a study? 


(2) What accuracy is needed? 


These two considerations are interrelated. If a survey
is being carried out, or even if information is being
collected through the use of administrative records so that
evaluation measures can be computed, cost increases as the
amount of information collected and/or processed increases.
If the acceptable cost for data collection is too low to
produce worthwhile results, then a project should not be
initiated. On the other hand, one of the most reprehensible
practices now followed is that of establishing a budget for
a project without consideration of what accuracy is needed.
Someone decides that $40,000 is necessary to collect data
and the number of interviews to be carried out is determined
on the basis of having $40,000.

Part of the purpose of having a good statistical design
for a survey is in seeing that the objectives of the survey
are achieved at a minimum cost. If the kind of analysis
described in this paper is to be carried out and if the
planners are willing to accept certain error, then the
objective of the survey design is to see that information is
collected with an accuracy that is compatible with the
planning needs. It may be necessary to carry out the survey
as a "pre-test" to see what kind of accuracy can be
obtained. The important step is to be able to state clearly
how far one is willing to go from the quantitative RAC
scores to an A, B, C, D type of scoring that is adequate for
planners and thus to be able to state what accuracy is
really needed.
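The link between accuracy and cost can be made concrete with a
rough sketch. It uses the textbook sample size formula for
estimating a proportion under simple random sampling, which is
an assumption introduced here and not a design used in the CORD
Study; the function names and the cost of $25 per interview are
likewise hypothetical.

```python
import math


def interviews_needed(margin: float, p: float = 0.5, z: float = 1.96) -> int:
    """Interviews per cell to estimate a proportion within +/- margin.

    Assumes simple random sampling; z = 1.96 corresponds to roughly
    95% confidence and p = 0.5 is the most conservative choice.
    """
    if not 0 < margin < 1:
        raise ValueError("margin must lie strictly between 0 and 1")
    return math.ceil(z * z * p * (1 - p) / margin ** 2)


def affordable_interviews(budget: float, cost_per_interview: float) -> int:
    """Number of interviews a fixed budget buys (the practice criticized above)."""
    return int(budget // cost_per_interview)


needed_fine = interviews_needed(0.05)    # accuracy of +/- 5 points
needed_coarse = interviews_needed(0.10)  # accuracy of +/- 10 points
bought = affordable_interviews(40_000, 25.0)  # hypothetical $25 per interview
```

Starting from the accuracy that an A, B, C, D scoring actually
requires and working towards the number of interviews, rather
than starting from the budget, is the order of reasoning the
authors argue for.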

While these considerations are important to the planner
and administrator, there are others that must be of
importance to the survey designer or research person
responsible for seeing that data are collected. Here there
is the problem of bias arising due to lapses of memory.
There are other matters that need to be considered in the
design of questionnaires and in administering
questionnaires that have to do with creating a disposition
that will cause a person to respond correctly to a question.

One important point to note regarding statistical
concerns and the difficulties that arise when information is
obtained by surveys is that, as indicated earlier,
administrative records may be the source of information for
the kind of analysis described. Whether it is parks
organizations that process information from hunting licences
or Parks Canada that processes information obtained from
some kind of permit, there are distinct advantages to using
administrative information. In many cases it is quite clear
what a permit entitles a person to do. Or, if the issue of
concern is not whether the person carried out the activity
but simply that he was entitled to carry it out, then
administrative records can be particularly good. There is a
whole other class of administrative records which in fact
are records of people's participation in certain activities,
and these records are ideal information on which to base
analysis of amount of participation. The problem of recall
bias is eliminated when such records are used and the only
problem that remains is that of capturing all participation
of a given type at facilities of concern (if in fact the
concern is with all participation in a given activity or
class of activity).


OTHER CONSIDERATIONS IN FORMULATING 
AN ANALYSIS CONSIDERING OBJECTIVES 


The preceding discussion has treated participation in
an activity as the "object of interest". However, to take a
particular example, not only does the Federal Government
operate historic sites; there are provincial historic sites
and there are municipal and local as well as some private
historical sites or displays. If different organizations or
agencies are responsible for programs, it may be desirable
to consider what proportion of the RAC is accounted for by
activities sponsored by each agency. Such an approach might
in fact be useful to the agency in evaluating the degree to
which its dollars are spent as effectively as the dollars
of some other group involved in the same program area. Why
should one not ask what percentage of the market for use of
historic sites goes to National Historic Sites rather than
to provincial or other historic sites? At the same time, why
should the relative budgets not be considered to look at the
relative efficiency of investment? The preceding is taken as
an example not because it is a particularly good example,
but because many people may immediately react to it from a
policy perspective. They may ask themselves if a National
Historic Site really provides the same commodity as a
provincial site or they may ask themselves some other
relevant questions. Similarly, in a municipal situation it
is important that evaluations not proceed at face value and
result in criticism of an organization that works in a
problem area because it incurs a higher cost for the amount
of participation that it encourages whereas a municipal
agency appears to offer the same program at a relatively low
cost. Evaluation is a tricky business. Evaluation based on
a methodology being used without the insight derived from
careful comparison of project objectives and project targets
can be a destructive influence that calls something a
failure when in fact it is a success.

Turning to a completely different matter, one should
note that the coefficients suggested here are not the only
ones that can be developed from the kind of data used. Many
of the readers have no doubt already scanned the upper part
of Table 2 and noticed how the per capita coefficients drop
off with age. In many situations this drop off is justified
because, as suggested earlier, any "discrimination" against
age groups is a matter of policy. However, it is not
inconceivable that an organization, in examining its policy,
may wish to look at the relative situation of different
groups over various ages in different geographic areas. In
this case coefficients similar to the RAC coefficient could
be developed by simply normalizing, that is, by dividing by
the largest PAC coefficient regardless of the age or sex
group for which it occurred. This might be appropriate in
the case of a national health and welfare program for which
the objective was encouraging some kind of physical activity
within a certain range of activities. In this case a
variable like the 18 activities variable might be generated
for several fitness activities and the results of the
analysis would be meant to focus on the success of involving
the Canadian population with respect to their being active
in at least one of these groups of activities.
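The difference between the two normalizations can be seen in a
small sketch. The function names and the PAC values are
hypothetical; the first function reproduces the within-group
standard used for the RAC, and the second divides by the largest
PAC regardless of age or sex group, as suggested above.

```python
def normalize_within_group(pac: dict) -> dict:
    """RAC style: divide by the best PAC among regions of the same age-sex group."""
    best_by_group = {}
    for (age, sex, _region), value in pac.items():
        key = (age, sex)
        best_by_group[key] = max(best_by_group.get(key, 0.0), value)
    return {
        k: (v / best_by_group[(k[0], k[1])] if best_by_group[(k[0], k[1])] > 0 else 0.0)
        for k, v in pac.items()
    }


def normalize_overall(pac: dict) -> dict:
    """Alternative: divide by the largest PAC over all ages, sexes and regions."""
    best = max(pac.values())
    return {k: (v / best if best > 0 else 0.0) for k, v in pac.items()}


# Hypothetical PACs keyed by (age group, sex, region).
pac = {
    ("10-19", "M", "Ont."): 0.8,
    ("10-19", "M", "Que."): 0.4,
    ("50+", "M", "Ont."): 0.2,
    ("50+", "M", "Que."): 0.1,
}
within = normalize_within_group(pac)
overall = normalize_overall(pac)
```

With the within-group standard the older Ontario males score
1.00, so the drop off with age disappears from view; with the
overall standard the same group scores 0.25, which keeps the age
effect visible.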

When one starts to broaden out the range of
consideration along the lines indicated in the last
paragraph, one often begins to think of the desirability of
knowing whether or not a person actively participated in an
activity. By careful definition of what is meant by being a
regular participant it would be possible to get answers to
the question "Did you participate regularly in activity X?"
With the response to such a question one could prepare a
table that would give a profile of people in terms of what
they do regularly rather than what they may have happened to
do at least once during a year. One could elaborate on the
18 activity kind of analysis to see where, between the
extreme indicated by amount of participation and the
minimum indicated by participation, people's regular
participation lies. But unfortunately, pursuing any of these
ideas becomes the topic for another paper. Also, pursuing
them without particular objectives results in much obtuse
and hypothetical comment. So, the authors consider it more
important to terminate this section, hoping that some of the
ideas presented allow the reader to see the broad range open
for applying, modifying and extending the analysis technique
introduced here.


THE ROLE OF THE PLANNER AND/OR MANAGER 


The preceding material has been clear about the
importance of objectives in deciding how the methodology
proposed in this paper should be applied and regarding what
data should be used. However, what it has not stressed is
the importance of the planner and/or manager, not so much as
a specifier of objectives but as an active and important
element contributing to the actual analysis. In the original
study in which the method presented here was developed,
there were neighborhoods in which community recreation
centre use was very low. Some of these neighborhoods had low
use of facilities, lack of program, etc. Other neighborhoods
showed low per capita use of community facilities because
there were ethnic communities with such a high degree of
solidarity that they did not use community facilities.
Within the City of Winnipeg, Mennonite and Ukrainian groups
(to cite but two examples) have extensive facilities of
their own either associated with ethnic community centres or
with churches. It is important to recognize this
relationship and to accept that it would be foolish to
attempt to increase use of public facilities in these ethnic
communities. It is necessary that the people involved in
carrying out analysis be conversant with the areas that they
are studying and the social and geographical factors that
influence what goes on in those areas.

If the kind of analysis proposed here were carried out
for a particular province, it would be relatively easy in
most provinces to find geographic areas in which special
considerations would be relevant in understanding the
activities in which people participated. Much more obvious
with respect to certain activities like hunting and fishing
is that a game warden or resource manager can (from his
knowledge of the availability of good hunting or good
fishing) give a ready idea of what some of the variations
in RAC coefficients mean without the need for becoming
involved in mapping the amount of supply available or in
developing maps of the potential to participate (see, for
example, TN 5, TN 16 and TN 17).

These comments are not meant to indicate that the
results of analysis procedures such as those presented in TN
5 and 17 are not useful or that work on supply measurement
procedures such as that reported in TN 16 is without value.
Rather, the point which is seen as important enough to
stress in this article is that today there is a tendency to
use overly sophisticated quantitative analyses and to put
too much reliance on external experts (who unfortunately are
often not experts) while ignoring a great deal of knowledge
that is contained within the experience of people who know
the area being studied. Yes, it is tricky using local
knowledge without getting local bias but this does not mean
one should not try.


CONCLUSION 


It may appear that the authors are making an issue of
something no more sophisticated than ordinary age specific
participation rates. Yet, any economist, sociologist or
demographer knows that though such rates are simple, their
use is one of the most effective ways to understand a social
phenomenon where a large number of events are involved.

However, in conclusion the authors feel it is important
to stress again that the technical considerations reviewed
should not dictate the ultimate decision as to the "real"
significance of any coefficient. Every evaluation involves
a policy decision and a data analysis methodology should not
define policy, but only be useful in policy definition and
evaluation. If it is policy not to provide recreation
facilities or programs for adults or aged, it is not
surprising that adults do not appear very frequently in
attendance figures. Should it be a policy to provide school
programs at historic sites in a certain geographic area,
the fact that this area sets the standard does not imply
that the other areas are being deprived of a program to
which they are entitled (or that they need). "The planning
tool" which should not be forgotten is the integration of
knowledge of the areas studied and of a political situation
with the quantitative guidelines that can be set by defining
measures using the methodology presented.
ESTIMATING THE ECONOMIC IMPACT
OF PROPOSED PARKS 


A Summary of the Report 
ECONOMIC IMPACT OF NATIONAL PARKS IN CANADA 
By Hildebrandt-Young and Associates Ltd.
Prepared Under Contract for Parks Canada 1970 


Edited By 
G.J. Gauthier and R.H. Stanley


ABSTRACT 


To study the economic impact of establishing national
parks, Parks Canada, in 1970, commissioned the firm of
Hildebrandt-Young and Associates Ltd. to undertake a study
that would review existing methodologies; propose a standard
methodology for Parks Canada; and test this methodology on a
proposed park.

This paper is a condensed version of the consultants'
two-volume report. It is divided into three parts:


I. the conceptual framework;


II. the Gros Morne Park proposal; a case
approach;


III. an annotated bibliography.


The original research does not propose the adoption of 
a specific methodology for all such studies, but does 
provide the reader with procedures for evaluating the impact 
of a National Park.


INTRODUCTION 


The economic impact of establishing national parks has
been of major interest to the Government of Canada for some
years. In 1970, National and Historic Parks Branch, now
Parks Canada, commissioned the firm of Hildebrandt-Young and
Associates Ltd. to undertake a study (see Reference 27) that
would:


(a) review existing methodologies; 


(b) propose a methodology suitable for Parks 
Canada; and 
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(c) test this methodology on a proposed parke 


This paper is a condensed version of the consultant's 
reporte Although the organization of the paper does not 
follow the organization of the report, no new ideas on 
impact evaluation have been added in revising the papere A 
commentary has been included at the end of the paper and it 
is followed by a short bibliography drawn, in part, from the 
annotated bibliography included in the reporte Alt the 
entries in the original bibliography are not included as 
most of the issues are better (or more briefly) dealt with 
in more recent Literature (see, for example, Keference 18 )e 

This paper is not intended to provide a model to follow 
in doing benefit-cost analysis, or socio-economic impact 
studies. Rather, it is intended to give the flavour of some 
of the difficulties involved in undertaking such a task. 
Therefore, the procedures endorsed here should not be taken 
to be those which are officially accepted by Parks Canada. 
Parks Canada has drafted guidelines for the execution of 
socio-economic impact studies and these deviate considerably 
from the ones followed by Hildebrandt-Young. In fact, a 
paper titled "A Longitudinal Analysis of the Impact of the 
Creation of a Major Park" (TN 40) and the Canadian Outdoor 
Recreation Research Committee paper on the economic impact 
of parks (Reference 27) set out a number of Parks Canada's 
concerns with regard to the economic impact research that 
has been undertaken on its behalf. 

Finally, because the title of the article does not make 
it clear, the reader may find it useful to note that the 
paper is in two parts: 


(1) a theoretical framework; and 
(2) the Gros Morne Park proposal: a case approach. 


The first part of the paper introduces the concept of 
benefit-cost analysis, examines the types of costs 
associated with the establishment of parks, and then 
examines the types of benefits. Each cost and benefit is 
discussed, in so far as it is useful to do so, with respect 
to: 


(1) its nature and elements; 

(2) the problems of measurement; 

(3) how these problems might be overcome; and 

(4) presenting the cost or benefit in accounts. 

The second part of the paper follows the same organization 
and includes tables presenting the estimated costs and 
benefits. 
PART I 


A THEORETICAL FRAMEWORK FOR THE EVALUATION 
OF THE BENEFITS AND COSTS OF NATIONAL PARKS 


INTRODUCTION 


Canadians both publicly and politically are of the 
view that national parks are valuable. The federal 
government is, however, faced with the problem of 
determining what priority this view has with respect to 
competing programs, or in other words just how valuable any 
newly proposed park is to those who benefit and to those who 
must bear the costs. This value problem is complicated by 
the fact that many of the benefits and costs of park 
development are intangible or indirect. The cost of 
foregoing resource exploitation may not be a direct cost to 
the park, but is still very real to those who no longer 
hunt, fish, trap, etc. and to those who depend on them. The 
benefits of preserving Canada's natural heritage are 
intangible, but constitute one of the prime purposes for the 
establishment of a park. Furthermore, benefits and costs 
accrue differently to the nation and the federal government, 
the provincial government, and the residents of the proposed 
park and immediate vicinity. 


COSTS 


There are three types of cost associated with park 
creation: opportunity, capital and operating. Opportunity 
costs are the measure of the highest value which resources 
could have realized, had they not been dedicated to National 
Park use, which precludes all mining, timber harvest, etc. 
In other words, they are the measure of benefit foregone 
because resource exploitation, actual or potential, must be 
foregone in the area of the park. Capital and operating 
costs are meant in their traditional senses. 


1. Opportunity Costs 


In the case of a national park, these costs arise from 
the prohibiting of timber harvests, mineral development, 
commercial fishing, agriculture and other similar resource 
uses. The components of the benefit foregone include wages 
and cost of plant and equipment (factor payments) which 
would be paid by firms engaged in extraction or harvesting, 
transportation, processing or manufacturing the primary 
material into marketable, exportable or consumable form. 
Also included is a value commonly referred to as pure 
economic rent which represents the inherent value of the 
resource. It is the residual which remains when factor 
payments, including "normal profit", are subtracted from 
market value. 

There are also secondary benefits. The existence of a 
resource processing industry will create a demand for goods 
and services that can bring into existence in the region 
further industries to supply the demand (backward linkages). 
It can also create a supply of goods and services that will 
permit the creation in the region of industries that will use 
these goods and services as their inputs (forward linkages). 
Payments to factors (e.g. labour) get re-spent in the region 
(net of savings and imports from outside the region) and 
these spendings (again net of savings) are spent again and 
again, thus multiplying the impact of any payment made in 
the region. All these benefits are foregone if resource 
exploitation is prohibited. 
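
The re-spending process described above behaves like a geometric series. The following is a minimal sketch of that arithmetic only, not the consultants' procedure: the function names and the 0.4 local re-spending share are assumptions introduced here for illustration.

```python
def respending_rounds(initial_payment, local_share, rounds):
    """Return the amount spent inside the region in each successive round.

    local_share is the (assumed) fraction of each dollar received that is
    spent again inside the region, i.e. net of savings and imports.
    """
    amounts = []
    amount = initial_payment
    for _ in range(rounds):
        amounts.append(amount)
        amount *= local_share
    return amounts


def regional_multiplier(local_share):
    """Limit of 1 + s + s**2 + ... for a share s with 0 <= s < 1."""
    if not 0 <= local_share < 1:
        raise ValueError("local_share must be in [0, 1)")
    return 1.0 / (1.0 - local_share)


payment = 1000.0  # hypothetical factor payment, in dollars
share = 0.4       # hypothetical local re-spending share
total_after_50_rounds = sum(respending_rounds(payment, share, 50))
print(round(total_after_50_rounds, 2))
print(round(payment * regional_multiplier(share), 2))
```

The two printed totals agree because the rounds die away quickly; the smaller the share of each payment that stays in the region, the closer the multiplier is to one.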

Opportunity costs for a resource in a park exist when 
the resource has a value that will be lost if it is locked 
up in the park. From the national point of view, opportunity 
costs usually reduce to the equivalent of pure economic rent 
plus the earnings of factors that would remain unemployed in 
the absence of resource development. However, in Canada, 
manpower is ordinarily almost fully employed (note that the 
Hildebrandt-Young report was written in 1968) and capital is 
characteristically overemployed and scarce. Thus it can 
usually be assumed that where development of one national 
resource is foregone, capital and manpower will find 
employment in the development of other economic 
opportunities within the nation. 

From the provincial point of view, the circumstances are 
very different. The resource potential of a province is, of 
course, much more limited than for the country as a whole. 
Foregoing the exploitation of a resource has a much more 
significant impact on the economy of a province than on 
Canada. If the capital and manpower for exploiting the 
resource are attracted to another province, the opportunity 
cost approaches the gross market value of the resource, and 
not just the economic rent as is the case from the viewpoint 
of the nation. The province also must consider the net 
effects of the forward and backward linkages (those occurring 
within the province) as opportunity cost. 

Conceptual difficulties in the measurement of 
opportunity costs are negligible, although Krutilla (see 
Reference 35) shows that matters are not all that clear. The 
key is correct description and understanding of the 
opportunities foregone. The calculation can be very complex, 
however, as often a great number of factor payments are 
involved. Actual accounting for these transactions is 
frequently impossible and some more general statistical 
measure must be used. Econometric input-output models, 
which measure the interaction of firms in the economy, are in 
various stages of completion for a number of Canadian 
provinces and regions. (For example, Statistics Canada is 
just now (1976) getting provincial models into operation.) 
These can often provide a reasonable estimate of the net 
effect of foregoing economic opportunities. (The Department 
of Regional Economic Expansion does not at present accept 
the estimates produced by these models.) 

For accounting purposes, it is useful to consider 
opportunity costs as capital or operating costs. For 
example, the cost of purchasing a farm within the boundaries 
of the proposed park can be considered a capital cost but in 
fact it represents to the owner the present value of his net 
future income stream from the farm (i.e. his earning 
opportunity foregone). 


2. Capital and Operating Costs 


Established methods for treating capital and operating 
costs are widely accepted. These costs present no problems 
of measurement. Difficulty is caused by accounting for the 
effects of dislocating communities and persons living within 
the park. It is necessary to determine the amount of 
compensation to be paid both for real assets owned by 
members of the community and for hardship undergone in 
relocating. Here again the key is accurate description of 
the extent of dislocation. (Life tenancy is currently under 
consideration as an alternative to compensation.) 


BENEFITS 


There are four types of benefits generated by the 
establishment of parks: preservation, recreation, economic 
stimulation from tourist spending, and economic stimulation 
from government expenditure for the establishment and 
operation of the park. The first two benefits in particular 
pose significant problems of measurement, and no generally 
accepted method has yet been found to measure them 
satisfactorily. 


1. Benefits of Preservation 


One of the two principal purposes of a national park is 
preservation, for its ecological or scenic uniqueness, for 
the maintenance of the widest possible variety of existing 
species of flora and fauna, or for its "option value". 
Option value is the value attached by a person to the 
possibility of obtaining a good or service in the future, 
when he has no intention at present of using it, but when 
re-establishing the supply of that good or service would be 
excessively expensive or even technically impossible once it 
had been discontinued. While it is obvious that all three 
types of preservation have value, it is equally obvious that 
it is not possible to attach firm monetary estimates to 
these, although an attempt could be made in the last case 
where something is preserved for future use (in this case, 
recreation). The best that can be done is to present 
preservation as an unmeasured value which must be considered 
subjectively in the benefit-cost analysis. 


2. Benefits of Recreation 


The other principal purpose of a national park is to 
produce recreation benefits. National parks supply a wide 
variety of recreation services such as sightseeing, camping, 
water sports, hiking, skiing and wildlife observation. The 
main difficulty in measuring this benefit derives from the 
fact that the consumer does not pay (at least directly) for 
the benefits received, as it is the policy of the National 
Parks of Canada to make parks available free or at nominal 
cost to visitors. 

In the private sector, price can be used as an 
indicator of benefit. While benefits received may be more 
than the price a consumer pays, the benefits will at least 
be no less. In the case of a park, however, price is zero or 
nominal, so it is not possible to use price as a proxy. It 
is therefore necessary to try to measure the benefits in 
some other way. From an extensive review of methods 
previously employed, three approaches offer some hope. 

THE GROSS EXPENDITURE METHOD, usually based on 
interviewing a sample of visitors, gives the total amount of 
tourist spending as reported by tourists themselves. While 
this method may provide guidelines for investment in tourist 
accommodation and so on, the visitor's expenditures are for 
fuel, food and accommodation, and not for access to the 
park. Therefore they do not necessarily reflect how the 
visitor rates the park's value. 

THE UNITED STATES FEDERAL AGENCY PROCEDURES, which are 
widely used in estimating recreation values, consist 
essentially of assigning standard dollar values to each 
activity park visitors participate in, multiplying the 
values by the number of visitors participating in each 
activity, and summing the values to obtain an estimate in 
dollar terms of benefits provided by the park. This approach 
has serious shortcomings: the values assigned are completely 
arbitrary, and they are assumed to remain constant over the 
project's life expectancy. However, this approach can 
provide, in the view of Hildebrandt-Young and Associates, a 
rough approximation of benefits obtained. In addition, the 
values have the advantage of appearing reasonable both to 
those who use the parks and those who must consider 
allocating funds to park development. 
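
The multiply-and-sum step of this procedure can be sketched as follows. This is an illustration under stated assumptions rather than the agency's own worksheet: the function name, the activity labels, the dollar values and the visitor-day counts are all hypothetical placeholders.

```python
def recreation_benefit(visitor_days, unit_values):
    """Sum of (visitor days x assigned dollar value) over all activities."""
    missing = set(visitor_days) - set(unit_values)
    if missing:
        raise KeyError(f"no unit value assigned for: {sorted(missing)}")
    return sum(days * unit_values[activity]
               for activity, days in visitor_days.items())


# Hypothetical inputs, for illustration only.
unit_values = {"picnicking": 1.00, "camping": 1.50, "hiking": 2.50}
visitor_days = {"picnicking": 10_000, "camping": 4_000, "hiking": 2_000}

annual_benefit = recreation_benefit(visitor_days, unit_values)
print(annual_benefit)
```

Because the dollar values are fixed inputs, the result is only as defensible as the values themselves, which is the shortcoming the text notes.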

THE HOTELLING METHOD is an attempt to determine the 
benefits by theoretical means. Users are grouped into 
geographical zones within which travel costs incurred in 
reaching the recreation site are relatively constant. For 
each zone, a participation rate per unit of population is 
found. It is assumed that the participation rate for a given 
zone would decline to the level of a more distant zone if 
residents of the first zone were charged a user's fee equal 
to the difference in travel costs facing the users in each 
zone. Participation rates can then be estimated for various 
fee levels. When it is known what "price" visitors in a zone 
would be willing to pay to visit the park, consumer surplus 
becomes the difference between this and what they do pay. 
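
One simplified reading of the zonal reasoning above is sketched below. It is not the consultants' implementation: the `Zone` structure, the linear interpolation between zones, the zero participation rate beyond the most distant zone and the three example zones are all assumptions made here for illustration.

```python
from dataclasses import dataclass


@dataclass
class Zone:
    travel_cost: float  # average cost of reaching the site from this zone
    population: float
    visits: float       # visits observed at the current (zero) fee

    @property
    def rate(self):
        """Participation rate per unit of population."""
        return self.visits / self.population


def rate_at_cost(zones, cost):
    """Participation rate expected at a given total cost per visit.

    Interpolates linearly between observed zones; beyond the most distant
    zone the rate is taken to be zero (an assumption of this sketch).
    """
    points = sorted((z.travel_cost, z.rate) for z in zones)
    if cost <= points[0][0]:
        return points[0][1]
    for (c0, r0), (c1, r1) in zip(points, points[1:]):
        if cost <= c1:
            return r0 + (r1 - r0) * (cost - c0) / (c1 - c0)
    return 0.0


def visits_at_fee(zones, fee):
    """Total visits predicted if every visitor paid an added fee."""
    return sum(z.population * rate_at_cost(zones, z.travel_cost + fee)
               for z in zones)


def consumer_surplus(zones, max_fee, steps=1000):
    """Area under the fee/visits demand curve, by the trapezoid rule."""
    width = max_fee / steps
    heights = [visits_at_fee(zones, i * width) for i in range(steps + 1)]
    return width * (sum(heights) - 0.5 * (heights[0] + heights[-1]))


# Three hypothetical zones of equal population at rising travel cost.
zones = [Zone(1.0, 1000, 300), Zone(2.0, 1000, 200), Zone(3.0, 1000, 100)]
for fee in (0.0, 1.0, 2.0):
    print(fee, visits_at_fee(zones, fee))
print(consumer_surplus(zones, max_fee=3.0))
```

Moving a zone boundary changes both the travel cost and the participation rate attached to each zone, which is one way to see why the resulting demand estimates are sensitive to how zones are drawn.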
There are several theoretical problems with the 
Hotelling method. There is no guarantee that behaviour in 
one zone can be used to predict behaviour in another. 
Secondly, changing the fee would not necessarily leave the 
quality of the recreation experience unchanged (e.g. 
congestion might increase). There are practical problems as 
well: (1) demand estimates are extremely sensitive to the 
choice of zone boundaries; (2) it is difficult to define 
minimum necessary travel costs; (3) it is difficult on a 
multi-purpose trip to separate benefits attributable 
directly to the park visit. Successful applications of this 
method have so far been limited to the analysis of 
recreation services such as daytime or weekend visits to 
sites that offer a uniform range of services. These 
conditions generally do not apply to Canadian national parks 
(see TN 31, 38, and Reference 18). 

While the US Federal Agency procedures appear to be the 
most useful method at this time (1970), and so will be used 
in this study, it is recommended that they be modified in 
three ways. Changes in prices since the technique was 
developed should be taken into account. Changes in relative 
prices, which have tended to place a higher value on outdoor 
recreation relative to general price changes, should also be 
taken into consideration. Finally, sensitivity of the 
benefit (in the eyes of the user) to travel distance should 
be taken into account. Recommended values are set out in 
Table 1. 


3. Benefits of Economic Stimulation from Tourist Spending 


The economic benefits of tourism derive from expanded 
economic activity which occurs as a result of transfer, by 
tourists, of spending power from one region to another. A 
tourist is considered a visitor from outside the region. 
Obviously, from the local point of view, this would include 
everyone not normally residing in the immediate area. From 
the provincial point of view, this means anyone who comes 
from outside the province, and from the national point of 
view it means anyone who is not normally resident in Canada. 
The requirement therefore for the measurement of 
benefits from tourism is to discern the level of spending 
within the region for which the analysis is being made, and 
which is attributable to the existence of the park. Because 
tourist expenditures are of considerable economic 
significance to all provincial governments and to the 
federal government, well designed studies have been carried 
out. These reveal average levels of tourist spending which 
make possible reasonably precise estimates of purchases of 
accommodation, food and fuel, and which will provide an 
estimate of gross expenditures by tourists attributable to a 
national park. It is necessary, however, to estimate 
expenditures net of interregional flows, that is, net of the 
purchases of goods and services imported from outside the 
region in question. As in the case of opportunity cost 
calculation, input-output models which show the linkages 
between firms producing goods and services purchased by 
tourists and firms outside the region will provide an 
indication of the extent to which the economy of the region 
will be expanded by tourist expenditures. 


TABLE 1 


PROPOSED VALUES FOR RECREATION USE 
OF CANADIAN NATIONAL PARKS PER VISITOR DAY 


                                   Representative Park     Unique Park 
                                   Local      Distant      Local      Distant 
                                   Point of   Point of     Point of   Point of 
Activity                           Origin     Origin       Origin     Origin 

1. Picnicking, driving for         $1.00      $1.00        $1.00      $2.00 
   pleasure, highway observation, 
   general use of intensive areas 

2. Overnight camping                1.50       1.50         1.50       3.00 

3. Skiing, golf, guided tours,      1.50       3.00         1.50       6.00 
   etc. 

4. Hiking, mountain climbing,       2.50       5.00         2.50      10.00 
   canoeing, nature observation, 
   etc. 


4. Benefits of Economic Stimulation from Government 
Expenditure 


Expenditures by the federal government for capital works and 
operation of a park are costs and not benefits to the 
national account. In the provincial context, however, such 
expenditures represent a transfer of funds into the region 
and a consequent expansion of the provincial economy 
(directly, through linkages created, and through the 
multiplier effect). This transfer is net of federal taxes 
paid for the establishment of the park by residents of the 
province, and net of money which drains out of the province 
for goods and services purchased outside. Money transferred 
into a region to meet opportunity costs cannot, of course, 
be counted as a benefit since its effect is to compensate 
for opportunities foregone. 


PRESENTATION OF ACCOUNTS 


1. Present Value and Discount Rates 


To compare costs and benefits, all values must be 
expressed using uniform units of measure. This is normally 
achieved by discounting all costs and benefits to their 
present value. Present value is the amount that would now 
have to be invested at a given interest rate (or discount 
rate) to yield a given amount at a given future date. There 
is no practical way in which to choose a single discount 
rate suitable to all purposes. The discount rate is a 
function of fiscal and monetary policy, anticipated price 
changes (inflation) and a net amount which is sometimes 
described as the real cost of capital. This last, in 
simplest terms, represents the degree to which present 
consumption is construed to be preferable to investment for 
the production of future values. For some years, it has 
appeared that this figure in Canada has approximated 6 
percent. This figure is only useful if prices and values are 
assumed to remain constant for the entire discount period 
and if fiscal and monetary policy do not shift the 
discount rate. 
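
The definition of present value given above corresponds to the standard compound-discount formula. The sketch below illustrates it for a single future amount; the function name and the $1,000 example amount are assumptions introduced here, and annual compounding is assumed.

```python
def present_value(future_amount, rate, years):
    """Amount to invest now at `rate` to yield `future_amount` in `years`.

    Assumes interest is compounded once a year.
    """
    return future_amount / (1.0 + rate) ** years


# A hypothetical $1,000 benefit received 20 years from now, discounted at
# the 6, 8 and 10 percent rates used in this paper.
for rate in (0.06, 0.08, 0.10):
    print(f"{rate:.0%}: {present_value(1000.0, rate, 20):.2f}")
```

The higher the discount rate, the smaller the present value of the same future amount, which is why the choice of rate matters so much when costs fall early and benefits fall late.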

The federal government's choice of a discount rate must 
take into account counter-cyclical spending requirements, 
current cost of money, and inflationary trends. The province 
must take the same things into account, but capital rationing 
is ordinarily more stringent because of the province's lack 
of authority to change monetary policy. Consequently, the 
province will tend to use a higher interest rate. The 
private individual's discount rate will be highest of all, 
as he usually has the least control on the money market and 
the least borrowing power. 


2. Presentation of Accounts 


The benefit and cost estimates for a proposed park are 
presented finally in a set of accounts, which is reviewed 
usually at both the federal and provincial levels, and 
possibly at the local level. Estimates cover a fixed 
period, probably 20 years. Since the benefits can be 
expected to vary from year to year, it is advisable to 
present an estimate from the provincial point of view and 
from the national point of view for each year within the 
planning horizon: 20 separate sheets, one for each of the 20 
years. 
PART II 


A CASE STUDY: 
EVALUATION OF BENEFITS AND COSTS OF THE 
PROPOSED GROS MORNE NATIONAL PARK, NEWFOUNDLAND 


THE PARK AND ITS ECONOMIC MILIEU 

1. The Park 


Gros Morne is the second national park in Newfoundland. 
It is located in an area of outstanding scenic and natural 
features, and so is considered a unique park. It is out of 
the main stream of tourist traffic and is rather remote and 
difficult of access. Therefore it is foreseen that it will 
receive low intensity use. 


2. The Local Economy 


According to the 1966 Census of Canada, the proposed 
boundaries of the park enclose 4,834 residents in four major 
community complexes and several smaller communities, mostly 
fishing and ports. These comprise about 1,000 households. 
2,388 persons fall within the category 15 to 64 (employable 
age). The high birthrate and a significant out-migration of 
young adults account for the high proportion of dependants. 
Personal disposable income per capita averages $860 a year, 
45% of the national average. Of this, 25% was derived from 
government transfer payments and 75% from wages, salary and 
self-employed income. 33% of persons are employed in 
fishing. Their net income is about $619 a year. Average 
income per capita is increased by the presence of 76 
professionals (nurses, teachers and doctors). There is also 
a high rate of government employment in such activities as 
road maintenance and ferry operation. The area is in general 
economically depressed, with residents supplementing their 
cash income by cutting stovewood, hunting and subsistence 
agriculture. 


COST ESTIMATES 

Discount or interest rates of six and ten percent are 
recommended for federal and national costs, while eight and 
ten percent are the recommended rates applied to provincial 
costs. The time horizon of the study is 20 years. It is 
recognized that since a park is established in perpetuity, 
many of the benefits and costs will be in perpetuity. 
Little certainty can be attached to estimates into 
perpetuity, however, so a definite period is selected. 
Constant dollars are used in calculations. Relative price 
changes are, however, accounted for. 


1. Opportunity Costs 


Park policy requires that all commercial resource 
exploitation in the park be discontinued. Because of the 
importance of fishing, and the possibility that it will be 
permitted to continue, it will be considered separately. 

The other opportunity costs are for foregone timber 
harvest, agriculture, and mineral development. Although the 
fuelwood harvest is of declining importance in Newfoundland, 
it provides a revenue in kind to low income residents of the 
area, and is almost free when cut from areas adjacent to the 
park, so discontinuing the fuelwood harvest would not be a 
loss to the provincial economy. It would, however, cost the 
local residents a net of $109,456 a year to replace the wood 
they harvest. Newfoundland imports lumber, and so the 
$42,300 which it would cost to import lumber to replace the 
product from the park area's 22 sawmills would be a cost to 
the region and the province. There is no loss to the 
national account because forest products could be 
manufactured in other parts of the country. The cost to 
local residents of replacing the products of subsistence 
agriculture would be about $24,000 each year. This would be 
a loss to the province, which is an importer of food, but not 
to the nation, where an excess exists. No information exists 
at this time concerning specific mineral occurrences or 
income in kind derived from hunting. 

A total of 254.5 square miles is held by private 
companies or individuals under lease or mineral concession. 
To the owner of the rights, the value of his holding is the 
present value of the annual net profit which can be obtained 
by using the land. There is not enough information available 
to assess the potential profits, so a value of $1.00 an acre 
is assumed for each year. This results in a present value, 
discounted at 8%, of $1,599,500. This is a cost to the 
provincial account, since, in order to conform with National 
Parks policy, the province must transfer the land to the 
federal government free of all encumbrances. The province 
must therefore either purchase privately owned lands and 
rights, or exchange them for comparable holdings elsewhere. 
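
The present value quoted for the mineral and lease holdings can be approximately reproduced as the present value of a level annual amount. The sketch below is a reconstruction under assumptions, not the consultants' worksheet: it reads the assumed value as $1.00 an acre a year, uses the study's 20-year horizon, converts square miles at 640 acres each, and assumes payments fall at the end of each year. The function name is hypothetical.

```python
ACRES_PER_SQUARE_MILE = 640


def annuity_present_value(annual_amount, rate, years):
    """Present value of a constant amount received at the end of each year."""
    return annual_amount * (1.0 - (1.0 + rate) ** -years) / rate


area_acres = 254.5 * ACRES_PER_SQUARE_MILE  # 162,880 acres
annual_value = area_acres * 1.00            # assumed value of $1.00 an acre
pv = annuity_present_value(annual_value, 0.08, 20)
print(round(pv))
```

Under these assumptions the result falls within a few hundred dollars of the $1,599,500 reported, a gap that could be explained by rounding of the annuity factor, although the source does not show its working.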


2. Capital and Operating Costs 


Gross capital outlay of the federal government over the 
15-year period estimated for the establishment of the park 
is $27,556,000, for a present value of $21,427,100 discounted 
at 6%. Salaries, equipment, and supplies necessary to 
operate the park would amount to $14,123,900 at present 
value discounted at 6%. 

Capital and maintenance expenditures on such things as 
access roads outside the park are constitutionally the 
responsibility of the province. Upgrading the access road 
would be a cost to the provincial treasury of $8,000,000. 
The province would be relieved of the cost of maintaining 
roads within the park, however, so the present value of the 
net cost to the province would be $4,724,400 discounted at 
8% over the 40-year write-off period. Basic costs are set 
out in Table 2, Part A. 


3. Park Boundary Alternatives and Dislocation Costs 


Present National Park policy prohibits permanent 
communities within new national parks. Because of the number 
of communities in the maximum extension of the park 
boundary, three alternative boundaries were considered which 
leave certain groups of communities outside the Park. In all 
cases, some communities will require relocation. 

Under the terms of the agreement signed by Canada and 
Newfoundland to consolidate communities, relocation grant 
levels have been established, some of which are shared 
equally by both governments. The community consolidation 
program is voluntary, of course, but the costs and grants 
would not be less for the compulsory program suggested here, 
so these costs and grants are used for estimating purposes. 
They include: $1,200 paid to each household; $200 to each 
member of the household; $3,000 for the purchase of a 
serviced lot; $500 for moving a house, or $8,000 to replace 
one that is not in good enough condition to move; clean-up 
costs for remaining structures estimated at $250 for a 
household; and such additional costs as $25,000 to replace 
each classroom and $28,000 to replace each hospital bed. 
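
The per-household grant levels listed above can be combined into a rough cost estimate. The following sketch is illustrative only: it assumes that every item applies to every relocated household and that the items are simply additive, which the source does not state, and the household data are hypothetical. Community-level items such as classrooms and hospital beds are left out.

```python
GRANT_PER_HOUSEHOLD = 1_200
GRANT_PER_MEMBER = 200
SERVICED_LOT = 3_000
MOVE_HOUSE = 500
REPLACE_HOUSE = 8_000
CLEAN_UP = 250


def household_relocation_cost(members, house_movable):
    """Relocation cost for one household under the listed grant levels."""
    housing = MOVE_HOUSE if house_movable else REPLACE_HOUSE
    return (GRANT_PER_HOUSEHOLD + GRANT_PER_MEMBER * members
            + SERVICED_LOT + housing + CLEAN_UP)


# Hypothetical community: (household size, can the house be moved?)
households = [(5, True), (4, False), (6, True)]
total = sum(household_relocation_cost(size, movable)
            for size, movable in households)
print(total)
```

The condition of the housing stock dominates the result: a household whose house must be replaced costs $7,500 more than one whose house can be moved.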

If commercial fishing were permitted, it is assumed 
that dislocated fishermen would move to neighbouring 
communities and no disruption of the fishing industry would 
occur. Disposable income would merely be distributed 
elsewhere. The provincial economy would not lose this. The 
loss in real property value and decline in business to 
merchants who would have to relocate is also included in the 
cost estimates. Table 2, Part B, outlines the dislocation 
costs of including the three alternative areas in the park. 


4. Implications of Prohibiting Inshore Fishing 


It was proposed to include an aquatic zone in the park 
extending approximately three miles off shore. This would 
have the effect of prohibiting commercial fishing. As 
fishing is the principal occupation or primary industry of 
the area and much of the secondary economic activity depends 
on it, the alternative of permitting inshore commercial 
fishing was also considered. 

Prohibiting fishing inside the proposed aquatic zone 
would drastically erode the entire economic base of a high 
proportion of the communities of the region. Therefore, two 
of the boundaries considered above are not real options 
because prohibiting fishing would force those excluded 
villages to be relocated in any case. 

The total annual opportunity cost to the national 
economy of the fishing harvest that would have to be 
foregone is $323,380. The loss to the provincial economy 
would be $380,680. Prohibiting fishing would also entail 
paying fishermen 50% of the replacement value of their 
equipment (50% payment is the practice set in the 
establishment of previous parks). The gross cash payment, a 
capital cost to the park, is $158,581. Table 2, Part C 
summarizes these costs. 


ESTIMATED BENEFITS 


1. Benefits of Preservation 


These do not create measurable or tangible economic 
benefits and so an explicit or monetary equivalent of these 
benefits cannot be provided (for a contrary view, see 
Reference 35). 


2. Benefits of Recreation Use 


To measure the benefits of recreation accruing to park 
visitors, the number of visitor days in the first twenty 
years of the park's operation was estimated, divided into 
categories of activities participated in, and multiplied by 
the value imputed to each activity by the modified U.S. 
Federal agency procedures. (For the categories of activities 
and values, see Table 1.) 

In 1968, 269,000 Newfoundland residents used Terra Nova 
Park (the other national park in Newfoundland). Although 
park use had grown very rapidly since the establishment of 
Terra Nova, it was reasonable to assume that growth would 
stabilize after 1968 at about 5.5%, based on patterns of 
growth at similar parks when first established and after 
several years. 1976 was taken as the year that Gros Morne 
would become fully operational (assuming the park was 
established in 1972 and took five years to develop and 
publicise). By 1976, it is estimated that there will be 
413,404 visitors from Newfoundland to the two parks. 
Examination of the amount of increase absorbed by other new 
parks located near established parks (Elk Island, for 
example) suggests that about two-thirds of the increase will 
go to Gros Morne, so that by 1976, 92,000 visitors can be 
expected. By extrapolating back to 1972 and by assuming a 
growth rate after 1976 of 2.76% a year (set low to reflect 
low incomes and low population growth in Newfoundland 
compared to the national average), it is estimated that 
18,900 persons will visit the park in the first year 
(representing 23,220 visitor days, based on an average stay 
of 1.3 days for a visitor observed in Terra Nova park) and 
138,404 visitors (179,925 visitor days) in the twentieth 
year. 
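
The twentieth-year figures for resident visitors follow from compound growth applied to the 1976 estimate. The sketch below retraces that step only; it reads the post-1976 growth rate as 2.76% a year and the average stay as 1.3 days, and it assumes that 1976 is the fifth year of operation, so that the twentieth year lies 15 years later. The function and variable names are hypothetical.

```python
def project(base_value, annual_growth, years):
    """Compound `base_value` forward by `annual_growth` for `years` years."""
    return base_value * (1.0 + annual_growth) ** years


visitors_1976 = 92_000  # resident visitors expected by 1976
growth = 0.0276         # assumed annual growth after 1976
stay_days = 1.3         # average stay observed at Terra Nova

# With the park established in 1972, 1976 is the fifth year, so the
# twentieth year is 15 years after 1976.
visitors_year_20 = project(visitors_1976, growth, 15)
visitor_days_year_20 = visitors_year_20 * stay_days
print(round(visitors_year_20), round(visitor_days_year_20))
```

On these readings the projection gives roughly 138,404 visitors and 179,925 visitor days, in line with the figures quoted for the twentieth year.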


Off-island visitors to Newfoundland parks must travel 
lone distances at great expense to reach theme Gros Morne 
park can therefore be likened to parks in the Yukon and 
Alaska that receive relatively few visitors but whose rate 
of increase of visitors is very high (about 12% a year)e In 
1568, off-island visitors to Terra Nova park totalled 
23,500- Applying the 12% growth rate, there will be 65,000 
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TABLE 2 


SUMMARY OF COST OF GROS MORNE PARK
PRESENT VALUE IN DOLLARS


National Costs

                                       6%              10%
                                  Discount Rate   Discount Rate

A  Basic Cost                      41,341,970      40,293,900
B  Dislocation Costs
     In all cases                     268,350         268,350
     Area A                         1,388,810       1,301,780
     Area B                         2,032,680       1,925,670
     Area C                         5,364,550       5,120,550
C  Costs of Prohibiting Fishing
     Opportunity Cost               3,709,170       2,753,260
     Equipment Replacement            158,581         158,581
   Total                           54,264,111      51,822,091


Federal Costs

                                       6%              10%
                                  Discount Rate   Discount Rate

A  Basic Cost                      35,551,000      33,556,200
B  Dislocation Costs
     In all cases                      55,200          55,200
     Area A                           277,500         277,500
     Area B                           368,250         368,250
     Area C                         1,013,100       1,013,100
C  Costs of Prohibiting Fishing
     Opportunity Cost               3,709,170      [illegible]
     Equipment Replacement            158,581         158,581
   Total                           41,132,801      38,182,791


Provincial Costs

                                       8%              10%
                                  Discount Rate   Discount Rate

A  Basic Cost                       6,974,800       7,301,400
B  Dislocation Costs
     In all cases                     213,150         213,150
     Area A                         1,065,410       1,024,280
     Area B                         1,607,230       1,557,420
     Area C                         4,221,450       4,107,450
C  Costs of Prohibiting Fishing
     Opportunity Cost               3,825,880       3,317,740
     Equipment Replacement
   Total                           17,257,020      16,957,040


Note: Boundary alternative I would exclude areas B and C
from the park, alternative II would exclude only area
C and alternative III would not exclude any of these
areas. The total cost of prohibiting fishing would
include the cost of relocating all these areas.


visitors by 1976. Because the two parks are no more than one
day's drive apart, it is likely that a large proportion of
these visitors will visit both parks, and so it is reasonable
to assume that about 75% of the new visitors will at least
visit Gros Morne. This means that in 1976, Gros Morne will
receive about 48,000 off-island visitors. Extrapolating as
before, and applying the 12% growth rate for the years after
1976, it is estimated that there will be 9,500 visitors in
the first year (12,500 visitor days) rising to 267,000
visitors (347,000 visitor days) in the twentieth.

Allocating (on the basis of common sense) the
proportion of visitors who would participate in each
category of activities and applying the modified U.S.
Federal agency procedures, it is estimated that in the first
year of operation, imputed recreation benefits would amount
to $68,000, rising to $1,255,000 in the twentieth year.
These imputed benefits are not monetized and so have no
linkage or multiplier effects.


3. Benefits of Economic Stimulation from Tourist Spending


Every dollar spent by a tourist is a dollar paid to
someone else, who will in turn spend part of it on his own
consumption or for his business. A part of the original
dollar is thus spent and respent over and over. This is the
multiplier effect. This effect in the region depends on the
amount of each dollar that goes into savings at each stage,
and the amount that leaks out of the region for imports. The
impact of all tourist spending is increased by the
multiplier, as is every expenditure by the federal
government for the establishment and operation of the park.
It should be noted as well that the tourist industry can
have linkage effects in the region.

Although tourist expenditure is clearly expansionary in
the region, if it was diverted from some other part of
Canada, it has a contracting effect on the national economy,
so a provincial gain may leave the nation exactly where it
was. For the purposes of this study, it has been assumed
that any tourist would, if the park had not been
established, be attracted to some other facility in Canada.

The value of tourist expenditures, based on visitor
rates, is estimated to rise from $70,000 in the first year
to $4,900,000 in the 20th year. The multiplier is calculated
to be 1.426.
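The re-spending process described in this section can be illustrated with a short sketch. It assumes, purely for illustration, that a constant share of every dollar is re-spent inside the region at each round (the rest going to savings or imports); the report does not say how its multiplier, read here as 1.426, was actually derived, and all names below are hypothetical.

```python
def multiplier(respend_share):
    """Closed-form multiplier 1 / (1 - r) for a constant re-spending share r."""
    if not 0.0 <= respend_share < 1.0:
        raise ValueError("respend_share must be in [0, 1)")
    return 1.0 / (1.0 - respend_share)


def multiplier_by_rounds(respend_share, rounds=200):
    """The same quantity summed round by round: 1 + r + r**2 + ..."""
    total, spend = 0.0, 1.0
    for _ in range(rounds):
        total += spend          # this round's spending inside the region
        spend *= respend_share  # the part that is spent again next round
    return total


def implied_respend_share(k):
    """Re-spending share that would produce a multiplier of k."""
    return 1.0 - 1.0 / k


REPORTED_MULTIPLIER = 1.426  # value quoted in the text, as read from the scan
r = implied_respend_share(REPORTED_MULTIPLIER)
print(f"implied regional re-spending share: {r:.3f}")
```

Read this way, a multiplier of 1.426 corresponds to a little under 30 cents of each dollar being re-spent within the region at every round.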


4. Benefits of Economic Stimulation from Government


Capital costs of constructing facilities within the
park, and the operating costs, while entered as a cost in
the national account, constitute a benefit to the province.
This benefit is net of tax payments to the federal
government (estimated at 2 1/2%) and purchases of goods and
services outside the province (imports). (This is
calculated to be 25% of expenditures.) The net transfer will
have an expansionary effect on the provincial economy
through direct spending and through the multiplier. There
are no net benefits to the national account as it must be
assumed that funds not used to establish and operate the
park would have been used elsewhere, perhaps in some other
region. Discounted at 6% and reduced to 72 1/2% because of
transfers and tax payments, the capital expenditures are
estimated at $15,534,647. At 10%, this amount would be
$16,727,852.
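A minimal sketch of the two adjustments described above, discounting to present value and then retaining 72 1/2% of the result, is given below. The spending schedule is hypothetical (the report's annual capital outlays are not reproduced in this section), discounting is assumed to start in year 1, and the function names are illustrative only.

```python
def present_value(annual_amounts, rate):
    """Discount a list of yearly amounts (year 1 first) to present value."""
    return sum(a / (1.0 + rate) ** t
               for t, a in enumerate(annual_amounts, start=1))


FEDERAL_TAX_SHARE = 0.025  # 2 1/2% returned to the federal government in taxes
IMPORT_SHARE = 0.25        # 25% spent on goods and services outside the province
RETAINED_SHARE = 1.0 - FEDERAL_TAX_SHARE - IMPORT_SHARE  # the 72 1/2% in the text


def provincial_benefit(annual_spending, rate):
    """Present value of federal spending that stays in the provincial economy."""
    return RETAINED_SHARE * present_value(annual_spending, rate)


# Hypothetical schedule: $1,000,000 a year for five years (not from the report).
example_spending = [1_000_000.0] * 5
for rate in (0.06, 0.10):
    print(f"{rate:.0%}: ${provincial_benefit(example_spending, rate):,.0f}")
```

With any schedule of positive outlays, the present value at 10% is lower than at 6%.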

Federal operating expenditures are reduced in the same
way and are estimated at present value to be $10,239,827
(6%) or $7,600,200 (10%).


COMPARISON OF BENEFITS AND COSTS

Table 3 shows the benefit cost ratios for the park from
the national and provincial points of view. From the
national point of view, benefits are less than costs, and an
annual comparison shows that the benefits would not begin to
exceed costs before the 16th to 20th year, depending on the
alternative boundary chosen. These benefits, however, do
not include any value to be derived from regional
development or preservation. It may be that the benefit of
preservation could be considered to exceed the difference
between benefits and costs. An unfavourable benefit-cost
ratio during a short planning period is characteristic of
many large-scale projects. Ordinarily, five or at most ten
years would be considered the short-term horizon beyond
which benefits should start to exceed costs. The park is,
however, preserved for use in perpetuity and the probability
exists that the benefit cost ratio would be favourable over
a longer period.

TABLE 3


BENEFIT COST RATIOS
($000's, present value)


National Viewpoint

               Estimated Cost     Estimated Benefit         Ratio
                6%       10%        6%       10%         6%       10%
I             42,731   41,956     19,877   12,768      0.46:1   0.30:1
II            44,763   43,521     19,877   12,768      0.44:1   0.29:1
III           50,128   48,642     19,877   12,768      0.39:1   0.26:1
Fishing       54,264   51,822     19,877   12,768      0.36:1   0.24:1
Prohibited

Provincial Viewpoint

                8%       10%        8%       10%         8%       10%
I              8,040    8,326     62,840   54,002      7.81:1   6.48:1
II             9,647    9,883     62,840   54,002      6.51:1   5.46:1
III           13,869   13,991     62,840   54,002      4.53:1   3.85:1
Fishing       17,257   16,957     62,840   54,002      3.64:1   3.18:1
Prohibited
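The ratios in Table 3 follow directly from dividing estimated benefit by estimated cost. The sketch below recomputes a few of them from the tabulated figures. The truncation to two decimal places is an inference from the printed values (for example 19,877 / 50,128 is about 0.3965 but is shown as 0.39:1), not something the report states, and the helper names are hypothetical.

```python
import math

# (cost, benefit) in $000 of present value, as read from Table 3.
NATIONAL_6_PERCENT = {
    "I": (42_731, 19_877),
    "II": (44_763, 19_877),
    "III": (50_128, 19_877),
    "Fishing prohibited": (54_264, 19_877),
}
PROVINCIAL_8_PERCENT = {
    "I": (8_040, 62_840),
    "II": (9_647, 62_840),
    "III": (13_869, 62_840),
    "Fishing prohibited": (17_257, 62_840),
}


def benefit_cost_ratio(cost, benefit):
    """Benefit per dollar of cost."""
    return benefit / cost


def label(cost, benefit, places=2):
    """Format a ratio the way Table 3 appears to: truncated, then ':1'."""
    factor = 10 ** places
    truncated = math.floor(benefit_cost_ratio(cost, benefit) * factor) / factor
    return f"{truncated:.{places}f}:1"


for name, (cost, benefit) in NATIONAL_6_PERCENT.items():
    print("national  ", name, label(cost, benefit))
for name, (cost, benefit) in PROVINCIAL_8_PERCENT.items():
    print("provincial", name, label(cost, benefit))
```

A ratio below 1:1, as in every national row, means that discounted benefits fall short of discounted costs over the planning period.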

From the provincial viewpoint, the benefits of Gros
Morne Park vastly exceed the costs, most of which are met by
the federal government and thus transfer purchasing power to
Newfoundland. New hotel and motel accommodation, restaurants
and gasoline stations will be required, and many existing
businesses will benefit. However, tourist traffic is
concentrated heavily in the summer months, and so this
benefit would tend to be seasonal. In many cases, it would
probably consist of supplementary family income earned by
women and young adults. There are, however, social costs not
accounted for, particularly as far as concerns local
residents whose lives have been disrupted by the
establishment of the park.


COMMENTARY 


At the outset of the study, Parks Canada was aware of
the inadequacy of the methodologies reviewed by the
consultants. Therefore, the consultants were given the
objective of reviewing existing methodologies and choosing
or developing one which would adequately and comprehensively
evaluate the costs and benefits of a park and would be
adapted to the unique characteristics of the Canadian
context.

They did not realize this objective: in some areas they
did not develop their ideas enough. In other areas they were
not comprehensive or glossed over problems. They proposed a
modified version of the U.S. Federal Agency procedures, but
provided no information on the method used to derive the
values obtained. Consequently, the report provides no
guidance on how to adjust the estimated values in the
future, or on how to improve the estimating procedure. They
mention the existence of income multipliers, without
applying them, and do not deal with employment multipliers
at all. Finally, and most seriously, they do not adequately
treat the whole area of intangibles, such as preservation
benefits or the social impact of a park on a region
(dislocation and break-up of communities, unemployment and
the qualification of local residents for jobs created by the
park, etc.). These intangibles may have a much greater
importance than any other considerations in assessing a
park. In some research, it may be acceptable to put aside
these questions as beyond the realm of economics, but there
are researchers who have puzzled over them, and Parks Canada
is frequently called upon to make decisions in which these
questions play a part. Any methodology which does not deal
with intangibles is thus not adequate for Parks Canada's
needs.

The responsibility for this shortcoming cannot,
however, rest entirely on Hildebrandt-Young and Associates
Ltd. At the time of the study project, Parks Canada had not
as yet drafted a series of guidelines that would have
permitted close supervision and strict adherence to the
terms of reference. Had Parks Canada insisted that the
consulting firm hire personnel with sufficient expertise to
develop the required methodology, no study would have been
done because the answers to many questions still are not
known. Doing an analysis that is good for its day and
within the allowed budget for the work is all that (in the
end) can be demanded from any consultant.

The research undertaken did make a number of
substantial contributions to the problem of measuring the
impact of national parks. For instance, the analysis of the
three most popular methodologies is comprehensive both in
its review and assessment. Furthermore, the analysis of
quantifiable variables such as opportunity costs (commercial
fishing, forestry, agriculture, mineral development
potential, etc.) and estimates of benefits (construction,
tourism expenditures, etc.) were both complete and
enlightening. Lastly, the analysis of the various
development plans for the proposed Gros Morne National Park
was done in such a way as to facilitate decision-making
concerning the most feasible option.

Thus it must be said that, even though the study did
not meet Parks Canada's original optimistic expectations,
the report is a useful document for planning and policy
purposes. The study can serve as a stepping stone toward an
understanding of what to do and what not to do in other
research projects of this scope and nature.


APPENDIX


ANNOTATED BIBLIOGRAPHY 


Brazer, Harvey E. "Outdoor Recreation as a Public Good and
Some Problems of Financing." Address Presented before
the National Short Course on Elements of Outdoor
Recreation Planning. Ann Arbor, Michigan, 10 May, 1968.


Outdoor recreation is a public good only to a very limited
extent; therefore it should be offered by government to
users at a price equal to the marginal cost that their use
imposes. If outdoor recreation has constantly declining
average costs (of production) this scheme will not cover all
costs, and the remainder may be met either by the government
or by a charge for facilities access. Problems of congestion
costs are also discussed.


Canadian Outdoor Recreation Research Committee, THE ECONOMIC
IMPACT OF PARKS. Federal Provincial Parks Conference,
Canada, 1975.


This paper outlines methods now available for the
measurement of primary and secondary benefits of parks and
the problems associated with these techniques. Estimates of
benefits from past studies are included, as is a set of
administrative guidelines for undertaking a comprehensive
socio-economic impact analysis.


Coomber, Nicholas H. and Biswas, A. K., EVALUATION OF
ENVIRONMENTAL INTANGIBLES. General Press, Bronxville,
New York, 1973.


A review of the state of the art of evaluating intangible
benefits and costs associated with the use of the
environment. The authors synthesize, organize and criticize
those techniques which have been widely used in the past for
this purpose.


Davidson, Paul; Adams, Gerald F. and Seneca, Joseph "The
Social Value of Water Recreation Facilities Resulting
From the Improvement in Water Quality: The Delaware
Estuary" in Kneese and Smith, WATER RESEARCH, The Johns
Hopkins Press, Baltimore, Maryland, pp. 175-214, 1966.


The first portion of the article is an argument for public
intervention in the recreation market. The authors feel that
this can be justified by the pervasive externalities, the
existence of off-peak demand that can be provided at zero
cost, the presence of option demand, and the phenomenon of
"learning by doing". The second portion of the article
discusses the specific problem of how much recreation will
be provided by improved water quality in the Delaware
estuary. The figure is not quantified monetarily.


Frey, John C. and Gamble, Hays B. "Policy Issues and
Problems in Outdoor Recreation", JOURNAL OF FARM
ECONOMICS. 49 (5) December, pp. 1307-1317, 1967.


Using Pareto optimality as the basic welfare criterion, the
authors argue that recreation expenditures will achieve
maximum welfare only if the pricing mechanism is used. It is
pointed out that one of the simplest welfare maximizing
criteria is locating recreation areas nearer population
areas. In some cases an increase in recreation sites results
in a smaller congestion effect which in turn increases
recreation at all sites. Finally, they suggest more
multi-purpose ventures and advocate joint public-private
partnerships for recreation development.


Norton, Virgil J. "Discussion: Policy Issues and Problems in
Outdoor Recreation Economics", JOURNAL OF FARM
ECONOMICS. 49 (5) December, 1967.


The author differs with Frey and Gamble in arguing that 
merit and redistribution effects of recreation are too 
important to advocate policy on the narrow grounds of Pareto 
optimality. He also notes that present measurement schemes
are inadequate for accurate benefit-cost results. 


Robinson, Warren C. "The Simple Economics of Outdoor
Recreation", LAND ECONOMICS, 43 (1) February, pp.


Robinson argues that outdoor recreation should be publicly
provided on the grounds that it is a merit service. However,
it is a merit service not for distributional reasons but as
a result of social judgments in favor of a type of quality
control that will not be provided by private markets. Since
the merit service is not re-distributional in nature, it can
be priced. The optimal pricing mechanism can be established
in the short run by setting marginal cost equal to price
while employing taxes or subsidies on complementary goods to
adjust demand to the designed capacity of the facility.
Other rationing and pricing schemes are also considered.


Schmid, Allan A. "Economic Analysis of Water Resource
Problems: Non-Market Values and Efficiency of Public
Investments in Water Resources", AMERICAN ECONOMIC
REVIEW. 58 (2) May, pp. 158-168, 1967.


A general review of the problem involved in measuring
"intangibles". Schmid argues that all goods can be priced
as they are implicitly priced whenever a decision is made
regarding them. Following this point is a suggestive and
non-technical discussion of public goods, externalities, and
non-marginal change. Externalities and public goods are
problems involving property rights. Many environmental goods
constitute new products and thus present problems of
non-marginal change and price adjustment throughout the economy.


CORD STUDY TECHNICAL NOTE 40


EVALUATION OF THE IMPACT OF THE
CREATION OF A NEW NATIONAL PARK:
A LONGITUDINAL STUDY


J. Beaman and L. Lehtiniemi


ABSTRACT 


The creation of a national park has long-term effects
upon the physical and social environment in which it is
located. Responsible expenditure of public moneys dictates
that efforts be made to maximize beneficial outcomes and to
avoid the adverse effects of park development. Officials
responsible for the establishment of parks should rely upon
whatever tools are available for predicting the consequences
of decisions taken.

This paper examines the use of economic impact studies
by the creators of national parks in Canada. First, the role
of economic impact analyses in the sequence of events that
has recently been typical of the creation of new national
parks in Canada is sketched. Next, references reflecting the
state of the art are noted and two types of impact
assessments are distinguished. A case study representative
of the two types is presented, showing how and why the
events that actually took place in developing a specific
national park differed from what had been predicted.

The paper suggests how, in the light of Parks Canada
experience, economic impact studies can be refined and
improved, and the role they should play in future park
development.


INTRODUCTION 


The creation of a national park has long-term and in
many ways irreversible effects upon the physical and social
environment in which it is located. Responsible expenditure
of public moneys dictates that efforts be made to maximize
beneficial outcomes and to avoid adverse effects of
development of a park. Officials responsible for the
establishment of parks should therefore rely upon whatever
tools are available for predicting consequences of decisions
taken in the present.

Although proven techniques for assessing the impact of
developing a major park cannot yet be claimed, progress is
being made. Some aspects of this progress are clearly
documented in this volume. This paper not only cites these
developments but points up some interesting results of
studying an area from before a national park was created
(1964) until the park was in full operation (1973).

It is by detailed analysis of what happened with
respect to one park in particular that this paper frankly
examines the use of economic impact studies by the creators
of national parks in Canada. The role of economic impact
analyses in the sequence of events that has recently been
typical of the creation of new national parks in Canada is
first described for those not familiar with it. However, the
main thrusts of this presentation are directed toward
suggesting how, in the light of Parks Canada's experience,
economic impact studies can be refined and improved.
Possibly of even more importance are reflections on what
role such studies should play in development of new parks
from conception to implementation and operation.


THE ROLE OF ECONOMIC IMPACT STUDIES 
IN THE PROCESS OF CREATING A NATIONAL PARK 


The creation of a national park in Canada involves
bilateral negotiations between the Federal and Provincial
levels of Government. The Federal Government desires to
realize certain national objectives in developing a national
park. Because the Federal Government ultimately requires
title in fee simple to the land that will become a national
park, and the land must be transferred to it by the province
free of any encumbrance, a strong bargaining situation
arises. Provincial Governments, in exchange for granting
title and relinquishing the right to resources within the
park, seek to ensure that proposed parks are compatible with
and will enhance regional growth and development within the
province, or help attain some other provincial objectives.
It is within such a context of intergovernmental relations
that economic studies of proposed national parks are
conducted.

The sequence of events related to creating a national
park involves definable phases (see Figure 1) and normally
extends over several years. The first step of relevance to
this discussion consists of the preparation of a concept
plan that outlines in general terms where the park will be
located, something about its size, the features to be
highlighted and the type of development to be undertaken.
This plan may involve only one or several options for the
development of a park. Although it is customarily the
Federal Government which prepares the plan, the process of
concept plan formulation is dynamic, characterized by
discussions and "tailoring" of the plan in the interests of
producing an end product that appears viable and mutually
desirable.

It is only after the discussion of the concept plan has
evolved to a certain stage that an economic impact
assessment is normally commissioned to evaluate the viability
of implementing the plan(s). At this stage an economic
impact study is used to determine if or on what basis a park
proposal should be moved beyond the concept plan stage. In
such a study one evaluates what is only, in fact, an outline
of what might be. Given that many details may be modified
prior to final development of the park, it is necessary to
deal with approximations. Because the analyst is charged
with evaluating the impact of a development as outlined in
the concept plan, the effects of deviations from that plan
or the effects of exogenous influences not considered in the
study are seldom pursued.


FIGURE 1
THE SEQUENCE OF EVENTS IN PARKS DEVELOPMENT


1. Proposal for a Park Considered Internally.

2. Bilateral Negotiations Begin.

3. Preparation of the Concept Plan for the Park.

4. Economic Impact Assessment of the Proposed Park.

5. Evaluation and Reaction to Economic Impact Study.

6. Agreement in Principle to Create the Park.

7. Preparation of Master Plan for Park Development.

8. Development of the Park.


If a proposal to develop a park is judged to be viable
on the basis of the economic impact assessment, formal
negotiations between the governments involved will proceed
and possibly pursue new development alternatives before an
agreement in principle is reached. During this process, it
is likely that the decisions and compromises made will
modify some aspects of the original concept plan. For
example, if both parties do not concur about the viability
of the proposed park, the negotiations could involve
appropriate land transfers or other compensation to promote
agreement.

After a formal agreement to create a park has been
struck, detailed planning commences, carrying a high
potential for further modification of the concept plan upon
which the economic impact analysis of a park was based. In
this regard it should be noted that there is no mechanism to
ensure that the final design of a park adheres closely to
the concept plan on the basis of which the park was judged
to be viable. The consequences of changes may become evident
only after the park is developed.


ECONOMIC IMPACT STUDIES


With respect to the work done for Parks Canada, two
types of economic impact analyses can be distinguished. The
type that has usually been employed in evaluating proposed
national parks in Canada consists of a single predictive
assessment of the outcome of developing a park (see TN 39
for one example). Among these a priori studies there can be
wide variation in the approach to evaluation. Some focus on
secondary benefits only; others attempt to include an
evaluation of the primary benefits to future park users. The
accounts prepared (local, regional, provincial) and the
degree of detail and comprehensiveness of these studies are
variable. Details are not provided here because many of the
studies prepared relate to park proposals still under
consideration.

The state of the art of impact assessment to 1970 has
been summarized for Parks Canada as part of an evaluation of
the proposed Gros Morne National Park in Newfoundland.
(Again see TN 39 and relevant introduction and review
material for Chapter VIII.) In this report Hildebrandt-Young
(see Reference 27) reviewed techniques used by others and
available to those conducting economic impact analyses.
Their bibliography includes more than 160 items which deal
with public finance and economic problems of national park
development. The Coomber and Biswas monograph "Evaluation of
Environmental Intangibles: Review of Techniques" (see
Reference 18) prepared for Environment Canada provides a
complementary review which focuses almost exclusively on
primary benefits.

In contrast to a priori studies are those which
examine the consequences of creating a park after it is in
operation. These analyses measure what really happened as a
result of developing a park and compare actual outcomes with
what had been predicted in an a priori study. Apart from the
timing, such studies are similar to predictive studies in
terms of the variations possible in scope, details and
comprehensiveness. However, findings from follow-up economic
impact assessments can suggest how predictive models might
be improved. The only follow-up study of the impact of
creating a national park in Canada to date is that of
Kejimkujik National Park, which is reviewed in some detail
here.


THE KEJIMKUJIK STUDIES 


In 1962 an area of approximately 150 square miles
surrounding Kejimkujik Lake in southwestern Nova Scotia was
proposed for development as a national park. As one stage in
the creation of the park an a priori study was commissioned
to assess the economic impact of the park on the area judged
to be within normal commuting distance to the park for
reasons of employment. The objectives of the Economic Survey
of the Kejimkujik Park Area (see Reference 28, 29) were to
(a) provide bench-mark data on the economic conditions of
the area to enable comparison with future conditions
resulting from a change in resource use, and (b) determine
the economic justification for a specified change
in resource use (i.e. for the establishment of a national
park) in terms of the total benefits to be derived relative
to total costs.

The study did provide income, employment and property
values as parameters of economic conditions, and a
methodology for reassessment of these at a later date.
However, without having attempted to measure the benefits of
creating a park, but having suggested procedures that might
be employed in doing so, the report concludes with a
favourable "estimation" of the prospects for success of the
park as an economic stimulant. Judged against the way the
authors of this paper suggest an evaluation should proceed,
the study would be harshly criticized. It should be borne in
mind, though, that the study was carried out in 1964, and
methods for impact analysis have improved since then.

Regardless of the shortcomings of the formal study, the
history of the development of Kejimkujik serves to point up
how and why even a rigorous, expensive and thorough analysis
might have failed to predict what has happened there. In
this regard one should note that no control mechanisms
existed to keep park development on the right "trajectory"
to fulfill and realize the expectations for the park that
had been expressed in the concept plan.

After agreement in principle had been reached, a Master
Development Plan was formulated in 1965 to delineate
facility location and development zones within the park. The
plan was based on an expected volume of visitor days and a
number of rental accommodation units to be available outside
the park boundaries by 1970. The volume-of-use estimate was
based on use figures for the three existing national parks
in the Maritime Provinces. It was assumed that the origins
of visitors to Kejimkujik would be similar to those of
visitors to Cape Breton Highlands National Park, which is at
the opposite end of Nova Scotia from Kejimkujik (see Figure
2). Moreover, it was assumed that Nova Scotia residents
would use the new park as much as New Brunswick residents
used Fundy National Park. By considering these factors, a
need for 1000 campsites within the park was foreseen. The
need for much private sector accommodation outside the park
was accepted. Unfortunately, details on the basis for
estimating accommodation requirements are not known.

Kejimkujik National Park was officially opened in 1969,
four years after the first master plan was prepared.
Although the master plan was based on an estimate of 210,000
entrants by 1970, the actual use volume in 1971 was only
about 150,000 visitors. The number of campsites that had been
developed by 1971 was only 330, less than one third of what
had been planned.

Because of a Parks Canada moratorium on campground
development in the park, 330 sites were not to be exceeded,
in order to encourage campground development near the park by
the private sector. However, even in 1973, the Park's 330
campsites seldom reached 75 percent occupancy on weekdays.
Admittedly, on weekends in July and August the park's
campsites were usually filled to capacity. Turning away
potential users was often a problem. Still, it is
interesting to note here that although the Kejimkujik
campgrounds are usually full by 9 p.m. Friday evenings, one
private campground just outside the park has only
"reasonable" occupancy on "some" weekends. Other major
campgrounds in the area are used mainly on weekends, but
these have their own resource base and cater to people who
reserve sites on which to park trailers for the whole
summer.

The characteristics of visitors to the Park are quite
different from what had been expected. Nova Scotia residents
constituted more than 69 percent of all park users in 1973,
whereas it had been predicted that they would account for
approximately one third of the visitors to Kejimkujik. In
contrast, residents from the other Canadian provinces
accounted for only 11 percent of the users, much less than
the 33 percent that had been expected. From the perspective
of "user-days", Nova Scotians and other Canadians represented
67 and 12 percent respectively of the total for the Park.
Lastly, it should be noted that just less than one half of
all visitors to Kejimkujik National Park in 1973 were day
users.
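A follow-up assessment of this kind amounts to setting each prediction beside the observed outcome. The sketch below does so for three figures quoted in this section. It is illustrative only: the class and field names are hypothetical, "approximately one third" is taken as 33 1/3 percent, and "more than 69 percent" is entered as 69, so the Nova Scotia deviation is a lower bound.

```python
from dataclasses import dataclass


@dataclass
class Comparison:
    label: str
    predicted: float
    actual: float

    @property
    def deviation(self):
        """Actual minus predicted, as a fraction of the predicted value."""
        return (self.actual - self.predicted) / self.predicted


ITEMS = [
    # 1000 campsites foreseen in the master plan; 330 developed by 1971.
    Comparison("Campsites in the park", 1000, 330),
    # Shares of park users in 1973 against the shares that had been expected.
    Comparison("Nova Scotia share of users (%)", 100 / 3, 69),
    Comparison("Other-Canadian share of users (%)", 33, 11),
]

for item in ITEMS:
    print(f"{item.label}: predicted {item.predicted:.0f}, "
          f"actual {item.actual:.0f}, deviation {item.deviation:+.0%}")
```

Expressing the gaps as a fraction of the prediction makes items in different units comparable: the campsite and other-Canadian figures both fall about two thirds short, while the Nova Scotia share is roughly double what was expected.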

Five years after the opening of the park, and while the
events just described transpired, the Institute of Public
Affairs (see Reference 28) conducted a second study to
measure the actual socio-economic impact of the park on the
local area. The study replicated in part the surveys carried
out in 1964. Comparisons were made of income and employment
before and after creation of the park. However, an
assessment of property values was not repeated in the
follow-up study, although changes in property value have
been recognized as important in determining the impact of
such developments (e.g. see Reference 29).

The 1974 Kejimkujik National Park Socio-Economic Impact
Study (see Reference 29) concluded that the park had not
been a major generator of regional economic growth. It makes
it very clear that the park did not have the developmental
impact initially anticipated. The study reports that, as of
1973, the park provided direct and indirect employment for
113 people (83 of whom were part-time workers). In the
assessment, the loss of 33 part-time jobs at tourist
facilities which were closed as a result of the park is
recognized, but the 1974 review does not include estimation
of foregone employment from the forest industry in computing
the net employment benefits, nor does it acknowledge the
loss of use of the land for hunting and trapping.

What is more, it is not emphasized in the follow-up 
assessment study that only 17 of the 30 full-time jobs and 
Da nok the 48 part-time positions which constitute the net 
employment effects were filled by local residents. The
skilled workers used to administer the park and many of its
programs were imported from outside the Region. Still, the
report does conclude that the large number of part-time jobs
at the park does not make for a stable economic climate, but
no indication is given as to what increase or decrease in
stability resulted from developing the park.


The net income impact of the park in 1973 was estimated
as $384,295, using the income multiplier of 1.108 derived in
the 1974 follow-up study. But one should note that in
calculating the impact and the multiplier, the wages paid to
employees who were brought in from outside the area as park
staff by Parks Canada are included. It might have been more
appropriate to include only the local expenditures of these
"imported" employees when computing the regional benefits of
creating the park.


Inclusion of a consumer surplus value per visit by Nova
Scotia residents in 1973 raises the amount of benefits to
the province for that year by $693,000 when conservative
estimates of consumer surplus values are used (see TN 38 for
Canadian consumer surplus values). The computations used to
derive this primary benefit value are shown in Table 1.


TABLE 1


CALCULATION OF PRIMARY BENEFITS
TO NOVA SCOTIA RESIDENTS
USING KEJIMKUJIK NATIONAL PARK


Type of          Number of      Consumer     Primary
Use              Users          Surplus      Benefit
                 (persons)¹     Value²       Value ($)

Day use            23,949         5.00        119,745
Overnight use       4,839         9.00         43,551
Camping trip       15,136        35.00        529,760
Total                                         693,056


1. From Park Visitor Survey, 1973.
2. Adapted from values in Table 1, TN 38.
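
The arithmetic behind Table 1 can be checked with a short sketch. It assumes that each primary benefit value is simply the number of users multiplied by the consumer surplus value per person, which is how the rows of the table appear to be built; the names TABLE_1 and primary_benefits are illustrative and do not come from the study.

```python
# Users (persons) and consumer surplus value ($ per person), as read from Table 1.
TABLE_1 = {
    "Day use": (23_949, 5.00),
    "Overnight use": (4_839, 9.00),
    "Camping trip": (15_136, 35.00),
}


def primary_benefits(table):
    """Return the benefit value for each type of use and the overall total ($)."""
    by_use = {use: users * surplus for use, (users, surplus) in table.items()}
    return by_use, sum(by_use.values())


by_use, total = primary_benefits(TABLE_1)
for use, value in by_use.items():
    print(f"{use:<14} {value:>10,.0f}")
print(f"{'Total':<14} {total:>10,.0f}")
```

The total agrees with the figure of roughly $693,000 quoted in the text.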




Lest an unfavourable impression about the wisdom of
creating Kejimkujik arise, some additional comments merit
attention in passing. Although the economic assessment did
not consider primary benefits, analysis of the
characteristics of users of the park has shown that almost
70 percent of the visitors are residents of the province.
Moreover, the development of Kejimkujik retains in the
province the expenditures of those Nova Scotia residents who
would otherwise travel out of the province to seek the type
of experience that the park offers. Finally, neglecting to
consider 1973 property values in the area with respect to
real capital gain resulting from development of the park is
only part of a failure to consider the gains derived from
creating a park in 1964 that is sustaining employment in the
local area and which now serves as an important element in
developing the tourist potential of southern Nova Scotia.


SUGGESTION FOR IMPROVING ECONOMIC IMPACT ANALYSIS 


It should be obvious that the studies of Kejimkujik
were not reviewed as examples of research that should be
used as models of how to analyze the impact of proposed
parks. Nor is the other CORD Study impact analysis
endorsed as a model to follow in doing other analyses (see
TN 302). The case of Kejimkujik was presented because it is
not typical of socio-economic impact studies of National
Parks in Canada in one important respect - it involved a
follow-up assessment of the impact of the creation of a
major park. Having that salient feature, it serves as a
springboard to a more general discussion of such studies and
how these should be refined.

Two main thrusts are suggested as necessary in
improving studies. One involves advancing the state of the
analysis art and thus enhancing the quality of assessment
studies. This is desirable in its own right, even if the
function served by impact studies remains unchanged. The
other thrust, which the authors believe the "Kejimkujik" and
other studies show is necessary, entails promoting a closer
and continuing interplay between the persons doing the
impact analyses and the park planners and developers. The
objective is to promote longitudinal assessments throughout
the process of park creation and even until after the park
has been developed. Naturally, efforts should be made to
integrate the two types of improvements.

The sequence of steps shown in Figure 3 relates to the
first of the two thrusts just cited - namely, ensuring high
quality in economic impact studies. The first two steps
involve clarifying and articulating the objectives of a park
and thereby making more specific the development options to
be considered. The statements in the figure suggest that it
would be important early in a study to identify the expected
relationships between park development and activities in the
private sector. The third step consists of delineating what
economic and social elements should be included in the
socio-economic study accounts to measure the extent to which
the objectives delineated in step 1 are attained. Although
professional judgement, the prerogative of the client and
the peculiarities of each situation will always be deciding
factors regarding what is included, it is suggested that a
general checklist of elements that should be considered in
any socio-economic impact analysis would be helpful. Such a
list would serve as a focus for discussion between the
analyst technically responsible for an impact study and the
"user client".

The remaining steps listed in Figure 3 relate directly
to the accuracy of impact assessments. Their importance has
become apparent from Parks Canada's experiences, some of
which are cited in the next section, and from work done as
part of the Canadian Outdoor Recreation Demand (CORD) Study.

Probably the most important thing to note is that the
final step suggested in Figure 3 involves creating a
mechanism to promote continuing assessment of the impact of
each modification to the concept plan during the possibly
protracted process between first consideration of a park and
its being in operation. The identification of critical
indicators to be used in evaluating modifications could be
the necessary tool to use in exerting the strong pressures
needed to keep park development consistent with the
objectives that were sought through its creation. In this
regard one should note that many impact assessment studies
have been unfairly judged in retrospect because they were
based on a concept plan which was extensively transformed
after the studies were completed. Preparation of a
longitudinal evaluation plan also leads to a redefinition of
the role of impact assessment studies in the process of park
creation, which is the second thrust to be considered in
this presentation.

Figure 4 presents headings and a flow sequence which
aid one in seeing how the process of creating a park might
be elaborated to include a continuing relationship between
impact forecasting and facility planning and development. It
is characterized by feedback loops which suggest the
necessary evaluation of the effects of implicit or explicit
modifications to the concept plan of a park at various
stages in the creation of a park. The first feedback loop
relates to assessing the consequences of any major changes
that result from the bilateral negotiations prior to signing
a formal agreement, and identifying what effects these
changes may have upon the accounts prepared in an original
economic impact study. This reassessment would logically
precede finalization of an agreement. The second feedback
loop is included to focus on the need to ensure that the
Master Development Plan is consistent with the objectives of
creating the park. In practical terms it points up the need
to form a linkage between the policy management functions of
park acquisition and the planning functions within an
organization. Finally, the figure implies that if a park is
to attain objectives set for it during a negotiation process
a monitoring process must be introduced to ensure that the
effects of modifications to the Master Development Plan are
evaluated and that such changes are consistent with the
desired results of creating the park.
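
A monitoring process of the kind described could be sketched as follows. This is only an illustration of comparing critical indicators against their predicted values and an acceptable range of variation; the Indicator class, the flag_deviations function and the 15 percent tolerance are hypothetical and are not taken from any Parks Canada procedure. The predicted and observed figures are those quoted in this paper for Kejimkujik (1,000 campsites planned against 330 built; roughly one third Nova Scotia visitors predicted against 69 percent observed).

```python
from dataclasses import dataclass


@dataclass
class Indicator:
    name: str
    predicted: float   # value forecast in the original impact assessment
    tolerance: float   # acceptable deviation, as a fraction of the prediction

    def within_range(self, observed: float) -> bool:
        """True if the observed value lies inside the acceptable range."""
        return abs(observed - self.predicted) <= self.tolerance * abs(self.predicted)


def flag_deviations(indicators, observations):
    """Return the names of indicators observed outside their acceptable range."""
    return [
        ind.name
        for ind in indicators
        if ind.name in observations and not ind.within_range(observations[ind.name])
    ]


# Predicted/observed values are quoted in the text; the tolerance is hypothetical.
indicators = [
    Indicator("campsites", predicted=1000, tolerance=0.15),
    Indicator("nova_scotia_share_pct", predicted=33, tolerance=0.15),
]
observed = {"campsites": 330, "nova_scotia_share_pct": 69}
print(flag_deviations(indicators, observed))
```

Both indicators fall well outside the assumed range, which is the kind of signal that would trigger a re-evaluation of the impact assessment.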

In addition to building in an on-going evaluation
process to detect potential unanticipated impacts, a follow-up
economic impact assessment when development of the park
has been completed is implied as desirable by the plan
shown. This study would be valuable even though a
longitudinal assessment procedure was introduced to document
unanticipated effects and to point out what improvement in
FIGURE 3


RECOMMENDED STEPS IN PREPARATION
OF AN ECONOMIC IMPACT ASSESSMENT


1. Identification of Objectives of Park Creation.

2. Specifications of Development Options of Concept Plan.

3. Delineation of Economic and Social Aspects to be Studied.

4. Selection of Base Data Re: Use, Expenditures,
   Multipliers, etc.

5. Definition of Projection Procedures to be Used to
   Generate Data Required to Estimate Future Impact.

6. Preparation of the Accounts by which the Concept Plan or
   Development Option will be Evaluated.

7. Specification of the Likely Magnitude of Error or
   Expected Range of Variation of Various Impacts.

8. Preparation of a Longitudinal Evaluation Plan that
   Identifies Critical Indicators to Monitor, and their
   Predicted Values and Acceptable Range of Variations, for
   Ensuring that Economic Objectives of Developing the Park
   are Attained.

impact could be facilitated by future park development.


DISCUSSION 


Actually, the variables being considered and the kind
of equation systems being developed to define use at a given
point in time (base figures) and to make projections give
one an insight into the problems involved in making reliable
use estimates and realistic projections. Figure 5 lists some
of the exogenous and endogenous variables that are relevant
in predicting the dynamics of the development of Kejimkujik
National Park. There was no attempt to be exhaustive in
making the lists; rather, a variety of variables were
identified (see Appendix). In the Appendix, the variables
listed in Figure 5 are discussed to provide an indication of
why and how these variables are relevant to the Kejimkujik
Park development.

Regardless of the complexity involved in stating a
specific set of equations to model park use, by looking at
FIGURE 4


A PROPOSED ROLE FOR SOCIO-ECONOMIC IMPACT
STUDIES IN THE PARK DEVELOPMENT PROCESS


Preparation of the Concept Plan

Economic Impact Assessment (Define Critical Indicators)

Evaluation and Reaction to Impact Assessment

Negotiations to Resolve Problem(s) (If Required)

Evaluation of the Effect of Revisions to Concept Plan
Upon Economic Assessment (Critical Indicators)

Agreement to Concept(s) of Park Development

Preparation of Tentative Master Development Plan

Review to Identify Where and How Master
Development Plan Varies from Concept Plan

Acceptance of Master Development Plan

Commencement of Park Development

Monitoring of Deviations from Master Development Plan

Evaluation of Effects of Modifications Upon Impact Assessment

Revisions to Master Development Plan

Negotiations to Resolve Problems

Follow-up Economic Impact Assessment


TN 40 CHAPTER VIII page 754 


Figure 6 one can get a fairly clear picture of the kinds of
relations between variables which are seen to be of concern
by Parks Canada. The flow chart in this figure shows the
"directions of causality" implied by events that have been
observed by Parks Canada.

In particular one may note that Figure 6 shows that a
change in an exogenous variable from series 1 of Figure 5
results in a change in one or many of the classes of use of
a park (series 3 of Figure 5). The structure of the figure
then suggests that changes in use levels operate to
influence the development of new facilities or the
modification of old facilities to meet use demands. As
well, the structure of the flow chart implies that the
modifications feed back to affect the amount of use of
the facilities just referred to (series 2 variables affecting
series 3). Similarly, use causes the development of roads
and thus development feeds back to cause use. The ultimate
desirability of considering the lead-lag factors that affect
reaching an equilibrium is admitted, but the problems
involved are not pursued here.

A third series 2 variable (2,3 of Figure 5), which is
somewhat different from the preceding two, is the class of
crowding variables. Here the situation implied by the way
Figure 6 is drawn is that increased use causes changes in
crowding, but that as well, facility development changes
crowding conditions. One could say that a 3-way feedback is
implied that results in some kind of an equilibrium between
crowding, use and the development of facilities.
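
The 3-way feedback just described can be illustrated with a toy simulation. This is a minimal sketch under stated assumptions, not a model used by Parks Canada: the functional forms, the parameter names (alpha, target_crowding, adjust) and all numerical values are hypothetical. Use is assumed to fall as crowding rises, crowding is use per unit of facility, and facilities are assumed to be adjusted gradually toward the level that would hold crowding at a target, optionally subject to a cap such as a freeze on development.

```python
def simulate(demand, alpha=0.5, target_crowding=0.8, adjust=0.3,
             facilities=100.0, cap=None, steps=500):
    """Iterate the use / crowding / facility feedback loop.

    Returns (use, facilities, crowding) after the given number of steps.
    All functional forms and parameters are hypothetical.
    """
    use = demand
    for _ in range(steps):
        crowding = use / facilities
        # Crowding discourages use.
        new_use = demand / (1.0 + alpha * crowding)
        # Facilities are adjusted toward the level that meets the target crowding.
        facilities += adjust * (use / target_crowding - facilities)
        if cap is not None:
            # An exogenous policy limit, e.g. a freeze on campground development.
            facilities = min(facilities, cap)
        use = new_use
    return use, facilities, use / facilities


free_use, free_fac, free_crowd = simulate(1000.0)
capped_use, capped_fac, capped_crowd = simulate(1000.0, cap=330.0)
print(round(free_use, 1), round(free_fac, 1), round(free_crowd, 2))
print(round(capped_use, 1), round(capped_fac, 1), round(capped_crowd, 2))
```

With these assumed parameters the unconstrained system settles at the target crowding level, while the capped system settles at lower use and higher crowding, which is the sense in which an exogenous policy decision shifts the equilibrium.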

Figure 6, by its simplicity, suggests that modelling of
park use is much simpler than it is in reality. Figure 7 is
presented because by examining it one begins to see the
great complexity of interrelationships between variables
that must be approximated if structurally adequate models of
the dynamics of park use are to be defined for use in
economic and social impact studies.

The concern with adequate approximations has been
introduced as an issue because, as one can guess by looking
at Figure 7, it is simply not possible, with the amount of
data that are available under normal budgetary and time
constraints, to develop a set of 40 or more equations
(unless Parks Canada over the long run develops a
generalized modelling system easily applied to particular
problems). Even when the equations indicated are adequately
defined, there are problems in carrying out an estimation
exercise to obtain the parameters for these equations (see
TN 11 and comments on parameter estimation problems that are
made there). Actual Parks Canada projects have invariably
involved only a few equations and to this point in time have
not involved estimation of parameters when capacity
constraints are actually enforced on an equation system.
There has been a need to be concerned with the limited
amount of data available, the quality of data (see, e.g., TN
21), the proper form for individual equations (TN 35, TN 33,
TN 30, TN 18, TN 1 and TN 3), the use of appropriate and




FIGURE 5


EXOGENOUS VARIABLES


(1,1)  The development of Nova Scotia Highways
       affecting the access to Kejimkujik National
       Park (independent of the use of Kejimkujik).

(1,2)  Level of development of facilities outside
       the park independent of the park (facilities
       not even necessarily in Nova Scotia).

(1,3)  Creation of a new National, Provincial or
       Private Park independent of the pressures of
       Kejimkujik.

(1,4)  Policy decisions affecting the interest rate
       on funding of tourism development (e.g.,
       creation of available low interest tourism
       development funds by the Government of
       Canada).

(1,5)  Weather: (a) series of bad seasons resulting
       in a change in the tendency to go to the
       park at all in future seasons; (b) weather
       conditions influencing visiting on a given
       weekend or weekday.

(1,6)  Infestation of pests (causing an undesirable
       impression of the park which must be
       overcome in subsequent seasons).

(1,7)  National Parks policies such as the "Freeze"
       on campground development or administration
       policies affecting the amount and quality of
       beach and wilderness available or how these
       areas are used.

(1,8)  Population shifts independent of Park
       creation.

(1,9)  National or Regional economic change
       affecting the disposable income available for
       trips to parks.

(1,10) Trends which involve drastic departure from
       past behaviour (e.g., are not explained by
       trends in past trends).


ACTION ENDOGENOUS VARIABLES


(2,1)  Development of new facilities or
       modification of old facilities to meet user
       "demand".

(2,2)  Development of new roads or improvement of
       old roads to meet weekend or weekday park
       load.

(2,3)  Crowding or overuse of: a. activity areas;
       b. beaches; and c. campgrounds - because of
       weekday and/or weekend use.


USE CONSEQUENCE ENDOGENOUS VARIABLES


(3,1)  Local weekday use.

(3,2)  Local weekend camper use.

(3,3)  Other intra-provincial weekday use.

(3,4)  Other intra-provincial weekend camping use.

(3,5)  Other uses (e.g., beach use or interpretive
       facilities).



efficient estimation procedures for single equation models
(TN 19) and other issues.
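
The size of the equation system summarized in Figure 7 can be made concrete with a short enumeration. This is only a sketch of the bookkeeping: ten use classes for each of four visitor origins give the 40 use equations, and a capacity constraint of the kind listed as equation 41 is represented by simply truncating predicted use at capacity. The names ORIGINS, USE_CLASSES, use_equations and constrained_use are illustrative, and the truncation rule is an assumption rather than a documented Parks Canada method.

```python
ORIGINS = ["local area", "rest of Nova Scotia", "rest of Canada", "U.S.A."]

# Equations 1-4: day-use and campground use, on weekdays and weekends.
BASE = [f"{facility}, {day}"
        for facility in ("day-use", "campground")
        for day in ("weekday", "weekend")]

# Equations 5-8 repeat these for holiday use; 9-10 cover wilderness use.
USE_CLASSES = (BASE
               + [f"holiday {use_class}" for use_class in BASE]
               + ["wilderness, weekend", "wilderness, holiday"])


def use_equations():
    """Map equation number (1-40) to its (origin, use class) pair."""
    return {
        10 * i + j + 1: (origin, use_class)
        for i, origin in enumerate(ORIGINS)
        for j, use_class in enumerate(USE_CLASSES)
    }


def constrained_use(predicted_use, capacity):
    """Apply a simple capacity constraint: use cannot exceed capacity."""
    return min(predicted_use, capacity)


equations = use_equations()
print(len(equations))
print(equations[1], equations[40])
print(constrained_use(400, 330))
```

Even this bare enumeration shows why estimating every equation separately is impractical with the data normally available.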

Actually, a number of CORD studies implicitly point out
the need for the kinds of use estimation equations referred
to in Figure 7 (see TN 1, TN 4, TN 8, TN 18, TN 30 and TN
33). The need for these equations is established both by
reference to the research of others and by the discussion of
specific situations that illustrate why models of a certain
type are appropriate to a given situation and why other
models are not.

One should note that the commentary on Figures 5, 6,
and 7 has not raised the issue of making projections.
Obviously the exogenous variables (series 1 variables of
Figure 5) were introduced because they should be part of the
input to a set of "dynamic" demand equations. However,
progress towards developing projection equations has not
been in the area of deriving equation systems for
individual sites. Effort has focused on estimating aggregate
FIGURE 6


THE GENERAL NATURE OF RELATIONS
DEFINING PARK USE


A change in a variable in the '1,' series of
Figure 5 (e.g., 1,1).

        |
        v

Results in a change in some or many types of
users (e.g., classes 3,1 and 3,2 of Figure 5).

        |
        v

Which relates to the interaction of series 2
variables of Figure 5 (i.e., 2,1, 2,2 and 2,3).



demand by people at various origins (see TN 6, TN 12, TN 13,
TN 20 and TN 36).

Parks Canada work on projection of aggregate demand
(origin modelling) cannot be described in detail here
because there are too many notational complications,
estimation problems requiring comment, etc. to allow a brief
review to be meaningful. Still, one may wish to note that
CORD Study research has considered the type of demand models
that Knetsch proposed for use (TN 34) and other similar
models. The results of extensive research (starting with
model formulation in TN 12) have been:


(1) to develop formulas for predicting the
    accuracy of estimates if a model is
    structurally acceptable (TN 6),

(2) to show structural problems with the models
    being considered and suggest solutions (TN
    20),

(3) to show the value of R² that is to be expected
    in the kind of modelling being done (TN 36),
    and
FIGURE 7


DESCRIPTIVE SUMMARY
OF EQUATIONS FOR PARK SUMMER SEASON USE
OF KEJIMKUJIK NATIONAL PARK


A. USE EQUATIONS¹


Equation

1.       Day-use; immediate local area for weekdays.

2.       Day-use; immediate local area for weekend and long
         weekend.

3.       Campground use; immediate local area for weekday
         (non-extended holiday weekday use).

4.       Campground use; immediate local area for weekend days
         and long weekend.²

5.-8.    Same equations for holiday use of the park (non-
         wilderness holiday use).

9.-10.   Wilderness use equations; for weekend days (9) and
         holidays (10).³

11.-20.  Similar equations for use of park by non-local people
         who are from rest of Nova Scotia.

21.-30.  Equations to estimate use in the same categories by
         non-Nova-Scotian residents of Canada.

31.-40.  Equations to estimate use in the same categories by
         U.S.A. residents.


B. CONSTRAINT EQUATIONS⁴


41.      Managed campground capacity constraint (campground
         use on a given day cannot exceed capacity).⁵

42.      Administrative control of wilderness use.⁶

43.      Facility and service capacity constraints (e.g.,
         parking availability for day-use; picnic area
         capacity constraint⁷; capacity for guided hikes;
         interpretive films, tours, lectures, etc.; and the
         like).


1. This figure does not explicitly introduce "en route",
   "main-destination" and other descriptions used in
   defining equations discussed or "estimated" in CORD Study
   research projects. One can consider that a full
   specification of all possible equations that should be
   considered would include, for example, en route and main-
   destination use equations for holiday use of Canadian
   National Parks by U.S. residents.

2. Independent variables should often include weekday
   loading and weekday turnover to reflect the space
   available for "new" weekend use, or these considerations
   must be built into the constraint equations.

3. The omission of weekday equations means that it is
   considered that "true" wilderness use of a National Park
   by a person cannot occur after work on a weekday.

4. As indicated in Figure 6, crowding is not viewed as a
   constraint. Rather, crowding variables are considered
   endogenous variables unless crowding control variables
   are introduced administratively as indicated in Equations
   41 and 42.

5. See operations policy directive - on the use of overflow
   campgrounds.

6. Parks Canada does not presently exert specific controls
   on the number of users in wilderness areas such as the
   "controlled" maximum number on a trail that the U.S. Park
   Service uses at Yosemite National Park.

7. The work presented in TN 16 has been greatly extended by
   Ontario and includes definitions of capacities for a
   number of activities.

(4) to suggest a quantitative projection strategy
    and problems with this strategy (TN 13).


CONCLUSION 


The title of this paper may have been misleading
because the two economic studies associated with Kejimkujik
National Park were used as a focus for a more general
examination of impact assessments of major parks in Canada.
The main concern has really been with deficiencies in impact
studies and how these may be overcome.

Although numerous papers that offer improvements of
models for use estimation were cited, it is not implied that
the major methodological problem with economic impact
studies of park development in Canada rests with the
mathematical and economic models used. The purpose of the
technical references is to communicate the current emphasis
in use estimation on differentiating the various types of
use that are made of an area depending on how it is
developed, and to draw attention to assessment of the
accuracy of estimates. Improvements in these technical areas
have very practical and obvious applications.

In the view of the authors it is important to stress
that the usefulness of economic impact assessments can be
increased by redefining the role they play in a longitudinal
process. So, in closing, the authors wish to reiterate their
view that even if a very "sophisticated" study of Kejimkujik
had been carried out in 1964, the results would have proven
to be disappointing. Regardless of what methods might have
been used to make estimates, there were at that time almost
no suitable data available from which to generate use
estimates for some base year, nor was there trend
information on park use that could have been projected into
the future. In addition, much of the methodology now
available was developed only after that study. But most
importantly, the need to build longitudinal follow-up into
the impact assessment of park development had not yet been
recognized as critical.


APPENDIX


The reader may find it interesting to note why some of
the exogenous variables that have been listed (see Figure 5)
have been indicated. In particular, he may be surprised to
find out that all of these are exogenous variables that have
influenced the development of Kejimkujik National Park.
Specifically, the extension of a four-lane highway south
from Halifax, which has been carried out independently of
the development of Kejimkujik National Park, does have a
tremendous impact on the accessibility of the park both for
people who do not come from Nova Scotia and for Halifax
residents who wish to go to the park.

Development of facilities outside the park independent
of development in the park is a catch-all for impacts that
could have occurred, or did occur. Certainly one can refer
to the development of the highway, cited as 1,1, as falling
under 1,2, but a development which is much more in the
spirit of what is referred to would have been the
development of a new national park which, at the time it was
being considered, was to be called Ship Harbour National
Park. Negotiations on this park proceeded to the point of
placing a land freeze on the land that was to be used for
development of the park, and it was only as a result of a
great deal of public pressure that the Ship Harbour Park was
not created. Had this park been created, the many users of
Kejimkujik who come from Halifax would have had a place they
could go which was much closer to Halifax and, consequently,
a drastic shift in the use of Kejimkujik National Park could
have been expected.

Exogenous variables 1,5 and 1,6 refer to weather
conditions and pest conditions that are well known to users
of the Atlantic Region National Parks. One year in
the Maritimes may be lovely, but then several years of bad
weather can serve as a real discouragement to people who
make their plans and go away for a holiday, only to end up
having 2 weeks of rain. Similarly, the infestations of
bagworms in Kejimkujik National Park in 1973 and 1974 have left
many people with an impression of the park that guarantees
they will choose to go somewhere else where they will not
continuously be wondering if one of these pests will fall
from a tree onto them or into the food they are preparing.

Variable 1,7 points up the importance of policy
decisions, such as a freeze on campground development, in
determining what use is made of a given national park. The
preceding discussion has pointed out that original plans for
Kejimkujik involved 1,000 campsites being built within the
park. Obviously a policy decision that has held down the
campsite development to 330 campsites has had an influence
on the amount of use that has occurred at the park.
However, in stating this, one must recognize that the
decision to freeze campground development was not an
endogenous decision based on lack of use; it was an
exogenous decision based on considerations of the best ways
to achieve regional development objectives.

Similarly, one can expect administrative policies
affecting the amount and quality of beaches and wilderness
to affect the use of National Parks. Here again, it is
important to note that the decisions referred to are not
decisions in response to present use, but management
decisions based on considerations external to the particular
park.

The variables listed as 1,8 through 1,10 are quite
possibly the ones that would have been expected in a list of
exogenous variables influencing park use. Though it is
obvious that the population shifts, the national and
regional economic changes and special transit activities do
influence the use of parks, it should be pointed out that it
is not obvious how to build these variables into a modelling
framework. This is a problem that has received extensive
treatment as part of the CORD Study, and Technical Notes on
this topic are cited in the paper.
CORD STUDY TECHNICAL NOTE 41


THE ECONOMICS AND POLICY IMPLICATIONS 
OF PRIVATE CAMPGROUND DEVELOPMENT 
IN NATIONAL PARK CENTRED VISITOR REGIONS: 
AN EMPIRICAL STUDY 


J. Lewis, G. Andrews, T. O'Riordan


ABSTRACT


In Canada, one of the poorly documented segments of the
tourism and recreation industry is that of private
campground development and operation. In particular, the
relationship between the public and private sectors has not
been fully examined. This study investigates the viability
of National Parks-oriented private campgrounds to provide
information that can be used to recommend public policy
options.

After a review of the literature, the paper focuses on
four eastern parks and the public-private campground mix in
and around them. It describes a survey of the operators of
forty private campgrounds, the analysis strategy followed,
and the results obtained.


PURPOSE 


The objective of this study is to investigate the
viability of National Park-oriented private campgrounds to
provide information for the evaluation of alternative public
policy options.


INTRODUCTION 


In Canada, one of the most poorly documented segments
of the tourism and recreation industry is that of private
campground development and operation. In particular, the
relationship between the public and private sectors has not
really been examined. With the increase in camping pressure
over the years, it is now apparent that problems are arising
and that questions pertinent to public policy should be
investigated to reduce or avoid existing or potential
conflict between the public and private sectors. Specific
problems include: how to best satisfy the demands of the
different types of campers; how to generate the greatest
local impact in terms of income and employment; how to avoid
a detrimental impact on the viability of certain parts of
the private sector; and how National Park planners,
policy makers and managers should react to suggestions of
expansion, curtailment or differentiation of their
activities (in light of the private sector) in the provision
of campground sites and facilities.


LITERATURE REVIEW 


General 


Camping continues to grow in popularity. In 1967, the
Bureau of Outdoor Recreation (see Reference 10) reported a
62 percent increase in camping days by Americans between
1960 and 1965, and projected a 450 percent increase in
camping days between 1960 and 2000. However, by 1972 the
Bureau noted that the total number of camping days had
quintupled (from 66,000 to 354,000) in the decade of the
sixties alone and that during the same period, the number of
camping days per person had grown from 06.9 to 3.0 (see
Reference 10).

Canadian data echo the USA findings. In 1968 a national
survey (see TN 22 and Volume III) found that 25 percent of
the interviewed sample camped in 1967 compared with 15
percent of those contacted four years previously. A similar
survey conducted in 1972 found that in the period 1969-1972,
participation in tent camping had grown from 12 to 19
percent, trailer camping from 6 to 10 percent and pick-up
camper use from 2 to percent amongst Canadians
interviewed. It is particularly significant to note that
while participation in camping as an activity has increased
there has, in recent years, also been an increase in the
number of camping days per person, i.e., individual campers
participating in camping have actually increased the number
of days that they camp during the camping season.

In terms of the Canadian National Parks system, camping
demand has grown consistently at around 7 percent per year
for the past 22 years; a doubling every ten years.
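
The link between a growth rate of about 7 percent per year
and a doubling every ten years can be checked with ordinary
compound-growth arithmetic. The short Python sketch below is
illustrative only: the function names are assumptions made
for this example and constant annual compounding is assumed.

```python
import math


def growth_factor(annual_rate: float, years: float) -> float:
    """Factor by which demand multiplies after compounding for `years`."""
    return (1.0 + annual_rate) ** years


def doubling_time(annual_rate: float) -> float:
    """Years needed for demand to double at a constant compound rate."""
    return math.log(2.0) / math.log(1.0 + annual_rate)


if __name__ == "__main__":
    rate = 0.07  # growth rate quoted in the text
    print(f"Growth over 10 years: x{growth_factor(rate, 10):.2f}")
    print(f"Doubling time: {doubling_time(rate):.1f} years")
```

At 7 percent the ten-year factor comes out just under two
and the doubling time a little over ten years, which is
consistent with the rounded statement in the text.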

The introduction of the recreation vehicle (RV)
coincided with important shifts in motivations and
expectations toward camping, changes that can be discerned
at all levels of the camping experience. While the hardy
tenter seeking solitude and a wilderness or quasi-wilderness
setting has by no means disappeared, he is finding it more
and more difficult to obtain a satisfactory camping
experience in the sense that his expectations are fully
realized. He is competing with another tenting "culture"
benefiting from modern lightweight and high quality
equipment and appearing to be more "urban oriented", thereby
expressing a preference for more developed back country
campgrounds, enjoying a certain degree of sociability and
generally regarding the back country trip as a pleasant but
short-lived relief from the urban grind. There are few hard
data to support these observations, but Stankey (Reference
57) and Lucas (Reference 44) have both observed that growth
in wilderness demand has primarily come from a group or
groups who spend a relatively short time in the back country
(one to two nights), who hike relatively short distances and
who prefer established campgrounds to camping in the "wild".
(This observation has been supported by Ian Leaman, Regional
Parks Officer in the British Columbia Parks Branch,
Department of Recreation and Conservation. Leaman estimates
that 90 percent of "back country users" seek the standard
campground facilities provided despite the vast tracts of
wild country that lie immediately adjacent to such areas.)

While there may be two or more tenting cultures, the
group referred to above as a whole differs noticeably from
the RV camping culture in terms of motivations, expectations
and camping behaviour. Clark and his associates (Reference
16) found that campers in large wayside campgrounds sought
high quality conveniences, entertainment and companionship,
and showed a great tolerance of crowding, noise, litter and
even vandalism, while still feeling that they were enjoying
a "wilderness" experience. Clark et al. called this group the
"urban camping culture" for in many respects they are
transferring an urban or suburban living pattern to a
quasi-natural area. The question soon becomes obvious. Should
public agencies be the major provider, or in any way a
provider, of campgrounds for these types of camping
enthusiasts?

Shafer (see Reference 56) points out that campers are
quite selective in their preferences, deliberately seeking
certain kinds of campgrounds to suit their needs. La Page
(see Reference 37) found that campers who chose private
campgrounds were far more likely to enjoy meeting other
campers than were campers visiting public campgrounds. La
Page and Ragain (see Reference 41) investigated the changing
camping behaviour of a mixed panel of 565 campers and found
that a significant number of the "hardy" class of camper had
cut back on their camping participation due to crowded
facilities and the difficulties of finding camping
environments satisfactorily suiting their preferences. In
noting that the decay in camping participation by this group
is offset by the increase in "first time" campers more
tolerant of crowding and damaged natural environments, the
authors warn that the "example" and "leadership" in "good
camping behaviour" provided by this group may be declining
to the detriment of the camping fraternity generally.

The supply of campgrounds is another factor that has
influenced the growth in camping demand, yet has also
reflected the differences in camping preferences among
modern campers. The public sector has responded positively
to the recent growth in camping demand and, as a result,
provided many of the new campgrounds presently existing in
North America. Part of the reason for this is a prevailing
belief that outdoor recreation is a social good that should
be provided as a public service at subsidized prices. This
is a controversial policy, with a long history. It stems in
part from a belief that outdoor recreation serves a vital
function in maintaining a stable and healthy community, and
in part from a belief that a subsidized service is the best
way to ensure that low income groups are not unduly
discriminated against. For discussion see Searles
(Reference 55), Mack and Meyers (Reference 45), Gans
(Reference 25) and Bechter (Reference 5). Public
campgrounds, particularly those found in National
and Provincial/State parks, provide remarkably high quality
and durable though somewhat standardized basic facilities
at low prices. It is, therefore, not surprising that new
public campsites cannot be built fast enough to meet the
burgeoning demands. Public policy makers, therefore, must
decide whether it is their objective to continue to try to
satisfy increases in campers' demands or to satisfy only on
a selective basis camping demands that cannot be met by the
private sector.

Public sector growth of campground development is
exemplified by British Columbia. Since 1956, the British
Columbia government has increased the number of campsites in
its provincial park system from 711 to 4,674, yet the number
of camper days per campsite is still growing (see Table 1).
In fact, throughout most of the peak demand period (July and
August) some 30 to 50 percent of the provincial park
campgrounds are full, causing uncounted numbers of
disappointed campers to search for other accommodation (see
Reference 51). The picture in the Canadian National Park
system is little different. Of 99 campgrounds for which firm
data were available, 82 turned away campers at least once in
July and August 1972, 47 were full on at least 10 days, 27
on at least 30 days, and eight on at least 60 days.
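
The "camping nights per campsite" column of Table 1 appears
to be the ratio of the first two columns, with the
parenthetical figure obtained by dividing by four as the
table footnote describes. The sketch below reproduces the
1955 and 1970 rows on that reading; the function names are
assumptions for illustration, and the published figures are
matched only approximately because the attendance totals are
given in rounded thousands.

```python
def nights_per_campsite(camping_nights_thousands: float, units: int) -> float:
    """Camping nights recorded per camping unit in one year."""
    return camping_nights_thousands * 1000.0 / units


def nightly_use(nights_per_site: float, divisor: float = 4.0) -> float:
    """Estimated nights of actual use, dividing by four as in the footnote."""
    return nights_per_site / divisor


if __name__ == "__main__":
    # (year, camping nights in thousands, camping units) from Table 1
    rows = [(1955, 160, 711), (1970, 1580, 4674)]
    for year, nights, units in rows:
        per_site = round(nights_per_campsite(nights, units))
        print(year, per_site, nightly_use(per_site))
```

On these figures, use per campsite rose by roughly half
between 1955 and 1970 even though the number of units grew
more than sixfold.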

The result of all this is growing pressure on publicly
provided campgrounds, which in effect means an increasing
subsidy by the general taxpayer to the user. From camper
fees, Canadian national park campgrounds generate only half
the annual costs of maintenance and capital depreciation,
according to evidence on file with Parks Canada. This means
a subsidy of the order of $3 per camper night, or a total
annual subsidy of some $3 million. British Columbia
provincial park campgrounds are enjoyed by campers who pay
only $2 -- one-third of the annual cost of construction,
depreciation and maintenance -- a subsidy of some $4 per
camper night, or a total annual subsidy of some $5 million.
The situation would not appear to be too different in other
provinces of Canada offering campgrounds in provincial parks
or similar areas. The policy has traditionally been to
provide campgrounds for all types of park visitors and to
subsidize their operation and maintenance costs as well as
their capital depreciation. With increases in camping
participation, the types of campers and the number of camper
days per person, it would appear that public policy makers
should reassess this situation, especially in light of an
increased capability of the private sector to do part of the
job.
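
The subsidy figures quoted above follow from the fee and the
share of cost that the fee recovers. The sketch below works
through the British Columbia case under the assumption that
the $2 fee and the full cost are both expressed per camper
night; the function names are hypothetical, and the implied
number of subsidized camper nights is an inference from the
quoted totals rather than a figure given in the study.

```python
def subsidy_per_night(fee: float, share_of_cost_recovered: float) -> float:
    """Subsidy per camper night when the fee covers only part of full cost."""
    full_cost = fee / share_of_cost_recovered
    return full_cost - fee


def implied_camper_nights(total_subsidy: float, per_night: float) -> float:
    """Camper nights implied by a total subsidy and a per-night subsidy."""
    return total_subsidy / per_night


if __name__ == "__main__":
    # British Columbia: $2 fee said to cover one-third of cost.
    bc = subsidy_per_night(2.0, 1.0 / 3.0)
    print(f"BC subsidy per camper night: ${bc:.2f}")
    print(f"BC implied camper nights: {implied_camper_nights(5_000_000, bc):,.0f}")
    # National parks: $3 per night and $3 million in total, as quoted.
    print(f"National implied camper nights: {implied_camper_nights(3_000_000, 3.0):,.0f}")
```

A $2 fee that recovers one-third of cost implies a full cost
of $6 and hence the $4 subsidy quoted for British Columbia.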


TABLE 1


CAMPING UNIT ATTENDANCE AND THE PROVISION OF CAMPING UNITS
IN BRITISH COLUMBIA PROVINCIAL PARKS
1955-1970


        Camping Nights    Number of        Number of
Year    (in thousands)    Camping Units    Camping Nights
                                           per Campsite

1955         160               711         225 (56.25)*
1956         200              1048         190 (47.50)
1957         250              1404         178 (44.50)
1958         400              1818         220 (55.00)
1959         550              2255         244 (61.00)
1960         650              2797         232 (58.00)
1961         760              3205         237 (59.25)
1962         840              3664         229 (57.25)
1963         910              3688         246 (61.50)
1964         850              3695         230 (57.50)
1965        1100              3733         294 (73.50)
1966        1170              3768         310 (77.50)
1967        1350              3802         355 (88.75)
1968        1280              4290         298 (74.50)
1969        1380              4625         300 (75.00)
1970        1580              4674         338 (84.50)


* This figure represents the average number of people using
a camping unit for the appropriate year. Actual nightly use
of campsites can be estimated by dividing by four. This
figure is provided in parentheses.


Source: Parks Branch, British Columbia Department of
Recreation and Conservation; Annual Reports.


The Private Campground Industry


La Page (see Reference 40), Burch (see Reference 8) and
Bevins (see Reference 6) have made some interesting
observations about the evolution of the private campground
industry. Bevins likens the pattern to the growth of the
grocery store, from the small come-by-chance operation
through successive stages of centralization and product
specialization to the large supermarket. In essence, the
evolution of the private campground is sketched as follows:


(1) Scattering (1900-1950). A disparate scattering
    of small commercial facilities, primarily
    catering to a well-established clientele,
    e.g., hunting and fishing groups.


(2) Invasion (1950-1965). The growth of the family
    camping market encourages small come-by-chance
    campgrounds, generally family-run or on
    family-owned land, not always well situated in
    relation to tourist travel patterns. Some do
    well by virtue of location, good management or
    sensible investment, but many remain small
    peripheral operations that fail to break even,
    even if labour is priced at its opportunity
    cost.


(3) Concentration (1965-197?). The rise of the
    large, high investment campground, efficiently
    run and well promoted. These are often
    franchised to camping chains (e.g.,
    Kampgrounds of America, Crazy Horse
    Campgrounds, Safari Campgrounds, etc.); they
    are managed as sound business propositions and
    are expected to make money. They cater
    especially to the urban camping culture
    mentioned earlier, and expect to make as much
    money from day use facilities as from nightly
    rental fees, if not more. (Bevins estimates
    that these campgrounds can expect to return at
    least 8 percent on investment over 20 years,
    even with moderate management. KOA operators
    expect to take $5 per camping family over
    and above rental fees, usually $4.)


La Page et al. (see Reference 42) found that the number
of commercial operations in New Hampshire grew at an average
annual rate of 8.3 percent over the seven year period
1964-1971, though there has been a more recent downward
trend in the construction of new campgrounds. Moeller (see
Reference 49) pointed out that for New England generally,
the ratio of public to private campgrounds shifted from two
to one in favour of public campgrounds in 1961, to four to
one in favour of commercial enterprises in 1967. In British
Columbia, Anderson (see Reference 2) found that private
campgrounds outnumbered public facilities by a factor of
four, and that 160 of 210 private operators interviewed
planned to expand their facilities in the foreseeable
future. A similar indication of optimism was reported in the
previously cited U.S. studies.

On the surface at least, the private campgrounds in
North America present a paradox, for they generally do not
pay, yet they are numerous and proliferating. All studies
point to the marginal nature of the majority of existing
private operations. Johnson (see Reference 32) remarked that
at least three of five private campgrounds fail within five
years, while half of the remainder do not break even. La
Page et al. found that 89 of 167 reporting campgrounds in
New Hampshire were unsuccessful in the sense that either
income and/or occupancy was below average and revenue was
insufficient to meet average costs. They comment (page 10
of Reference 42):

    A simple comparison of costs and returns revealed
    that the average campground will not break even
    unless: (1) the owner can avoid paying for labour
    and ignore depreciation, which is probably what a
    great many campground owners have done in order to
    stay in business; or (2) the campsite rental
    income can be supplemented by additional revenues
    that will net at least 19 cents for every dollar
    of campsite rentals.


In Anderson's study, 50 percent of the 217 reporting
managers claimed they made some profit, 25 percent said they
broke even, and 25 percent admitted to taking a loss.
However, the costs of labour and general maintenance were
not incorporated in their budgetary breakdown.

The successful campground should be large enough to
attract a sufficient "critical mass" of campers to make
these critical on-site, noncamping facilities pay. To meet
the narrow criterion of profitability (defined initially
even more narrowly in terms of attendance), most writers
agree that the minimum number of units is 70, though
profitability correlates positively with size through a very
wide range above this number. For example, in a survey of
New Brunswick private campgrounds (see Reference 50) it was
found that the average occupancy rate for campgrounds of
less than 10 units was 13 percent, while for campgrounds of
over 100 units, it was 41 percent.

Most of the studies available show that at least half
of the private operations are successful, within the
parameters of success established by the managers
themselves. Success is a subjective phenomenon, which is
clearly linked to management goals and motivations. There
has been an implicit assumption that the private campground
enterprise should meet all the requirements of a profitable
business to be successful -- promotion of product,
differentiation of service, efficiency of management and
maximization of net revenue through optimal fee structures.
Yet a number of writers (for example Loomis and Wilkins,
Reference 43, and Bevins, Reference 6) have hinted that
other motivations, not based on money or profit, might
apply. This matter is of fundamental importance in
assessing the function of the private campground sector in
modern times, for a purely economic calculation simply
cannot be applied in assessing the viability and intrinsic
merit of this sector. Therefore, in developing policy for
the private campground sector these types of factors should
be considered.

What motivates the private campground owner/manager? 
The ordering of the list that follows is not of particular
significance, as any combination of factors can be expected
in individual cases.


(1) Personal considerations. This group of factors
    includes the wish to remain active in
    retirement, despite a comfortable income, the
    love of meeting people, the satisfaction of
    providing a service that is appreciated and
    the contentment that comes from setting up and
    operating a family business. These are psychic
    benefits, which may act as very powerful
    drives for certain individuals who still seek a
    sense of fulfillment in their lives.


(2) Family considerations. The private campground
    is often regarded as a family enterprise to
    provide employment for the wife, children and
    relatives. The short season and variable
    weekly occupancy rates, plus the somewhat
    "menial" chores involved, are most appropriate
    for family labour with a stake in the total
    investment.


(3) Joint business operation. In all but a few
    instances, the private campground is a
    secondary source of income. In many cases, the
    primary occupation is a related business (a
    motel, riding stable, restaurant, etc.), but
    increasingly the campground may simply provide
    a relief from a city job.


(4) Investment security. In a period of sharply
    rising land values and steady erosion of the
    earning power of the dollar, the private
    campground is regarded as a sound investment,
    even though it may lose money on a yearly basis.
    Land values are rising at between 7 and 10
    percent annually -- even higher in areas that
    are recreationally attractive (lakeshores,
    hillsides, etc.) or beside areas of great
    recreation interest. Operating a private
    campground on choice land can often be a
    convenient method of offsetting taxation and
    may provide a useful holding operation before
    selling out to some profitable use.


According to La Page's hypothesis, the successful
campground is characterized by a high proportion of repeat
campers, a lengthy duration of stay and widespread word of
mouth "advertising" to attract new campers. The three
interact, for the satisfied camper is more likely to return,
to stay longer and to tell his friends and acquaintances
about the campground.

While all these things are important, the human element
is vital. The campground owner should be willing to work
hard, enjoy open socialization, and have a friendly, open
personality. Courteous, friendly and responsible management
not only maintains a clean attractive site, but encourages
campers to return and to recommend the place to their
friends. La Page (Reference 36) and Echelberger and Shafer
(Reference 23) found that managers who were experienced
campers tended to antagonize their clients by providing a
service which they thought was appropriate, but to which
many campers objected. Clark et al. (Reference 16) pointed
out that there is a disturbing difference of opinion between
what many campground managers regard as a suitable
atmosphere and the kind of experience many modern
recreationists are seeking.

Though profit maximization may not be a primary motive,
the private campground manager is nevertheless interested in
attracting a large and regular camping clientele. In order
to be successful in this regard, he must understand and seek
to meet the preferences of the particular camping culture he
wishes to attract. La Page (Reference 40) has developed a
useful schema of camper preferences and management behaviour
on which Figure 1 is based.

What are the features of the private campground that
satisfy the camper in the first place? Certainly a neat,
clean campground, with high quality sanitary facilities is a
primary requisite. The CORD Motivation Study (see Volume
III) showed that the quality of sanitary facilities, good
drinking water and a clean campground are regarded by most
Canadians as essential factors for making a good camping
trip. The growing use of the RV has placed much importance
on the "hook-up" campsite -- water and electricity
certainly, and in many cases sewage disposal as well. Many
campers simply expect this level of service now -- and
generally appear to be willing to pay for it. Security
against petty vandalism and unruly behaviour is also a very
important factor. Indeed many campers deliberately choose
private campgrounds to be sure of a peaceful night's rest.
In both cases the private sector can provide a better
service than most, if not all, public campgrounds.

Also very important is the provision of some form of
"day-time entertainment". Successful private campgrounds
provide a mix of services to meet campers' preferences on a
24-hour basis: the idea of overnight accommodation and
little else is only successful for transit campgrounds
located adjacent to major travel routes. Daytime facilities
include swimming pools, rental facilities, grocery and gift
stores, entertainment shows and the like.

Preferably, too, the campground should be near a major
scenic or tourist attraction, such as a park, an historic
site or an urban centre. The presence of water-oriented
recreation facilities seems to be particularly appealing.


Attractive recreation sites seem to attract
campgrounds, and hence it is not unusual to find a marked
clustering of campgrounds around areas of recreation
interest. Anderson (Reference 2) found that 61 percent of
his interviewed sample of 208 managers were less than 10
miles from a public campground, and 74 percent were within
five miles of private competitors. (Moeller, Reference 49,
has similar evidence of clustering for New England
campgrounds.) Clustering is advantageous in that it provides
flexibility of choice for the camper, allows for the
absorption of overspill and permits the positive
externalities of advertising and promotion.


Figure 1

Simple diagram showing relationship between campground
attributes, customer preferences and management to
make a private campground viable.
(after La Page, 1967, 1968a)

[Diagram labels: Camper Preference (past visits, camping
experience, trip plans, preference for campground
attributes such as size, facilities, sanitation, age,
location, accommodation arrangements, etc.); Greater
Likelihood of Return; slower turnover rate; guaranteed
customers (reservations ahead); good references (word of
mouth); good management-customer relations (improvement of
facilities); lower labour costs; ancillary income from
additional expenditures; lower advertising costs.]


SUMMARY 


There is no doubt that the private campground can
provide a set of services that are attractive to a segment
of the modern camping culture. How many campers actually
prefer private campgrounds to the exclusion of public
campgrounds (and vice versa) is not known, but there is some
evidence that a sizeable proportion of modern campers is
ready to go to either. Both the CORD Motivation Study and a
more recent study by La Page and Ragain (see Reference 41)
showed that between 20 and 40 percent of touring campers can
be termed "impulse campers", willing to shift their travel
schedule at a moment's notice. Obviously, this group
provides a prominent target for the sharp entrepreneur,
eager to encourage them to stay for the extra day or so, and
to return in the future. It would appear, therefore, that
public agencies should seriously consider a policy of
divesting themselves of part of the responsibility of
providing "all things for all people" when it comes to
campground facilities.

Public campgrounds in national and provincial/state
parks tend to provide a relatively standardized quality
service that is subsidized by the taxpayer. These
campgrounds are becoming increasingly expensive to construct
and operate, and are regularly over-crowded during the peak
season. The private sector is prepared to offer a
complementary service to absorb part of the demand, and to
provide the kind of camping experience that a sizeable group
of campers prefer. Should the public sector follow a policy
of giving the private sector more of this responsibility,
then it may follow that the public sector will, in the
future, have additional funds to allocate to the provision
of campgrounds of the type that the private sector cannot
possibly provide.


SURVEY OF PRIVATE CAMPGROUNDS IN ATLANTIC CANADA 


Four National Parks formed the foci for this study --
Prince Edward Island, Cape Breton Highlands, Kejimkujik and
Fundy (see Figure 2). Table 2 portrays the public-private
campground mix in and around these parks and also indicates
the popularity of the public campgrounds.


TABLE 2


THE PUBLIC-PRIVATE CAMPGROUND MIX
SURROUNDING ATLANTIC CANADA NATIONAL PARKS IN 1972


                        Number       Number of      Number of Nights
                        of Park      Private        Park Campgrounds
                        Campsites    Campsites      Full or Nearly
                                     in vicinity*   Full in 1972

Prince Edward
Island (PEI)            613 (3)**    1600 (24)**    50

Cape Breton Highlands
(Nova Scotia)           933 (7)       533 (7)       10-30***

Kejimkujik
(Nova Scotia)           330 (1)       300 (5)       30

Fundy
(New Brunswick)         960 (3)       784 (8)       30


*   defined as within a radius of 40 miles
**  number of campgrounds
*** varies from one campground to another


The author (O'Riordan) contacted and interviewed the
operators of 40 private campgrounds in the recreation
hinterlands of these National Parks -- eight in the vicinity
of Fundy, five near Kejimkujik, seven around Cape Breton
Highlands and 20 in Prince Edward Island. The interviews
followed a semi-structured format. Certain key questions
were asked of all respondents, but there was plenty of
flexibility built into the interview schedule to permit a
reasonably deep insight as to the nature of the private
campground industry in Atlantic Canada. Each interview
lasted for at least an hour, and many exceeded two hours.


Analysis Strategy 


Analysis of the interview results showed that four
categories of campground could be distinguished on the basis
of scale, level of service provided, management motivation,
income generation and profitability. These are:


(a) the small scale, well established campground
    (fewer than 60 units, more than 5 years in
    operation);


(b) the small scale, recently established
    campground (fewer than 60 units, less than 5
    years in operation);


(c) the middle scale, well established campground
    (60 - 150 units, more than 5 years in
    operation);


(d) the large scale, recently established
    campground (more than 150 units, less than 5
    years in operation).


Results


It should be noted that there is no dearth of private
campsites in the hinterlands of the parks surveyed, yet in
three of the four areas, public campsites outnumbered
private campsites (see Table 2).

While income generation was obviously an important
consideration, few operators were in the campground business
to make money. The viability of the industry should be
evaluated as much on its merits for providing a local,
friendly service to travellers as on the more rigid
yardstick of dollar profitability. Before analyzing the
industry in detail, some general observations are in order.

The Atlantic Provinces are not geographically well
suited for large numbers of tourists. Their own populations
are relatively small, they form a destination region rather
than a travel routeway, and they suffer from a comparatively
short summer season. Geographic isolation does mean that
once travellers decide to visit the region they tend to stay
a while. The unpredictable summer weather encourages the
travel-minded to tour around, making it difficult for
campground operators to capture the prolonged attendance
which is so desirable.

Probably the biggest problem facing the campground
manager is the short season. The peaking of camper demand
for the private sector is quite dramatic: with some
exceptions, most campgrounds report less than 10 percent
occupancy up to June 30, 40 percent July 1 to 7, 90 to 100
percent July 8 to August 13, 40 percent August 14 to 28 and
10 percent August 28 to Labor Day. Comparative occupancy
rates for National Park campgrounds for two selected months
in 1972 are detailed in Table 3. General exceptions to the
rule are campgrounds situated beside well travelled roads or
in secluded water-oriented recreation areas which pick up
early and late season local trade.


TABLE 3


OCCUPANCY IN MARITIME NATIONAL PARK CAMPGROUNDS
JUNE AND SEPTEMBER, 1972


                   Number        Number of Party
                   of            Nights/Day          % Occupancy
CAMPGROUND         Sites         June   September    June   Sept.

CAPE BRETON
Ingonish           108              6       6           5      5
Broad Cove         218             36      33          12     11
Black Brook        [illegible]      3       6           1      2
MacIntosh Brook     30             37      33          10     10
Cheticamp          250            196      25          13     10

P.E.I.
Stanhope           170             54      34          30     20
Rustico Island     148             14      60          10     40
Cavendish          320            147      64          42     22

FUNDY
Chignecto          525             12       8           2      1
Tenting
Headquarters       125            716      46          65     37
Trailer
Headquarters        29             24      23          80     75
Point Wolfe        250     [illegible]      7          11      3
Wolfe Lake          60     [illegible]     --          11     --

KEJIMKUJIK
Jeremy Bay         330             39      47  [illegible]    13


The sharp peaking of camper demand means that
campground operators have to provide high quality expensive
facilities (to meet the discriminating tastes of modern
campers) for a season that effectively lasts for only five
weeks. Of particular concern is the marginal cost-benefit
calculation for additional facilities such as flush toilets
and showers, which may only be used during the last two
weeks of July. With current costs (even excluding labor,
which is usually internalized) and high interest rates, these
expensive marginal investments could take 15 or more years
to be paid off. (For example, modern sanitary facilities
(toilets and showers) with proper sewage disposal can cost
between $20,000 and $30,000: at 10 percent, this would mean
annual payments of $2,350 - $3,425 over 20 years.)
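
The loan repayment figures quoted for sanitary facilities
can be checked against the standard level-payment (annuity)
formula. The sketch below assumes equal end-of-year payments
with annual compounding; that repayment schedule and the
function name are assumptions for illustration, since the
study does not state how its figures were derived.

```python
def annual_payment(principal: float, rate: float, years: int) -> float:
    """Level end-of-year payment that retires a loan over `years`."""
    return principal * rate / (1.0 - (1.0 + rate) ** -years)


if __name__ == "__main__":
    for cost in (20_000, 30_000):
        payment = annual_payment(cost, 0.10, 20)
        print(f"${cost:,} at 10 percent over 20 years: ${payment:,.0f} per year")
```

Under these assumptions the $20,000 case gives roughly
$2,350 per year, in line with the text, while the $30,000
case gives roughly $3,520, somewhat above the printed
$3,425; the difference may reflect a different repayment
assumption or a slip in the printed figure.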

Another factor contributing to the short, sharp nature
of the private campground season is the competition provided
by the public parks. In PEI for instance, few residents of
the province appear to camp in private campgrounds: usually
the beach is within a day's drive of home, or people tend to
camp in public parks. In 1972, some 55 percent of all
provincial park campground users were Islanders, and while
only 8 percent of the PEI national park campground users
came from PEI, some 50 percent of all users in June and
September were local people. In Nova Scotia and New
Brunswick, a similar pattern has been observed by local
officials, though hard data are less easy to come by. But the
practice of leaving a trailer in a secluded private
campground for weekend use throughout the summer is growing
in popularity, and provides a steady and guaranteed income
for the private operator, though often the rates are
remarkably low. (Season rentals ranged from $100 to $200 or
about $1 - $2 per day for a fully serviced site. Daily
rentals for similar sites range from $3.50 to $5.00.)

The four classes of campgrounds (as outlined in the 
analysis strategy) are described as follows: 


(a) The small-scale, well established campground.
    Five campgrounds fell into this class, and
    none made any money. Despite small capital
    investment ($20,000) and very low operation
    and maintenance costs ($800 - $1,500), plus
    very hardworking family labor, camper revenue
    rarely exceeds $2,500 annually. The moderate
    revenues reflect low attendance figures
    (rarely exceeding 20 percent for the season),
    and low fees ($1.50 to $2.50 per unit per
    night). Any net income is automatically
    ploughed back into the business for small
    scale improvements, but in 3 of the 5 cases,
    the owners were paying $1,000 - $2,000
    annually out of their own pockets to keep
    their operations going. The owners are
    basically trapped by the peculiar economics of
    the private campground industry (see Figure
    3).

    Because their campgrounds are relatively old,
    the basic facilities are somewhat primitive in
    the view of many modern campers, and often in
    need of repair. Competition has forced most of
    this group out of business. Those that hang on
    exist largely because they enjoy meeting and
    serving campers, and are committed to the
    operation as an investment. Because none of
    the owners is financially well-off, it is
    difficult for them to break out of a financial
    bind to upgrade existing facilities and, in
    any case, their campgrounds are not regarded
    as a sufficiently important source of income
    to warrant such a move. So the owners rely on
    repeat business (the length of time in
    operation is crucial here) and word-of-mouth
    advertising, and can depend upon attracting
    some people off the road who are unable to find
    accommodation in nearby public facilities.
    Depending upon their location in relation to
    recreation areas and/or main highways, repeat
    business ranges from 10 percent to 80 percent,
    multi-night campers range from 5 percent to 60
    percent, and word-of-mouth referral ranges
    from 20 percent to 80 percent. Even without
    some form of low-interest loan program, and
    assistance in advertising and technical
    services, some of these campgrounds will
    probably tick along at the margin for a number
    of years to come.


(b) The small-scale, newly established campground. 
As mentioned earlier in the general review 
(but contrary to the evolutionary process 
described above) this category of private 
campground is growing. Fifteen of the 41 
campgrounds studied fell into this class. This 
group is caught in another kind of trap (see 
Figure 4). Because these campgrounds are new, 
they have no established camping clientele and 
must compete in a tough market to attract the 
modern camper. This means expensive high 
quality facilities, including 2- and 3-way 
hookups, good showers and toilets, sewage 
disposal and a grocery store. Investment 
costs easily run to $50,000, and operating 
costs are also high ($6,000 - $12,000 annually) 
due to high interest payments, rapid 
depreciation, insurance and utility bills, 
advertising costs and, in some cases, 
part-time labor. 
During the initial years, camping revenue is 
pitifully small - $500 - $2,500 annually. 
About 90 percent of campers spend less than 
two nights, particularly if the weather is 
poor, so the difficulties of enticing them to 
stay longer are most challenging. Low camper 
demand is not simply due to newness, but often 
to poor location: 5 of the 15 campgrounds in 
this group were located off main highways, and 
another 8 had no recreationally attractive 
additional campground features to encourage 
campers to linger. The long-term prospects of 
this group will depend upon location, 
management and quality and variety of on-site 
services. Any form of low-cost advertising is 
tremendously important, as would be 
low-interest loans and technical assistance, to 
assist expansion. But it should be emphasized 
that this category of campground will always 
be financially marginal, unless its capital 
investment is expanded and it develops a more 
aggressive marketing policy. But the question 
arises - is there enough camper demand to 
justify this increase in supply? This issue 
will be discussed in the last section of this 
paper. 


(c) The middle-scale, well established campground. 
This group is well established (9 of 13 had 
been operating for five seasons or more), is 
generally well located and has invested in 
some on-site facilities to encourage 
multi-night visits. This group is solvent, well 
located vis-a-vis recreation attractions 
(including the nearby national park), and 
benefits from a satisfied camping patronage, 
as outlined in Figure 1. Whereas 16 of the 20 
small-scale campgrounds reported that 40% - 
90% of visitors stayed only one night, and few 
remained more than two nights, the majority of 
campers visiting the middle-scale group stayed 
for a week or even two weeks. Capital costs 
range from $40,000 to $100,000, depending upon 
the number of campsites and the capital 
intensiveness of income generating activities 
(see Figure 5). Operating costs are also high 
- largely because of additional labor - 
running from $3,000 - $8,000 annually. The 
revenue picture is bright - ranging between 
$5,000 and $15,000 from camper fees and $1,000 
to $15,000 from additional facilities. Net 
revenues ranged from around $2,000 to $10,000 
or more, depending upon whether the ancillary 
business was campground related or not. In no 
case are big incomes being made, but these 
owners can face competition and provide a very 
acceptable service. In many cases, the 
building up of the campground to improve its 
investment potential for the capturing of 
added value (whether by hard work or by 
speculative pressures) is a major economic 
motivation. 


(d) The large-scale campground. This group is 
distinguished by size (exceeding 150 units), 
scale of investment (exceeding $100,000) and 
the fact that some of the management is 
professional. The aim is to make money, 
particularly through day use facilities 
(restaurants, gift shops, pin ball machines, 
rental facilities, etc.). Unfortunately, all 
but one of the seven campgrounds falling into 
this class were in operation for less than two 
seasons, so it is impossible to predict their 
viability. Investment costs are enormous 
(ranging up to one million dollars) and 
operating costs are likewise high ($20,000 to 
$50,000) - the labor costs alone may exceed 
$25,000. Indeed, one of the large-scale 
campgrounds spent more on cash registers alone 
($25,000) than a well established small-scale 
campground owner will invest in his lifetime. 
This group relies on expensive publicity 
(advertising costs range from $1,900 to 
$4,000), on the very best quality sites, and 
on expensively planned space. Two- and three- 
way hookup sites are almost mandatory, costing 
$600 to $2,000 each to install (as against 
$200 to $1,000 for similar sites in smaller 
campgrounds, where most of the operators do 
their own plumbing and wiring), even if land 
costs $500 to $1,000 an acre. 

Of critical significance to the large-scale 
campground is the fact that the future profits 
will be determined (though to an uncertain 
extent) by national park campground policy. 
Initially, the private operators' aim is to 
absorb and hold as much of the national park 
overspill as possible. Their more distant 
objective is to see the national parks abandon 
the campground business altogether. It soon 
becomes apparent, therefore, that if national 
park campground policy is unclear or 
continually changing, then even operators 
fitting the category of large-scale 
campgrounds will find it difficult to make 
wise decisions and long-term commitments to 
their enterprise. 


DISCUSSION AND CONCLUSIONS 


A national park, in many cases, forms a focus for a 
tourist or recreation hinterland which, in certain 
instances, can provide a variety of recreation experiences 
of a natural, social and cultural character. The park offers 
a distinctive natural setting in which camping should be a 
complementary activity, impinging only lightly on the 
intrinsic natural values of the park. Simple functionalism 
should be the policy -- basic facilities, no frills, but the 
opportunity for different camping cultures to seek a 
satisfying experience within a limited range of campground 
options provided by the park. 

It would appear that national park campgrounds should 
offer a complementary service to the private sector. Where 
unnecessary competition exists, it should be eliminated, as 
may be the case with national parks providing fully serviced 
campsites. 

A second area of unnecessary competition may exist in 
the area of early and late season camping. National park 
policy allows for campgrounds to be made available to park 
visitors early and late during the season at no charge, 
since it is not economically viable to collect fees. The 
policy rationale in this case is that the parks and 
campgrounds are paid for by all Canadians, so they should be 
made available if at all possible. However, this policy 
makes it impossible for adjacent private campgrounds to 
compete during the off season. Obviously, the policy 
requires some re-evaluation. 

Comparative fee structures should also be analyzed. In 
some cases, the national park campgrounds charge more for 
serviced sites and less for unserviced sites than does the 
private sector. Should national parks raise their fees for 
the latter kind of site, the private sector (especially at 
the smaller scale) would benefit enormously. Yet camper 
demand would hardly be altered, as price elasticity in this 
part of the fee structure is pretty low (see Reference 39; 
51). The main point, however, is that it would appear that 
national park campground fee structure policy is developed 
with little regard for the private operator. 

In the United States, some national parks have embarked 
on a reservation system, which guarantees a site to those 
who buy an advance "insurance" fee. Should Canadian national 
park campgrounds follow this practice? On the one hand, the 
reservation system can give campers greater assurance of a 
place and can permit parks officials to turn overspill 
campers to adjacent private campgrounds. On the other hand, 
a reservation system might stop campers from even visiting 
an area once they learn that the national park campgrounds 
are full. On balance, it would appear that a reservation 
policy should be discouraged, especially in areas where 
adjacent private campgrounds exist. 

Since national parks, in many cases, serve as the focus 
of a tourist region, it might be suggested that they should 
be responsible, in part, for establishing (along with 
provincial and local tourist interests) regional tourist 
information centres, providing information not simply on the 
park, but on nearby indoor and outdoor recreation 
attractions and local accommodation. While this would 
require a change in the existing national park information 
policy, it would offer the benefit of providing the camper 
with informed choices as to the variety of campground 
facilities available, as well as giving him alternatives 
when his first preferred campground is full. The notion of a 
national park-centred tourist region could give a much 
needed advertising boost to the smaller campgrounds, many of 
which would do quite well if they were better known. 

It should be noted, however, that the above policy 
might not solve the private operator's initial financial 
difficulties. Here, a careful case-by-case analysis is 
necessary before any policy of low interest loans can be 
advocated. (With only two exceptions, campground owners 
surveyed did not desire a "handout" grant; they were happy 
to work to pay off a loan. However, the peculiar economics 
of the industry might well necessitate subsidized loans -- 
say at 6 percent -- for certain basic facilities.) The 
question of standards and technical assistance can partly be 
dealt with through existing minimum standards set by 
provincial agencies. But the sharing of technical know-how 
could be made possible through provincial campground owners' 
associations, such as is done in New Brunswick. (A fine 
example of this practice is the work being done by the New 
Brunswick Campground Owners Association.) In conjunction 
with this is the related alternative of changing the 
existing national parks policy so as to enable technical 
expertise from Parks Canada to be made available to the 
private sector. 

Finally, the national park-centred tourist region 
carries with it the more subtle issue of protecting local 
cultural values. The well established small campground 
entrepreneur, who may appear economically marginal, is an 
intrinsic part of the local scene. He knows the folklore, he 
knows local customs and he is in regular contact with local 
people who can offer a marvellous service to the camper 
anxious to identify with the local scene. The private 
campground industry is a part of a local culture, which is 
complementary to the natural scenery of the national parks. 
To link the two would help forge a regional, participatory 
visitor service, which could be enjoyed by local residents 
and non-residents alike. 


REVIEW 


L. Douglas 


The papers presented in this chapter represent a 
paradox. In a technical sense they are rather simple, at 
least in comparison to some of the technical notes found in 
other chapters. On the other hand, some are highly 
innovative and provide significant additions to a poorly 
developed area of park and recreation research. 

In the past, evaluation and allocation modelling has 
played only a minor role in Canadian park and recreation 
decision making. Senior administrators have often stated 
that much of the analysis that has taken place has been too 
narrow in focus and not orientated to the type of decisions 
which managers have to make. These critics have challenged 
researchers to develop simple yet meaningful measures of 
level of service, frameworks for evaluating alternative 
sites and projects, and better methods for analyzing the 
economic impact of parks. 

In a general sense, TN 5, 17 and 26 provide the policy-maker 
with a number of ways of determining the level of 
recreation service available to certain populations. 
Implicit in these models is the concept of equity of 
opportunity and an interest in isolating areas which are 
relatively well or poorly serviced. TN 5 and 17 develop 
measures of opportunity potentially available to different 
populations. These papers make no use of participation data. 
In contrast, TN 26 uses only participation data in 
developing internal standards for various areas and 
socio-economic groups. 

TN 5 shows how the concept of potential surfaces may be 
used to develop measures for evaluating the distribution of 
opportunities to participate in outdoor recreation. Two 
approaches for specifying "opportunity quotients" are 
developed in the article. With the first alternative, it is 
proposed that competition for supply is best measured at the 
recreation site. However, a means of measuring total 
pressure exerted on a site is not proposed. With the second 
alternative a proposal is made that competition be measured 
at the residence of each individual. The concept of 
population potential is suggested as providing a surrogate 
measure of this competitiveness. The "opportunity quotient" 
developed in the second alternative is, in some respects, 
suggested to be superior to the one developed in the first 
alternative since it more accurately reflects how a person 
perceives his relative opportunity. However, neither the 
question of which distance function to use with a specific 
gravity function, nor the question of how to obtain the best 
gravity function is answered in the article. The matter of 
how to measure site attractivity is also left unattended. 
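
To make the second alternative more concrete, the following 
is a minimal sketch of a residence-based quotient built from 
two gravity-type potentials. It is not the formulation given 
in TN 5: the inverse-power distance function, the exponent, 
the ratio form of the quotient and all function names are 
assumptions chosen for illustration, and the questions of 
distance function and site attractivity noted above remain 
open. 

```python
def potential(masses, distances, exponent=1.0):
    """Gravity-type potential at one location.

    Each mass (a population or a unit of supply) is discounted
    by its distance raised to an assumed exponent. Distances
    must be positive, so a nominal within-zone distance is
    needed for the residence's own population.
    """
    if any(d <= 0 for d in distances):
        raise ValueError("distances must be positive")
    return sum(m / d ** exponent for m, d in zip(masses, distances))


def opportunity_quotient(site_supply, dist_to_sites,
                         populations, dist_to_populations,
                         exponent=1.0):
    """Hypothetical quotient: supply potential divided by
    population potential, both measured at the residence."""
    supply = potential(site_supply, dist_to_sites, exponent)
    competition = potential(populations, dist_to_populations, exponent)
    return supply / competition


# Illustrative (invented) figures for a single residence zone.
quotient = opportunity_quotient(
    site_supply=[100, 200], dist_to_sites=[10, 20],
    populations=[1000, 4000], dist_to_populations=[5, 20],
)
print(f"Opportunity quotient: {quotient:.3f}")
```

A higher quotient indicates more accessible supply relative 
to the competing population nearby. Changing the exponent 
changes the ranking of zones, which is one reason the choice 
of distance function matters. 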

A number of matters need to be considered before it can 
be accepted that either of the proposed measures reflects the 
recreation opportunities perceived by people living at 
specific locations. For example, "opportunity quotients", as 
defined by the two alternative methods, should be calculated 
for specific areas and then compared with the perceived 
opportunities as stated by people living in those areas. 
Other useful projects could investigate whether or not these 
measures relate to potentials as defined in the behavioural 
models postulated in TN 33-4 ) 

TN 17 shows how policies such as those requiring that a 
certain per capita level of opportunity be potentially 
available within two hours travelling time of urban centres 
can be used to define a model which is appropriate for 
evaluating how recreation opportunities relate to 
populations. In this context, it is not of direct interest 
to know whether people use the opportunities allocated to 
them by the mathematical model. Rather, the concern is the 
number of opportunities potentially available to people 
living in various urban centres. Output from the model can 
then be used to evaluate the potential level of service 
presently provided to each urban centre in the study area 
against either a predetermined standard or an internal 
standard, such as the weighted average supply per capita for 
the study area. Once this evaluation is completed, expected 
population changes can be used as input to the model and, in 
the absence of additional supply, future ratios of supply 
per capita can be computed for all urban centres. 
Alternative plans for introducing new supply into the system 
then can be developed and evaluated through repeated runs of 
the model. Eventually, a plan can be selected which best 
relates an agency's objective of providing a given level 
of service to the agency's expected fiscal resources within 
the planning period. 
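
The evaluation cycle described above can be sketched in a few 
lines. The allocation rule used here (each site's supply is 
shared among the urban centres within the travel-time limit 
in proportion to their populations) is an assumption made for 
illustration and is not taken from TN 17; the function names 
and figures are likewise hypothetical. 

```python
def supply_per_capita(populations, supplies, travel_hours, max_hours=2.0):
    """Allocate each site's supply to the centres within max_hours,
    in proportion to population, and return supply per capita.

    populations:  {centre: population}
    supplies:     {site: units of supply}
    travel_hours: {(centre, site): travel time in hours}
    """
    allocated = {centre: 0.0 for centre in populations}
    for site, units in supplies.items():
        in_range = [c for c in populations
                    if travel_hours[(c, site)] <= max_hours]
        total_pop = sum(populations[c] for c in in_range)
        if total_pop == 0:
            continue  # no centre can reach this site
        for c in in_range:
            allocated[c] += units * populations[c] / total_pop
    return {c: allocated[c] / populations[c] for c in populations}


def weighted_average(per_capita, populations):
    """Population-weighted average supply per capita (internal standard)."""
    total_pop = sum(populations.values())
    return sum(per_capita[c] * populations[c] for c in populations) / total_pop


# Invented example: two centres, two sites.
populations = {"A": 1000, "B": 3000}
supplies = {"S1": 400, "S2": 100}
travel_hours = {("A", "S1"): 1.0, ("B", "S1"): 1.5,
                ("A", "S2"): 3.0, ("B", "S2"): 1.0}

ratios = supply_per_capita(populations, supplies, travel_hours)
standard = weighted_average(ratios, populations)
below = [c for c in ratios if ratios[c] < standard]
print(ratios, standard, below)
```

Re-running the same functions with projected populations, or 
with additional entries in the supply table, corresponds to 
the repeated model runs described in the text. 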

The allocation and evaluation approaches described in 
TN 5 and 17 do not make use of information on people's 
behaviour. In many cases, however, a decision maker may 
wish to take existing behaviour into account because it is 
believed that behaviour reflects level of opportunity. TN 26 
offers a technique which can be applied to participation 
data in order to develop internal standards. It is stressed 
in the article that the analysis of such data should be done 
in a disaggregated way. For example, it is suggested that 
better policies can be developed by looking at participation 
rates of age-sex groups as opposed to only looking at a 
crude participation rate for an entire population. A careful 
distinction is made among various participation measures 
such as incidence of participation per capita, etc. It is 
pointed out in the paper that the selection of a 
participation measure (or measures) to be used in an 
analysis entirely depends upon the objective of the analysis 
and the range of actions likely to be taken as a result of 
the findings of the analysis. In this regard it is stated 
that participation rates based on income differentials may 
not be of any interest in a program evaluation analysis, 
whereas they may be critically important to the master 
planning of a particular park. 
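
The difference between a crude participation rate and 
disaggregated rates can be illustrated as follows. The 
figures are invented and the age-sex groupings are 
hypothetical; TN 26 should be consulted for the measures it 
actually defines. 

```python
def participation_rates(groups):
    """Return the participation rate for each group.

    groups: {group_label: (participants, population)}
    """
    return {label: participants / population
            for label, (participants, population) in groups.items()}


def crude_rate(groups):
    """Participation rate for the whole population, ignoring groups."""
    participants = sum(p for p, _ in groups.values())
    population = sum(n for _, n in groups.values())
    return participants / population


# Invented survey counts: (participants, population) by age-sex group.
survey = {
    ("15-24", "M"): (60, 100),
    ("65+", "F"): (10, 100),
}

by_group = participation_rates(survey)
overall = crude_rate(survey)
print(by_group, overall)
```

Here the crude rate of 35 percent describes neither group 
well, which is the argument for disaggregation made in the 
article. 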

The remainder of notes in Chapter VIII relate, more or 
less, to evaluations and allocations based on economic 
considerations. The next two papers commented on consider 
the problems of project and site selection. 

A multi-dimensional scaling system for project 
selection is developed in TN 25. In this scoring system, 
intangible variables are combined to produce a single 
subjective score for each project on a list of candidates. 
Next, a procedure is described by which the intangible score 
can be combined with the corresponding benefit-cost ratio of 
the project in a way that allows the decision maker to 
control the amount of influence that intangible factors have 
on the final project score. 
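
One way such a two-stage score might be assembled is sketched 
below. This is not the procedure of TN 25: the weighted 
average used for the intangible score, the scaling of 
benefit-cost ratios against the best candidate, the linear 
blend controlled by an "influence" parameter, and all names 
and figures are assumptions made for illustration. 

```python
def intangible_score(ratings, weights):
    """Weighted average of a project's ratings on each dimension."""
    total_weight = sum(weights.values())
    return sum(ratings[dim] * weights[dim] for dim in weights) / total_weight


def final_scores(projects, weights, influence):
    """Blend economic and intangible scores.

    influence = 0 ranks purely on benefit-cost ratio;
    influence = 1 ranks purely on the intangible score.
    """
    if not 0.0 <= influence <= 1.0:
        raise ValueError("influence must lie between 0 and 1")
    best_ratio = max(p["bc_ratio"] for p in projects.values())
    scores = {}
    for name, p in projects.items():
        # Assumed scaling: economic score relative to the best ratio.
        economic = 100.0 * p["bc_ratio"] / best_ratio
        intangible = intangible_score(p["ratings"], weights)
        scores[name] = (1.0 - influence) * economic + influence * intangible
    return scores


# Invented candidates, rated 0-100 on two hypothetical dimensions.
projects = {
    "A": {"bc_ratio": 2.0, "ratings": {"scenic": 50, "access": 50}},
    "B": {"bc_ratio": 1.0, "ratings": {"scenic": 100, "access": 80}},
}
weights = {"scenic": 1.0, "access": 1.0}

for influence in (0.0, 0.5, 1.0):
    print(influence, final_scores(projects, weights, influence))
```

Keeping the economic and intangible components separate until 
the last line of the calculation is what lets the decision 
maker see how far the ranking moves as the influence of the 
intangibles is increased. 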

The note includes some rather critical insights into 
the way intangible variables should be combined. For 
example, it is pointed out that the dimensions of evaluation 
should be independent and that the best alternative project 
on a dimension cannot be necessarily assigned a value of 
"100" while the poorest project is assigned a value of "0". 

There is no recognition in the paper, however, that the 
weight each dimension should have in determining a project's 
value may vary from project to project. In the paper it is 
assumed that an average weight for each dimension can be 
applied to each project in the list. This assumption is only 
valid when essentially similar types of projects are being 
compared. 

Still, this criticism of TN 25 is of minor importance 
given the more substantive problem of using intuition to 
specify scale values. In many cases, lack of time or money 
precludes thorough research being undertaken. Yet, a manager 
may still wish to subjectively include certain factors in 
the selection of a project. The scoring system described in 
TN 25 allows such intangibles to be included in a simple yet 
structured way. By keeping economic and non-economic factors 
separate throughout the analysis, it is possible to combine 
them in the final step in a controllable way. The procedure 
described, consequently, provides the manager with the 
opportunity of deciding how far he is willing to deviate 
from an economic optimum in order to achieve other 
objectives. 

When an agency has used a model such as the ones 
described in TN 5 or TN 17 there still may be questions 
about making the optimal use of land. One approach to 
defining optimal use is to define an objective function and 
then to apply linear programming to determine whether 
certain lands would, for example, be best used for producing 
forest products, agricultural products, recreation products, 
or some combination of these. So when, for example, growth 
in a recreation system has been defined using the methods 
described in the discussion of TN 17, it might be considered 
feasible to develop a catalogue of potential lands for parks 
in areas which will face increasing pressure. Then, at any 
time, linear programming incorporating contemporary budget 
and other price constraints can be applied to select which 
of the lands should be developed for park purposes. 
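
A very small instance of such a selection problem is sketched 
below. It is a deliberate simplification and not the model of 
TN 23: it assumes a single budget constraint, a catalogue in 
which each parcel has a known cost and a known recreation 
value, and that a parcel may be partly developed. For that 
special case the linear program (maximize total value subject 
to total cost not exceeding the budget, with each parcel's 
developed share between 0 and 1) is solved exactly by taking 
parcels in descending order of value per dollar. All names 
and figures are hypothetical. 

```python
def select_parcels(parcels, budget):
    """Solve the one-constraint land selection LP.

    parcels: {name: (value, cost)} with positive costs.
    Returns ({name: share developed, 0..1}, total value).
    """
    if any(cost <= 0 for _, cost in parcels.values()):
        raise ValueError("parcel costs must be positive")
    # Greedy by value per dollar is optimal for this LP.
    ranked = sorted(parcels.items(),
                    key=lambda item: item[1][0] / item[1][1],
                    reverse=True)
    shares, total_value, remaining = {}, 0.0, float(budget)
    for name, (value, cost) in ranked:
        share = min(1.0, remaining / cost) if remaining > 0 else 0.0
        shares[name] = share
        total_value += share * value
        remaining -= share * cost
    return shares, total_value


# Invented catalogue: (recreation value, development cost).
catalogue = {"North": (60, 10), "Lake": (100, 20), "Ridge": (120, 30)}
shares, value = select_parcels(catalogue, budget=50)
print(shares, value)
```

With several constraints, or with competing land uses such as 
forestry and agriculture, the greedy rule no longer applies 
and a general linear programming solver would be needed. 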

The main point of TN 23, however, is that over- 
automation of planning can lead to poor decisions. The 
creation of a park rarely only has the objective of 
providing a certain number of picnic tables. One must be 
careful to realize that a linear programming solution which 
indicates that a park should be developed on a particular 
site does not recognize differences in the number of jobs 
associated with different land uses nor does it recognize 
that future changes may occur in the relative price of the 
products that may be produced on that land. Discussing the 
results of a sensitivity analysis of the linear programming 
solution then only serves to illustrate that there are many 
options which are not automatically pointed out or even 
realized when overly automated evaluations and allocation 
procedures are used. 

The authors of TN 23 note that the opportunity cost 
approach used for specifying the price of the outdoor 
recreation product in the linear programming application 
only permits the assignment of a minimum price. Furthermore, 
the opportunity cost approach says nothing about the value 
of the recreation product as perceived by consumers because 
of the absence of a demand curve. 

Related to the need to have an estimate of the value of 
a proposed park is the discussion about time bias found in 
TN 31. In this article it is pointed out that estimates of 
the demand function that are only based on travel distances 
may severely underestimate the value of certain parks. The 
authors suggest alternative ways of incorporating a factor 
to correct time bias into a demand function and then they 
proceed to compute the effect of these alternative 
corrections on the estimated value of a park. Consequently, 
TN 31 provides the manager with an example of how widely the 
estimated value of a park may vary, depending on the 
assumptions used in developing the demand schedule. Given 
the sensitivity of the linear programming solution found in 
TN 23, the manager may desire to have several estimates of 
value computed for a proposed park so that he can choose the 
one that he believes is most appropriate. 
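
The direction of the time bias can be shown with a crude 
numerical sketch. It does not reproduce the corrections 
proposed in TN 31. It assumes a zonal approach in which each 
visitor's surplus is the difference between the trip cost 
faced by the most distant zone and his own trip cost, and it 
treats the vehicle cost per kilometre, the travel speed and 
the money value of an hour of travel time as free parameters. 
All names and figures are hypothetical. 

```python
def trip_cost(distance_km, cost_per_km, speed_kmh, value_of_time):
    """Round-trip cost: vehicle cost plus an assumed cost of travel time."""
    round_trip_km = 2 * distance_km
    hours = round_trip_km / speed_kmh
    return round_trip_km * cost_per_km + hours * value_of_time


def park_value(zones, cost_per_km, speed_kmh, value_of_time):
    """Sum of visitor surpluses measured against the costliest zone.

    zones: list of (distance_km, visits).
    """
    costs = [trip_cost(d, cost_per_km, speed_kmh, value_of_time)
             for d, _ in zones]
    marginal_cost = max(costs)
    return sum((marginal_cost - cost) * visits
               for cost, (_, visits) in zip(costs, zones))


# Invented origin zones: (one-way distance in km, annual visits).
zones = [(10, 1000), (50, 500), (100, 100)]

distance_only = park_value(zones, cost_per_km=0.10, speed_kmh=80,
                           value_of_time=0.0)
with_time = park_value(zones, cost_per_km=0.10, speed_kmh=80,
                       value_of_time=2.0)
print(distance_only, with_time)
```

Because time costs rise with distance, ignoring them narrows 
the cost differences between zones and lowers the estimate. 
Running the calculation under several values of time gives 
the manager the range of estimates referred to above. 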

TN 39 is really only a focused review of the traditional 
literature on the economic impact of parks. The perspective 
presented in this paper is complemented by TN 40 where a 
broader view is taken on how the assessment of impact 
relates to the park planning process. The note leaves much 
to be desired because the literature on impact evaluation is 
now extensive and has progressed significantly since 1969. 
Many of the very real problems in collecting data for 
assessment evaluation, such as the distribution of benefits 
and costs, are generally more subtle than recognized in this 
paper. Consequently, the paper should only be used as an 
introductory guide to impact assessment. 

TN 40 presents a broad perspective of the factors 
influencing the development of a park rather than a 
discussion of the classical economic issues related to the 
impact of a park. It is stated in the article that many of 
the impact statements prepared at the park proposal stage 
have not turned out to be accurate because the factors 
defined in the original situations have changed. Results of 
impact studies have frequently been invalidated because 
impact was not considered when the original development 
concept was altered during the acquisition negotiation 
process or during the master planning process. The main 
point of the article is that the assessment of impact can 
only really be carried out effectively when a close linkage 
is retained between the impact assessment and the 
acquisition-planning process. If impact assessment is 
considered important, it is not enough to study impact at 
one point in time. The entire planning process from concept 
development through master planning and construction must be 
audited. Consideration must be given to the various 
exogenous variables that influence development, and 
continuing feedback between the planners designing the park 
and those estimating its impact must be maintained. 

TN 41 provides a general description of the relations 
among campground characteristics and corresponding economic 
viability. The analysis, by its very nature, presents 
aggregated information and is consequently of limited value 
in drawing specific conclusions. Still, such studies are 
useful since they provide general guidance to the planner or 
manager who wants to consider roles that private campgrounds 
might play in park development schemes. 

Even at a general level, however, a discussion of 
campground economics should consider certain other factors. 
For example, a more detailed account of development and 
maintenance costs is required. It would also be useful to 
know the expected rates of return from the development of 
special attractions. Finally, the paper would be more 
complete if it included a discussion of how specific types 
of clientele can be attracted to a campground and if it 
presented the advantages and disadvantages of pursuing such 
a marketing policy. 

The various technical notes found in Chapter VIII 
represent initial attempts at providing program planners and 
policy makers with technical aids to making decisions. As 
pointed out in this review, the most important consideration 
in applying these models and other techniques is that they 
should accurately reflect the policy on which they are 
supposedly based. Such techniques must, moreover, clearly 
describe the implications of adopting general policies or 
making decisions related to specific projects. 

Several different types of perspectives may be taken 
when policies and plans are being developed for the future 
provision of recreation facilities. One perspective could 
emphasize the importance of existing patterns of behaviour 
and base future plans on policies either related to 
modifying these patterns or to encouraging them to continue. 
If this perspective is taken, accurate models of existing 
behaviour are prerequisites to effective policy analysis and 
development. 

In contrast, another perspective could be taken in 
which a priori assumptions, based on policy, are made about 
how facilities should relate to populations. A model based 
on these assumptions would be formulated and used to 
describe the existing situation. Finally, future plans that 
result in a normative distribution of facilities in relation 
to expected future populations would be developed. 

In many cases it can be argued that a policy or plan 
should never be developed from only one perspective. Such a 
view, however, should not be taken as an excuse for using a 
model ill-suited to the objective of the particular analysis 
being undertaken. Policy making is often a very complex 
exercise and it is the duty of the researcher or analyst to 
present, accurately and unambiguously, the implications 
associated with adopting various policy perspectives. 


Future Research 


From a review of the nine Technical Notes found in 
Chapter VIII, one can suggest a number of lines of future 
research. In respect to TN 5 and TN 17, it would be useful 
to develop a means of prorating supply according to the 
factors of quality and variety. TN 26 describes how an 
almost infinite list of internal standards can be computed. 
A paper could be written that would outline how to use the 
results of methods described in TN 12 and TN 10 to indicate 
which standards should be considered when making a 
particular allocation. The method proposed for scoring 
projects, described in TN 25, could be improved by the 
incorporation of the concept that a particular dimension may 
not be equally important to all projects on a list of 
candidates. 

Regarding TN 23, further research could centre on 
designing procedures to test the sensitivity of linear 
programming for the purpose of allocating land to different 
uses. It is doubtful, however, whether the linear 
programming approach will gain widespread acceptance from 
park planners until techniques for defining the value of a 
particular recreation experience are improved. Further 
empirical work incorporating the concepts developed in TN 31 
offers some promise in this direction. Probably, the main 
question regarding impact assessment is the selection of the 
proper level of disaggregation in analysis to obtain an 
accurate picture of who benefits and who pays when a park is 
developed. 

The real priorities for research on modelling and 
allocation must, however, be defined by those individuals 
working at the interface of policy making and research. It 
is only through continuous feedback between these people 
that consensus can be reached about what techniques need to 
be developed for specific types of analysis. 


CHAPTER IX 


TREND ANALYSIS AND PROJECTION ANALYSIS 


INTRODUCTION 


The two Technical Notes in this chapter are 
obviously not the only Technical Notes that have to do 
with trends and projections. In fact the reader may have 
the distinct feeling that TN 22 is out of place in this 
volume as it is the only one that focuses on a 
straightforward interpretation of data. There is no 
attempt at generalization but only commentary on what 
certain survey results show. However, this TN is 
included because, after the 1972 National Survey of 
Canadian Participation in Outdoor Activities, it was 
recognized that there was no base information available 
on trends in participation in outdoor activities. Also 
it was recognized that a note like this one would 
implicitly show the problems that arise when definitions 
are changed from year to year and when samples are so 
small that results cannot be presented for certain 
geographic areas for which it is very natural in Canada 
to want to have information. Specifically, one will note 
the disclaimer that the results for provinces may be so 
inaccurate as to be meaningless. TN 22 thus shows what 
could be done to define trends in participation in 
outdoor activities in 1972 on the basis of the surveys 
carried out and the CORD Study analysis that had taken 
place to that point in time. 

TN 13 stands in stark contrast in that it contains no
analysis of data but rather proposals for what can be done,
given the methodological development that has taken place
in the Canadian Outdoor Recreation Demand Study. Between
the time when TN 22 was prepared and the time that TN 13
was pulled together as a presentation to a group of Ontario
geographers, TN 12 had been completed, TN 6 and 20 were in
draft form and the ideas implicit in TN 11 and 20 were in a
preliminary form. Many ideas had crystallized, and this
made it easy not only to come up with an analysis procedure
that should be followed but also to point out the problems
in following it with existing data.

Thus in this section the reader is confronted with what may
be considered a mundane analysis followed by a thought
piece that he might think more appropriate for the theory
chapter of this volume. Regardless, the choice was to
include these two notes in this chapter. In particular, TN
13 lays out a quantitative route to follow in carrying out
medium-run (5 to 15 years) projections.

It is unfortunate that no Delphi or other less conventional
projection exercise was carried out that could have served
as a third paper in this series. However, serious interest
in more modern approaches to futures research only began to
play a role in the thinking of people involved in the CORD
Study near the end of the study. Such work is now being
carried on outside the Canadian Outdoor Recreation Demand
Study.

CORD STUDY TECHNICAL NOTE 22


TRENDS IN PARTICIPATION 
IN OUTDOOR RECREATION ACTIVITIES 


S. Rousseau


ABSTRACT 


In this paper data from three national surveys (in 1967,
1969 and 1972) are compared to provide insight into changes
in the patterns of participation in outdoor recreation
activities in Canada.

After the characteristics of each of the surveys are
explained, the trends of the five-year period (1967 - 1972)
are analysed for eight outdoor recreation activities on
which comparable data were available. Then changes in the
incidence of participation between 1969 and 1972 are
analysed for seventeen outdoor recreation activities.
Finally, changes in the frequency of participation are
briefly analysed.

Nineteen tables are presented throughout the discussion to
give insight into regional trends and trends by age group,
by sex and by socio-economic level.

It is concluded that, in the general context of increasing
participation in outdoor recreation activities in Canada,
there are activities and sections of Canadian society that
show some decrease in participation, a fact requiring some
explanation.


INTRODUCTION 


The goal of this paper is to comment briefly on data that
give insight into trends in patterns of Canadian
participation in outdoor recreation activities. Data from
three national surveys, called the National Surveys, are
compared. They give information on participation in certain
outdoor recreation activities in the year prior to the Fall
of 1967, 1969 and 1972, respectively. The data permit one
to gain insight into the likely trends in participation in
outdoor recreation activities for Canada as a whole.
Moreover, they permit one to recognize changes in
participation associated with region, age group, gender and
socio-economic level, as well as a number of other
variables.

Unfortunately, one point must be made abundantly clear
regarding the figures presented in this paper. All are
based on very small samples (the unweighted universe for
1969 is, for example, only 3000). Thus much of the
variation, particularly in figures reported for the smaller
provinces, is statistical in nature, a product of small
sample size rather than of real change. One need only
compare participation rates in activities in Saskatchewan
and Manitoba for 1969 and 1972 to become suspicious of a
number of the rates presented. In part this paper has been
made available to get information out to people so that
they can react to it. In defense of what has been done, one
must ultimately ask whether decisions on issues requiring
trend data are better made with some information than with
none at all. But the reader is warned that, rather than
deprive him of information, sometimes questionable figures
have been provided.
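
The warning about small samples can be made concrete with a rough calculation. The sketch below, in Python, computes an approximate 95 percent margin of error for a reported participation rate. It assumes simple random sampling, which the weighted National Surveys do not strictly satisfy, and the provincial subsample size of 150 is purely hypothetical, since provincial sample sizes are not given in this Note; only the national figure of about 3000 comes from the text.

```python
import math


def margin_of_error(p_percent, n, z=1.96):
    """Approximate 95% margin of error, in percentage points, for a
    proportion estimated from a simple random sample of size n."""
    p = p_percent / 100.0
    return 100.0 * z * math.sqrt(p * (1.0 - p) / n)


# National sample: about 3000 respondents (the figure quoted for 1969).
national = margin_of_error(12, 3000)

# Hypothetical provincial subsample of 150 respondents (an assumption;
# provincial sample sizes are not stated in this Note).
provincial = margin_of_error(12, 150)

print(f"national:   +/- {national:.1f} percentage points")
print(f"provincial: +/- {provincial:.1f} percentage points")
```

Under these assumptions a 12 percent rate carries a margin of roughly one percentage point nationally but about five points for the hypothetical provincial subsample, so a provincial change of several points between surveys may be no more than sampling noise.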


THE DATA 


A complete explanation of the data used for this paper
appears in Chapter 3 of Volume III. Table 2 in Chapter 3
(Volume III) is an adaptation of a table that was prepared
for this Note. The questions asked in the 1969 and 1972
surveys to determine the incidence of participation in
outdoor recreation activities were identical. However, the
activities examined were not. The question read as follows:


"Now I'm going to read you a list of outdoor
activities. As I read each one, would you tell me
approximately how many times, if at all, you have
done each within the past year - that is, since
about this time last year."


Unfortunately, in the 1967 survey, the question was 
formulated in the following way: 


"Here is a list of things that some people tell us
they like to do out-of-doors. Which of these
things have you done during the past year with
your leisure time out-of-doors?"


As can be seen, the question asked in 1969 and 1972 permits
one to determine not only the incidence but also the
frequency of participation in outdoor recreation
activities, while the 1967 question permits one to
determine only the incidence of participation. This
information is sufficient to draw trends in the incidence
of participation, but one must be cautious concerning the
comparability of the 1967 results with the results of the
two other surveys.

The 1967 survey yielded information on twenty-two outdoor
recreation activities, the 1968 survey on five activities,
the 1969 survey on twenty-six activities and the 1972
survey on eighteen. The 1968 survey data are not used in
this analysis. Their incompatibility with the data of the
other three years can be readily seen in Table 2 (Chapter
3, Volume III), and it is this that led to the decision not
to present 1968 results here. Actually, only eight
activities were surveyed in an identical manner in 1967,
1969 and 1972, although questions about seventeen
activities are worded identically in the 1969 and 1972
surveys (see Table 1).


TABLE 1 


COMPARABLE ACTIVITIES 
IN THE CORD NATIONAL PARTICIPATION SURVEYS 
1967 - 1972 


A. Eight Comparable Activities

Tent-Camping
Trailer-Camping
Hunting
Power Boating
Canoeing
Snow Skiing
Snowmobiling
Picnics and Cookouts


B. The Twenty-Two Comparable Activities

Tent-Camping
Trailer-Camping
Hunting
Power Boating
Canoeing
Sailing
Visit Historic Parks & Sites
Driving for Pleasure
Sightseeing from Private Vehicles
Snow Skiing
Snowmobiling
Picnicking
Walking/Hiking
Ice Skating
Horseback Riding
Bicycling


These facts determined the choice of presenting two
sections of discussion on trends: the first is concerned
with the eight activities comparable in the three surveys,
and the second with the seventeen activities comparable in
the 1969 and 1972 surveys. Readers interested in knowing
more about the four National Surveys referred to here, or
about other CORD Study data, may refer to Volume III. A
number of reports that might be of interest are listed in
the Appendix discussed in the following section.


TRENDS FROM THE 1967, 1969 AND 1972 NATIONAL SURVEYS 


A more detailed picture of the evolution of the incidence
of participation in outdoor recreation activities in Canada
is available in Appendix B of a special version of this
paper, available from the Leisure Studies Data Bank,
Waterloo Research Institute, at the University of Waterloo.

As is indicated in Table 2, of the eight selected outdoor
activities for which data are comparable in 1967, 1969 and
1972, there is only one activity, Hunting, for which there
was a steady decline in incidence of participation from one
survey to the next. For the other seven activities there
was an increase in incidence of participation between the
1967 survey and the 1972 survey. It should be noted,
however, that only three activities (Power Boating,
Canoeing and Snowmobiling) showed a steady increase in
incidence of participation from one survey to the next.
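
The distinction drawn above between a steady trend and a net change over the five years can be expressed as a simple rule. The following Python sketch is illustrative only: the function name and category labels are not part of the CORD Study, and the rates are the national percentages as read from Table 2 (the 1969 and 1972 Hunting figures being those also given in Table 14).

```python
def classify_trend(y1967, y1969, y1972):
    """Label a three-survey series as a steady or a net movement."""
    if y1967 < y1969 < y1972:
        return "steady increase"
    if y1967 > y1969 > y1972:
        return "steady decline"
    if y1972 > y1967:
        return "net increase, not steady"
    if y1972 < y1967:
        return "net decline, not steady"
    return "no net change"


# Percent of Canadians 18 and over participating (1967, 1969, 1972).
rates = {
    "Trailer-camping": (7, 6, 10),
    "Hunting": (14, 13, 11),
    "Snow Skiing": (6, 7, 7),
    "Snowmobiling": (7, 14, 18),
    "Picnics/cookouts": (42, 54, 54),
}

for activity, values in rates.items():
    print(f"{activity}: {classify_trend(*values)}")
```

A rule of this kind makes explicit that an activity such as Trailer-camping can show a net increase over the five years even though it dipped between the first two surveys.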

In Tables 3 to 10, regional and national trends are given
for each of the eight outdoor recreation activities for
which there are data in the 1967, 1969 and 1972 surveys.

In Table 3 the figures indicate that, in all regions except
the Atlantic Provinces, the incidence of participation in
Tent-camping was higher in 1972 than in 1967. In four
provinces (Quebec, Manitoba, Saskatchewan and Alberta) the
incidence of participation increased steadily from 1967
through 1969 to 1972.

The figures in Table 4 indicate that the incidence of
participation in Trailer-camping was higher in 1972 than in
1967 in all provinces. The percentage of individuals 18
years and over who participated in Trailer-camping tripled
in Manitoba and Alberta between 1967 and 1972. There was,
however, a constant increase between each of the three
surveys for only two provinces: Manitoba and Saskatchewan.

The figures in Table 5 indicate that the incidence of
Hunting was greater in 1972 than in 1967 for two provinces
only: Manitoba and Saskatchewan. For Ontario, the
percentage of individuals 18 years and over who
participated in Hunting declined steadily from 1967 through
1969 to 1972. For Canada as a whole, participation in
Hunting declined slightly but steadily through the years,
although this pattern of steady decline does not seem to be
general to all provinces. Between 1969 and 1972 percentages
of participants increased in two regions: the Atlantic
Provinces and Manitoba.
TABLE 2

PERCENTAGE OF CANADIANS 18 YEARS AND OVER WHO DID
PARTICIPATE IN 8 OUTDOOR RECREATION ACTIVITIES IN
1967, 1969 AND 1972

ACTIVITY                           1967   1969   1972
                                     %      %      %
Tent-camping                        14     12     19
Trailer-camping                      7      6     10
Hunting                             14     13     11
Power boating                       15     19     23
Canoeing                             5      8     10
Snow Skiing                          6      7      7
Snowmobiling                         7     14     18
Picnics/cookouts away from home     42     54     54

TABLE 3

PERCENTAGE OF INDIVIDUALS 18 YEARS AND OVER WHO DID
PARTICIPATE IN TENT-CAMPING IN 1967, 1969 AND 1972, BY
REGION

                          1967          1969          1972
                            %             %             %
Canada                     14            12            19
Atlantic Provinces    [illegible]   [illegible]   [illegible]
Québec                [illegible]   [illegible]       18
Ontario               [illegible]       10            19
Manitoba                    8            10            18
Saskatchewan                8       [illegible]   [illegible]
Alberta                    16            17       [illegible]
British Columbia      [illegible]       14            24
TABLE 4

PERCENTAGE OF INDIVIDUALS 18 YEARS AND OVER WHO DID
PARTICIPATE IN TRAILER-CAMPING IN 1967, 1969 AND 1972,
BY REGION

[Table values not legible in the source.]


TABLE 5

PERCENTAGE OF INDIVIDUALS 18 YEARS AND OVER WHO DID
PARTICIPATE IN HUNTING IN 1967, 1969 AND 1972, BY
REGION

[Table values not legible in the source.]
In contrast to Hunting, the percentages of participants in
Power Boating (Table 6) increased in all regions between
1967 and 1972. This increase was steady from each survey to
the next for three regions: Quebec, Alberta and British
Columbia.

Between 1967 and 1972 the percentages of participants in
Canoeing (Table 7) increased in all regions except the
Atlantic Provinces. Where percentages increased, they did
so steadily from each survey to the next except in
Saskatchewan. In several regions the increases between 1967
and 1972 were particularly impressive: in Quebec,
percentages more than tripled, in Manitoba they multiplied
by 4.5, and in Saskatchewan and British Columbia by four.

Examination of the figures in Table 8 indicates that there
were no clear trends for participation in Snow Skiing
between 1967 and 1972. However, the increase in
participation between 1967 and 1972 was impressive in
Manitoba, Saskatchewan and British Columbia.

Percentages of participants in Snowmobiling (Table 9)
increased in all regions between 1967 and 1972. In almost
all regions this increase in participation was impressive,
particularly in the Atlantic Provinces, where the
percentage of individuals who participated in Snowmobiling
tripled, and in Saskatchewan, where it multiplied by almost
four. However, British Columbia and Alberta do not seem to
have sustained a Snowmobiling "boom".

For all regions the percentages of participants in
Picnicking (Table 10) increased between 1967 and 1972.
However, the 1972 figures are often not as high as the 1969
figures.

As can be seen from Table 11, the five-year participation
trend was generally positive for all age groups and for the
eight activities considered. However, the following
exception should be noted: participation in Hunting
declined between 1967 and 1972 for all age groups.

In Table 12 it can be seen that the five-year participation
trend (1967 to 1972) was positive for all activities for
both sexes with the following exceptions: (a) the five-year
trend for Hunting was negative for both sexes; and (b)
while the five-year trend for Snow Skiing was slightly
negative for males, it was positive for females.
TABLE 6

PERCENTAGE OF INDIVIDUALS 18 YEARS AND OVER WHO DID
PARTICIPATE IN POWER BOATING IN 1967, 1969 AND 1972,
BY REGION

                          1967          1969          1972
                            %             %             %
Canada                     15            19            23
Atlantic Provinces    [illegible]       14       [illegible]
Québec                [illegible]       14            25
Ontario                    19            23            23
Manitoba              [illegible]   [illegible]   [illegible]
Saskatchewan               10       [illegible]       20
Alberta               [illegible]       18       [illegible]
British Columbia           20            24       [illegible]

TABLE 7

PERCENTAGE OF INDIVIDUALS 18 YEARS AND OVER WHO DID
PARTICIPATE IN CANOEING IN 1967, 1969 AND 1972, BY REGION

                          1967          1969          1972
                            %             %             %
Canada                      5             8            10
Atlantic Provinces          3       [illegible]        2
Québec                [illegible]        9       [illegible]
Ontario                     8            10       [illegible]
Manitoba              [illegible]        3             3
Saskatchewan                1             9             4
Alberta                     6       [illegible]        8
British Columbia            3             5       [illegible]
TABLE 8

PERCENTAGE OF INDIVIDUALS 18 YEARS AND OVER WHO DID
PARTICIPATE IN SNOW SKIING IN 1967, 1969 AND 1972, BY REGION

                          1967          1969          1972
                            %             %             %
Canada                      6             7             7
Atlantic Provinces          4             4             2
Québec                      8             8             9
Ontario                     6             9             6
Manitoba                    2       [illegible]        6
Saskatchewan                2             4             4
Alberta                     7             8             6
British Columbia      [illegible]        5       [illegible]


TABLE 9

PERCENTAGE OF INDIVIDUALS 18 YEARS AND OVER WHO DID
PARTICIPATE IN SNOWMOBILING IN 1967, 1969 AND 1972, BY REGION

                          1967          1969          1972
                            %             %             %
Canada                      7            14            18
Atlantic Provinces          6       [illegible]       18
Québec                [illegible]       21            25
Ontario               [illegible]       10       [illegible]
Manitoba                    7             4            16
Saskatchewan                6            26       [illegible]
Alberta                     6       [illegible]       10
British Columbia            4       [illegible]   [illegible]
TABLE 10

PERCENTAGE OF INDIVIDUALS 18 YEARS AND OVER WHO DID
PARTICIPATE IN PICNICKING IN 1967, 1969 AND 1972,
BY REGION

                       1967   1969   1972
                         %      %      %
Canada                  42     54     54
Atlantic Provinces      42     50     50
Québec                  33     48     46
Ontario                 44     56     56
Manitoba                37     38     59
Saskatchewan            57     69     62
Alberta                 49     68     61
British Columbia        45     58     55
TABLE 11

PERCENTAGE OF CANADIANS WHO DID PARTICIPATE IN 8
SELECTED OUTDOOR RECREATION ACTIVITIES IN 1967,
1969 AND 1972, BY AGE GROUP

[Table values not legible in the source.]
TABLE 12

PERCENTAGE OF CANADIANS WHO DID PARTICIPATE IN
8 SELECTED OUTDOOR RECREATION ACTIVITIES IN 1967,
1969 AND 1972, BY SEX

ACTIVITIES         SEX       1967          1969          1972
                               %             %             %

Tent-camping        M         16            15            22
                    F    [illegible]       10            16

Trailer-camping     M          7             5            10
                    F          6             6            10

Hunting             M         23            23            19
                    F          4             3             3

Power boating       M    [illegible]       24            27
                    F    [illegible]       14            20

Canoeing            M          6            11            12
                    F          3             5             8

Snow Skiing         M          8             9             7
                    F    [illegible]        5             8

Snowmobiling        M          8            17            19
                    F          6            11            17

Picnicking          M         36            51            52
                    F         47            57            55
Table 13 allows one to see the trends in participation in
outdoor recreation by socio-economic level. The
socio-economic levels used (Upper, Upper Middle, Middle,
Lower Middle and Lower) were determined by the interviewers
and are, at best, approximations of objective
socio-economic status. The CORD Study Data Documentation
Volume contains information on how these levels were
determined.

The five-year (1967 to 1972) trend was positive for all
activities in each socio-economic level with the following
exceptions: (a) for Hunting, the trend was negative in each
socio-economic level; and (b) for Snow Skiing, the trend
was positive for all levels except the Upper Middle level,
where there was no change from 1967 to 1972.


CHANGE IN THE INCIDENCE OF PARTICIPATION 
BETWEEN 1969 AND 1972 


As is evident from the analysis of the previous data, it is
not infrequent that short-term fluctuations in
participation contradict the long-term trends. Analysis of
fluctuations over the three-year period will not help
determine long-term trends but will assist in gaining
insight into the evolution of participation in seventeen
outdoor recreation activities between 1969 and 1972.

As can be seen from Table 14, in Canada as a whole between
1969 and 1972, participation decreased slightly in four
outdoor recreation activities (Hunting, Visiting Historic
Parks/Sites, Driving for Pleasure and Sightseeing from a
Private Vehicle), remained stable in three activities (Snow
Skiing, Picnicking and Horseback Riding) and increased in
ten activities out of the seventeen. The increase in
participation in Camping with a Pick-Up Camper was
particularly impressive (100 percent increase).
Incidentally, one should note that the decreases cited are
not incompatible with an increase in total activity.
Frequency of participation by those who do participate has
gone up.
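
The final column of Table 14, and the percentage increases in Tables 16 to 18, follow from the formula (b - a) x 100 / a, where a and b are the 1969 and 1972 percentages of participants. A minimal Python sketch of that calculation is given below; the function name is an assumption made for illustration, and the sample rows are taken from Table 14.

```python
def percent_change(a, b):
    """Relative change from a (1969 rate) to b (1972 rate), in percent."""
    if a == 0:
        raise ValueError("percentage change is undefined for a zero base")
    return (b - a) * 100.0 / a


# (1969 %, 1972 %) for selected activities, as read from Table 14.
table14_rows = {
    "Trailer-camping": (6, 10),
    "Pick-up camper": (2, 4),
    "Canoeing": (8, 10),
    "Snowmobiling": (14, 18),
    "Picnicking": (54, 54),
}

for activity, (a, b) in table14_rows.items():
    print(f"{activity}: {b - a:+d} points, {percent_change(a, b):+.1f} percent")
```

The tabulated value for Snowmobiling is 28, whereas the unrounded result is about 28.6, so the published figures appear not to be rounded uniformly. A change from 2 to 4 percent also registers as a 100 percent increase, which is why percentage changes on small bases should be read with the sampling caveat in mind.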

Tables such as Table 14 have been prepared for most
provinces and some metropolitan areas and are presented in
Appendix B of the special version of this paper cited
earlier. However, since the primary concern here is with
trends, Table 15 presents a comparison of the percentage
changes in participation between 1969 and 1972 for Canada,
most provinces and some metropolitan areas.




TABLE 13

PERCENTAGE OF CANADIANS WHO DID PARTICIPATE IN 8
SELECTED OUTDOOR RECREATION ACTIVITIES IN 1967, 1969
AND 1972, BY SOCIO-ECONOMIC LEVEL (U = Upper; UM = Upper
Middle; M = Middle; LM = Lower Middle; L = Lower)

ACTIVITIES         LEVEL      1967          1969          1972
                                %             %             %
Tent-camping        U          14            10            17
                    UM         18            18            20
                    M          14            13            20
                    LM         14            11            20
                    L     [illegible]        9            18
Trailer-camping     U           8             4            12
                    UM          8             8            13
                    M           7             6            10
                    LM          5             7             9
                    L           3             3             7
Hunting             U     [illegible]       12             9
                    UM         14            14            13
                    M          14            11            11
                    LM         14            13            11
                    L     [illegible]       13            11
Power boating       U     [illegible]       24            28
                    UM    [illegible]       23            29
                    M     [illegible]       16            26
                    LM    [illegible]       17            20
                    L           6            12            13
Canoeing            U           8            13            15
                    UM          5             7            11
                    M           4             7             9
                    LM          4             6             8
                    L           2             7             7
Snow Skiing         U     [illegible]       16            13
                    UM          8             7             8
                    M           4             5             6
                    LM          4             7             5
                    L           2             3             4
Snowmobiling        U          10            14            18
                    UM          6            14            25
                    M           6            14            19
                    LM          6            14            19
                    L           5            11            13
Picnicking          U          46            59            59
                    UM    [illegible]       63            58
                    M          39            56            58
                    LM         39            52            50
                    L          28            42            44
TABLE 14

INCREASE IN THE PERCENTAGE OF CANADIANS WHO DID PARTICIPATE
IN 17 OUTDOOR RECREATION ACTIVITIES BETWEEN 1969 AND 1972

ACTIVITIES                      1969   1972   (b-a)   (b-a) x 100 / a
                                 (a)    (b)
                                  %      %      %           %
Tent-camping                     12     19      7          58
Trailer-camping                   6     10      4          67
Pick-up camper                    2      4      2         100
Hunting                          13     11     -2         -15
Power boating                    19     23      4          21
Canoeing                          8     10      2          25
Sailing                           3      4      1          33
Visiting Historic Parks/Sites    37     36     -1          -3
Driving for pleasure             67     65     -2          -3
Sightseeing from private
  vehicle                        43     38     -5         -12
Snow Skiing                       7      7      0           0
Snowmobiling                     14     18      4          28
Picnicking                       54     54      0           0
Walking-hiking                   37     39      2           5
Ice skating                      19     20      1           5
Horseback riding                  8      8      0           0
Bicycling                        13     19      6          46
TABLE 16

PERCENTAGE INCREASE IN THE INCIDENCE OF PARTICIPATION
IN 17 OUTDOOR RECREATION ACTIVITIES BETWEEN 1969 AND 1972,
BY AGE GROUP

Percentage increase =
  [(% Participants 1972) - (% Participants 1969)] x 100 / (% Participants 1969)

ACTIVITY                   18 years        Age           Age           Age           Age
                            & over        18-29         30-39         40-49       50 & over
Tent-camping                  58            75            50            33             0
Trailer-camping               67            67           167            50            50
Pick-up camper               100            33            67           200           100
Hunting                      -15       [illegible]       -25             0             0
Power boating                 28            14            30            30            20
Canoeing                      25       [illegible]        38            20             0
Sailing                       33       [illegible]        67           -33           100
Visiting Historic
  Parks/Sites                 -3             8             8           -11           -15
Driving for pleasure          -3            -1             0       [illegible]        0
Sightseeing from
  private vehicle            -12             0       [illegible]   [illegible]      -16
Snow Skiing                    0       [illegible]        14           100             0
Snowmobiling                  28            29            53             9       [illegible]
Picnicking                     0             2             2       [illegible]        0
Walking/hiking                 5            13       [illegible]   [illegible]   [illegible]
Ice skating                    5       [illegible]         0             8           100
Horseback riding               0             0            40            50       [illegible]
Bicycling                     46            50            50            25           150
TABLE 17

INCREASE IN PERCENTAGES OF CANADIANS WHO DID PARTICIPATE IN 17 OUTDOOR
RECREATION ACTIVITIES BETWEEN 1969 AND 1972, ACCORDING TO SEX

ACTIVITIES             SEX      1969          1972         (b-a)     (b-a) x 100 / a
                                 (a)           (b)
                                  %             %             %             %
Tent-camping            M        15            22             7            47
                        F        10            16             6            60
Trailer-camping         M         5            10             5           100
                        F         6            10             4            67
Pick-up camper          M         3             4             1            33
                        F         2             4             2           100
Hunting                 M        23            19            -4           -17
                        F         3             3             0             0
Power boating           M        24            27             3       [illegible]
                        F        14            20             6            43
Canoeing                M        11            12             1             9
                        F         5             8             3            60
Sailing                 M         4             4             0             0
                        F   [illegible]        4       [illegible]   [illegible]
Visiting Historic       M        37            36            -1            -3
  Parks/Sites           F        37            36            -1            -3
Driving for pleasure    M        66            66             0             0
                        F        67            64            -3            -4
Sightseeing from        M        43            38            -5           -12
  private vehicle       F        43            38            -5           -12
Snow Skiing             M         9             7            -2           -22
                        F         5             8             3            60
Snowmobiling            M        17            19             2            12
                        F        11            17             6            54
Picnicking              M        51            52             1             2
                        F        57            55            -2            -4
Walking-hiking          M   [illegible]       36       [illegible]        5
                        F        40            42             2             5
Ice skating             M        23            22            -1            -4
                        F        16            18             2            12
Horseback riding        M         9            10             1            11
                        F         6             7             1            17
Bicycling               M        13            19             6            46
                        F        12            20             8            67
TABLE 18

INCREASE IN THE PERCENTAGE OF CANADIANS WHO DID PARTICIPATE IN 17
OUTDOOR RECREATION ACTIVITIES BETWEEN 1969 AND 1972, ACCORDING TO
SOCIO-ECONOMIC LEVEL

ACTIVITIES           LEVEL    1969    1972       (b-a)     (b-a) x 100 / a
                               (a)     (b)
                                %       %          %             %
Tent-camping          U        10      17          7            70
                      UM       18      20          2            11
                      M        13      20          7            54
                      LM       11      20          9            82
                      L         9      18          9           100
Trailer-camping       U         4      12          8           200
                      UM        8      13          5            62
                      M         6      10          4            67
                      LM        7       9          2            28
                      L         3       7          4           133
Pick-up camper        U         2       4          2           100
                      UM        2       3          1            50
                      M         4       4          0             0
                      LM        1       4          3           300
                      L         2       3          1            50
Hunting               U        12       9         -3           -25
                      UM       14      13         -1            -7
                      M        11      11          0             0
                      LM       13      11         -2           -15
                      L        13      11         -2           -15
Power boating         U        20      28     [illegible]   [illegible]
                      UM       23      29          6            26
                      M        16      26         10            62
                      LM       17      20          3            18
                      L        12      13          1             8
Canoeing              U        13      15          2            15
                      UM        7      11          4            57
                      M         7       9          2            28
                      LM        6       8          2            33
                      L         7       7          0             0
Sailing               U         9      10          1            11
                      UM        3       6          3           100
                      M         1       3          2           200
                      LM        1       3          2           200
                      L         2       2          0             0
Visiting Historic     U        49      44         -5           -10
  Parks/Sites         UM       43      44          1             2
                      M        36      35         -1            -3
                      LM       33      32         -1            -3
                      L        25      26          1             4
TABLE 18 (Cont'd.)

ACTIVITIES             LEVEL      1969          1972
                                   (a)           (b)
                                    %             %
Driving for pleasure    U          67            71
                        UM         73       [illegible]
                        M          68            68
                        LM         69            64
                        L          57       [illegible]
Sightseeing from        U     [illegible]       48
  private vehicle       UM         50            43
                        M          44            40
                        LM         40            34
                        L          21            28
Snow Skiing             U          16            13
                        UM          7             8
                        M           5             6
                        LM          7             5
                        L           3             4
Snowmobiling            U          14            18
                        UM         14            21
                        M          14            19
                        LM         14            19
                        L          11            13
Picnicking              U          59            59
                        UM         63            58
                        M          56            58
                        LM         52            50
                        L          42            44
Walking-hiking          U          45            49
                        UM         36            41
                        M          39            41
                        LM         39            36
                        L          31            31
Ice skating             U          28            23
                        UM         20            27
                        M          19            19
                        LM         18            17
                        L          15            15
Horseback riding        U           7             8
                        UM          6            10
                        M     [illegible]        9
                        LM         11             8
                        L           6             7
Bicycling               U          15            23
                        UM    [illegible]       22
                        M          12            21
                        LM         13            17
                        L          12            14

[The (b-a) and percentage-change columns of this part of the
table are not legible in the source.]
TABLE 19

FREQUENCY OF PARTICIPATION IN OUTDOOR RECREATION ACTIVITIES BY CANADIANS
18 YEARS AND OVER. COMPARISON OF THE 1969 AND 1972 8M SURVEYS.

ACTIVITY           FREQUENCY OF              1969          1972
                   PARTICIPATION               %             %
Swimming           Not at all                 56       [illegible]
                   1-5 times             [illegible]        -
                   6-10 times                 10             -
                   More than 10 times    [illegible]        -
Tent-camping       Not at all                 88            81
                   1-5 times                   9            14
                   6-10 times                  2             2
                   More than 10 times    [illegible]   [illegible]
                   Not stated                  -       [illegible]
Trailer-camping    Not at all                 94            90
                   1-5 times                   4             6
                   6-10 times                  1             1
                   More than 10 times    [illegible]        2
                   Not stated                  -       [illegible]
Pick-up camper     Not at all                 98            96
                   1-5 times                   1       [illegible]
                   6-10 times                  -             -
                   More than 10 times          -             -
Hunting            Not at all                 87            89
                   1-5 times                  10             7
                   6-10 times                  1             2
                   More than 10 times          2             2
Power boating      Not at all                 81       [illegible]
                   1-5 times             [illegible]   [illegible]
                   6-10 times                  3             4
                   More than 10 times    [illegible]        6
                   Not stated            [illegible]        1
Canoeing           Not at all            [illegible]       90
                   1-5 times                   6             6
                   6-10 times                  1       [illegible]
                   More than 10 times    [illegible]        1
                   Not stated                  -       [illegible]
Sailing            Not at all                 97            96
                   1-5 times                   3             3
                   6-10 times                  -             1
                   More than 10 times          -             -
TABLE 19 (Cont'd.)

[Table values not legible in the source. This part of the
table covers the following activities, each broken down by
frequency of participation (Not at all; 1-5 times; 6-10
times; More than 10 times; and, for some activities, Not
stated): Water skiing; Nature study or bird watching;
Outdoor photography; Visiting Historic Sites/Parks;
Visiting other Parks; Driving for pleasure; Sightseeing
from private vehicle; Climbing; Snow Skiing; Snowmobiling.]
TABLE 19 (Cont'd.)

[Table values not legible in the source. This part of the
table covers the following activities, each broken down by
frequency of participation (Not at all; 1-5 times; 6-10
times; More than 10 times; and, for some activities, Not
stated): Snow sledding/tobogganing; Picnics or cookouts
away from home; Walking/hiking; Golfing; Ice skating;
Horseback riding; Bicycling; Tennis; Fishing.]
One can see from Table 15 that the range and direction of
variation in the changes was quite great from one activity
to another and from one area to another. Thus there was
only one activity, Tent-Camping, that showed increases in
percentages of participants in all areas between 1969 and
1972; these increases ranged from 15 percent for
Saskatchewan to 112 percent for Metropolitan Toronto.
However, the reader should take note of the points made in
the special section after the Introduction. These should
have led the reader to expect that much of the fluctuation
observed in Table 15 is of a statistical nature and related
to small sample size. Moreover, there was only one area,
Manitoba, that had increases in percentages of participants
in all seventeen activities between 1969 and 1972. These
increases ranged from 8 percent for Driving for Pleasure to
500 percent for Snow Skiing.

Of the 204 entries in this Table, 76 (37 percent) show a
decrease in participation, 18 (9 percent) show no change
and 110 (54 percent) show an increase.
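
The shares quoted for Table 15 can be checked by simple arithmetic: 204 entries correspond to seventeen activities in twelve areas. The Python sketch below recomputes the percentages from the counts given in the text; the function name is illustrative and not part of the CORD Study.

```python
def shares(counts):
    """Return the total and each category's share of it, in percent."""
    total = sum(counts.values())
    return total, {name: 100.0 * n / total for name, n in counts.items()}


# Counts of Table 15 entries by direction of change, as given in the text.
table15_counts = {"decrease": 76, "no change": 18, "increase": 110}

total, pct = shares(table15_counts)
print(f"total entries: {total} (17 activities x 12 areas = {17 * 12})")
for name, value in pct.items():
    print(f"{name}: {value:.1f} percent")
```

The same function can be applied to the counts reported for any of the other tables of percentage changes.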

The decreases seem dispersed throughout areas and
activities. However, decreases in the following activities
were widespread throughout Canada: Visiting Historic
Parks/Sites (decreases in 9 areas out of 12 between 1969
and 1972), Driving for Pleasure (8 areas out of 12) and
Hunting (8 areas out of 12). As well, in Saskatchewan
between 1969 and 1972 there were decreases in participation
in 12 of the 17 outdoor activities.

The comparability of the 1969 and 1972 data permits
analysis of the evolution of participation in seventeen
recreation activities between 1969 and 1972 for four age
groups: 18-29, 30-39, 40-49 and 50 and over (Table 16).

Of the 85 entries in this table, 20 (24 percent) showed
decreases in participation between 1969 and 1972, 15 (18
percent) showed no change and 50 (58 percent) showed an
increase in participation between the two periods.

Although decrease in participation is not concentrated in
any particular age group, it is in the 40-49 age group that
decrease is most often noticed, i.e. in six activities out
of seventeen. It may also be noticed that while decrease in
participation for Hunting is concentrated in the 18-29 and
30-39 age groups, the pattern is quite different for
Visiting Historic Parks/Sites, where decrease in
participation between 1969 and 1972 occurred in the 40-49
and 50 and over age groups.

Results shown in Table 17 suggest that even if trends for
most activities were positive for both males and females
during the 1969-1972 period, the increase in percentages of
Canadians who participated in outdoor recreation activities
may be quite different according to sex. For instance,
while the percentage change in Snow Skiing for males was
negative (22 percent decrease), for females it was positive
(60 percent increase).

Even if the socio-economic levels used were determined
subjectively by the interviewers, this classification may
help in gaining insight into class patterns of
participation in outdoor recreation activities. As shown in
Table 18, between 1969 and 1972 there was a great range in
the percentage increases in participation from one activity
to another and from one socio-economic level to another.
There does not, however, seem to be any generalized pattern
that applies to these changes in percentage participation
according to socio-economic level. It may be that a more
objective socio-economic scale would give greater insight.


CHANGE IN THE FREQUENCY OF PARTICIPATION 
BETWEEN 1969 AND 1972 


The 1969 and 1972 surveys present information not only on
the incidence of participation in outdoor recreation
activities but also on the frequency of participation.
These results are shown in Table 19. When considering only
the seventeen activities for which there is information on
the frequency of participation for 1969 and 1972, it can be
seen that the "more than 10 times" category has a somewhat
greater percentage in 1972 than in 1969 for five activities
and a lower percentage for one activity, Snow Skiing. These
findings suggest not only that the incidence of
participation increased for most outdoor recreation
activities between 1969 and 1972 but also that the
frequency of participation increased slightly for most
activities.
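
Incidence and frequency are linked: the percentage of people who took part at least once is the complement of the "Not at all" row of Table 19. The sketch below, in Python, derives incidence in this way for three activities whose "Not at all" percentages are legible in Table 19. It ignores "Not stated" responses, which is an assumption, and the function name is illustrative.

```python
def incidence_from_not_at_all(not_at_all_percent):
    """Percent who participated at least once, given the percent who
    reported not participating at all (ignores 'Not stated')."""
    return 100 - not_at_all_percent


# "Not at all" percentages (1969, 1972), as read from Table 19.
not_at_all = {
    "Tent-camping": (88, 81),
    "Trailer-camping": (94, 90),
    "Hunting": (87, 89),
}

for activity, (y1969, y1972) in not_at_all.items():
    print(
        f"{activity}: incidence {incidence_from_not_at_all(y1969)} percent "
        f"in 1969, {incidence_from_not_at_all(y1972)} percent in 1972"
    )
```

For Trailer-camping the derived values, 6 and 10 percent, agree with the 1969 and 1972 incidence figures shown in Table 14.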


CONCLUSION 


Participation in outdoor recreation activities is 
generally increasing. This is evidenced not only by 
increases in incidence of participation but also by 
increases in frequency of participation. The analysis of 
data for a five-year period shows that for seven activities 
of the eight that could be considered, the trends in 
incidence of participation were on the rise between 1967 and 
1972, even if there were fluctuations between 1967 and 1969 
or between 1969 and 1972. In fact, with some few exceptions, 
the statement is true for all regions and all age groups 
considered, for both sexes and for all socio-economic 
levels. 

Changes in percentages of various populations who were 
participants in seventeen outdoor recreation activities 
between 1969 and 1972 were mostly positive. In Canada as a 
whole there were increases for ten activities out of 
seventeen and decreases for four activities. However, when 
considering regional patterns, one notices that between 1969 
and 1972 there was only one activity, Tent-Camping, for which 
increases occurred in all regions. As well, there was only 
one province, Manitoba, for which increases occurred in all 
activities. 

Obviously the evolution of participation in outdoor 
recreation activities does not follow the same pattern 
throughout all Canadian regions, at least in a short period. 
Yet, having small samples with which to work limited what 
can be said about regional variations. The analysis of 
change in percentages of participation by age groups, sex 
and socio-economic levels does suggest that there may be 
important differences in patterns of change according to 
people's socio-economic characteristics. For example, while 
there was a 22 percent decrease in percentage of 
participants in Snow Skiing for males between 1969 and 1972, 
there was a 60 percent increase for females. 

Even if the general impression is that of upward trends 
in participation in outdoor recreation activities, it would 
not be accurate to neglect the importance of downward 
trends. Hunting is the only activity out of the eight 
considered for 1967, 1969 and 1972 for which there was a 
steady decrease in percentage of participants in Canada as a 
whole. Even with the consistent trend just noted, the 
decrease in Hunting did not occur in two provinces. However, 
the decrease is consistently observed for Age Groups, Sex 
and Socio-economic Levels. Between 1969 and 1972, Hunting, 
Visiting Historic Parks/Sites, Driving for Pleasure and 
Sightseeing from a Private Vehicle had downward trends in 
Canada as a whole and in most regions. In Saskatchewan, the 
percentages of participants declined for twelve out of 
seventeen activities during that period. Decreases in 
Saskatchewan may be related to the changing age distribution 
of the province. 

It was not the goal of this paper to explain why, in a 
general context of increasing participation in outdoor 
recreation activities in Canada, there are activities or 
sections of Canadian society that show some decrease in 
participation, at least in the short term. However, the 
fact that this brief review of tabulations that were 
available in the Spring of 1973 raises some interesting 
questions (as well as bringing together some figures that 
may be useful to planners) points up the need for a thorough 
and rigorous analysis of the National Survey data on which 
the tabulations used in preparing this document are based. 


CORD STUDY TECHNICAL NOTE 13 


STATISTICAL PROJECTIONS 
THAT GO BEYOND PROJECTIONS OF PAST TRENDS 


J. Beaman 


ABSTRACT 


This paper deals with one aspect of making projections: 
the use of quantitative models that allow one to consider 
certain types of social change. The general idea behind 
making projections using an equation like the one following 
is explained: 


Probability of participation in an outdoor 
recreation activity = 


General mean + Income effect + Age effect + 
Education effect + Family Composition effect + 
Urbanization effect. 


The author discusses the numerous considerations that 
relate to using such an equation. These are: (1) the 
accuracy of the estimates that are made; (2) the problems 
that arise because of the inadequacy of models due to the 
existence of interaction effects; (3) the need to consider 
supply in making estimates by introducing a further term 
into the equation that reflects amount of supply; (4) the 
need to consider time budget and purpose of participation 
factors; (5) the failure of the model to "work" where 
spontaneous change is involved as opposed to change that is 
either captured in demographic trends or trends in the model 
coefficients. 

In discussing these points the author refers to a 
number of research projects which have clarified such 
matters as what the accuracy of certain projections is, how 
important interaction effects are, and how supply may be 
considered in using the kind of equation specified. 

In concluding portions of the article the author 
acknowledges the relevance of "futures research" techniques 
such as the Delphi method. There is an attempt to show what 
the relative role of the quantitative method endorsed should 
be versus (1) simple trend projection and (2) "qualitative" 
futures research techniques. 

As a general guide the author proposes that trend 
extrapolation is good for one- to three-year projections. Even 
though the analysis of variance technique can be used for 
such short periods, it is not suggested because its use is 
more complicated than trend extrapolation. The analysis of 
variance technique with very good models is appropriate as 
the only projection technique to be used for "stable" 
activities for up to 20 years. When forecasts are of what 
will happen more than 20 years in the future or when 
"unstable" activities are considered it is proposed that 
Delphi forecasting be used in conjunction with analytic 
methods. 


PURPOSE 


The purpose of this paper is to discuss the making of 
projections using models that have some projective efficacy 
because they tap "demographic" components of trends. 


AN INTRODUCTION TO THE PROJECTION STRATEGY ENDORSED 


As indicated later, the author acknowledges the 
limitations of classical quantitative methods. Therefore, in 
what follows, the reader should not get the idea that the 
author is only endorsing the use of "quantitative" methods. 
The work cited in Linstone and Turoff (see Reference 11) 
and other articles and books, for example, Mitroff and 
Turoff (Reference 12), Churchman (Reference 3), Turoff 
(Reference 16) and Greenall (Reference 9), is seen as very 
relevant to one aspect of the problem which is being 
considered. This paper divides "standard quantitative" 
projection techniques into two large classes: (1) ad hoc, or 
the use of a trend line extension from existing historical 
or time series observations, and (2) structural, or the use 
of explanatory regression equations that take into account 
age, income, family size, etc., as variables causally related 
to recreation participation. 

For example, quantitative analysis methods associated 
with the use of structural models that "tap" the influence 
of socio-economic factors on specific groups allow one to 
take into account demographic as well as other trends such 
as changing age composition of a population, changing sex 
composition and changing urbanization. 

The values in Figures 1 and 2 are the coefficients 
necessary for defining equations for predicting 
participation in hunting and snowmobiling. Figure 1 shows 
that there is a strongly negative relationship between 
urbanization and participation in hunting. One sees that age 
effects (shown above the label AGE on the Y-axis) decrease 
from about .13 for males around 20 years of age to -.14 for 
males 65+. In Figure 2, one can note similar age effects on 
snowmobiling. Information on the data used in deriving the 
coefficients for Equation 1 that are plotted in Figure 1 is 
provided in TN 12. Literature on the use of such equations 
in recreation research is also cited there. 

FIGURE 1 

RELATIONSHIP BETWEEN PARTICIPATION/NON-PARTICIPATION IN HUNTING 
AND SELECTED SOCIO-ECONOMIC CHARACTERISTICS 
(FOR MALES) 


FIGURE 2 

RELATIONSHIP BETWEEN PARTICIPATION/NON-PARTICIPATION IN SNOW MOBILING 
AND SELECTED SOCIO-ECONOMIC CHARACTERISTICS 
(FOR MALES) 


In words, the equation developed in TN 12 is based on 
the idea that: 


(1) 
Probability of participating in an outdoor 
recreation activity = 


General mean + Income effect + Age effect + 
Education effect + Family composition effect + 
Urbanization effect. 


As described in TN 12, it is possible to go from such an 
equation for an individual to an equation for predicting 
participation for a population: 


(2) $NP = UT + \sum_{J} B(1,J)\,n(1,J) + \cdots + \sum_{Q} B(5,Q)\,n(5,Q)$ 


WHERE 

NP = the number of participants in an outdoor 
recreation activity in the population considered, 

U = the general mean for an activity, 

T = the number of persons in the 
population considered, 

B(1,J) ... B(5,Q) = the effect of an individual being in a 
category J ... Q of socio-economic variables 
1 ... 5, 

n(1,J) ... n(5,Q) = the number of persons in the population 
considered who are in a category J ... Q of 
socio-economic variables 1 ... 5. 
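

As a minimal sketch of how Equation 2 could be evaluated, 
the following Python fragment adds the general-mean term to 
the category effects for two hypothetical populations. The 
function name project_participants, the category labels and 
every coefficient and count are illustrative assumptions; 
none of them are taken from TN 12 or from Figures 1 and 2. 

```python
# Hypothetical illustration of Equation 2; every number below is made up.

def project_participants(general_mean, effects, counts):
    """Return NP = U*T + sum of B(i, c) * n(i, c) over variables i and categories c.

    effects[var][cat] is the coefficient B for that category.
    counts[var][cat] is the number of persons n in that category.
    The counts of every variable are assumed to sum to the same total T.
    """
    totals = [sum(cats.values()) for cats in counts.values()]
    total_population = totals[0]
    if any(t != total_population for t in totals):
        raise ValueError("category counts must sum to the same T for every variable")

    participants = general_mean * total_population
    for var, cats in counts.items():
        for cat, n in cats.items():
            participants += effects[var][cat] * n
    return participants


# Made-up coefficients, held constant between the two dates.
EFFECTS = {
    "urbanization": {"rural": 0.06, "urban": -0.04},
    "age": {"under_45": 0.03, "45_plus": -0.05},
}

# Two hypothetical populations of 1,000 persons; the later one is
# more urban and older, but the coefficients are unchanged.
earlier = {
    "urbanization": {"rural": 400, "urban": 600},
    "age": {"under_45": 700, "45_plus": 300},
}
later = {
    "urbanization": {"rural": 300, "urban": 700},
    "age": {"under_45": 650, "45_plus": 350},
}

np_earlier = project_participants(0.14, EFFECTS, earlier)
np_later = project_participants(0.14, EFFECTS, later)
print(round(np_earlier, 1), round(np_later, 1))
```

With these made-up figures the coefficients are identical in 
both runs, yet the projected number of participants falls 
because more persons are in categories with negative 
effects. 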


It is important to note that the "urbanization" effects 
shown in Figure 1, when used in Equation 2, suggest that an 
increase in urbanization will result in a decreasing 
participation in hunting even though the effects 
(coefficients) shown graphically in Figure 1 may remain 
constant in the future. Specifically, Equation 2 indicates 
that the number of people participating in hunting in a 
given year in Canada depends on: (1) the population of 
Canada in the given year; and, (2) the number of people in 
various socio-economic groups. 

Other shifts, besides urbanization, in the population 
include aging of the population and increasing education 
levels. In Figures 1 and 2 the effects shown above the 
headings Age, Education and Urbanization suggest that being 
old, highly educated and/or being in an urbanized area are 
associated with negative effects on hunting participation. 
Thus three demographic shifts result in an increase in the 
number of people having negative differentials for 
participation in hunting. These demographic shifts in the 
population "explain" the decrease in the number of hunters. 

Total Population is the only variable in which there is 
a change which raises the total number of hunters. However, 
since Canada's population growth rate is approaching zero, 
there is good reason to believe that in ten years, with 
another ten per cent urbanization in Canada, there will be a 
decrease in hunting participation larger than the -3.59% and 
-5.5% "decline" rates for hunting obtained from the Canadian 
Outdoor Recreation Demand (CORD) Study data (see TN 22). 
Percentage growth rates are the average annual change in the 
per cent participating, expressed as a percentage of the 
initial level. For example, percentage growth rate 1967 to 
1969 = 100 x ((14% participating in 1967 - 13% participating 
in 1969) / (2 years from 1967 to 1969)) / (14% participating 
in 1967), or about 3.5%. 
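

The growth-rate arithmetic in the example above can be 
checked with a short Python sketch. The function name 
average_annual_rate is an assumption made for illustration, 
and the sign convention (starting level minus ending level) 
simply follows the worked example in the text. 

```python
def average_annual_rate(pct_start, pct_end, years):
    """Average annual change in per cent participating, as a
    percentage of the starting level (start minus end, as in the text)."""
    return 100 * ((pct_start - pct_end) / years) / pct_start


# 14% participating in 1967, 13% in 1969, two years apart.
rate = average_annual_rate(14.0, 13.0, 2)
print(round(rate, 2))
```

Evaluated exactly, the example comes to roughly 3.57 per 
cent per year. 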

In the context of the kind of change in hunting 
participation described above, one can "understand" what is 
happening when trends in certain activities are observed. 
Without either going to a Delphi panel or having the results 
of several surveys providing data for a trend line, it is 
possible that reasonable and reliable predictions can be 
made. In fact, an important point is that the trends in 
rates obtained from surveys may indicate a certain rate of 
increase or decrease in participation in an activity that is 
actually reflecting the way that changes in the age 
distribution, urbanization and income, etc., of a population 
are interacting to produce rates that are observed in the 
years considered. 

When a structural projection is made in which 
socio-economic characteristics are independent variables, 
one need not rely on drawing trend lines through points that 
are determined by complex interactions of demographic 
factors. Rather, one can take into account the changes in 
socio-economic variables that may be expected to occur in 
the future. And, in this regard, it is important to note that 
socio-economic changes may be projected by accepted "sound" 
methods or are even available from central statistical 
organizations (e.g., Statistics Canada) so it is possible, 
with some confidence and in some cases little effort, to see 
what future participation in recreation activities will 
probably be. 


SOME CONSIDERATIONS CONCERNING 
THE USE OF THE PROPOSED PROJECTION STRATEGY 


Accuracy of Estimates 


One obvious evaluation criterion to employ when 
considering the use of the kind of projection approach 
endorsed here is whether the coefficients that must be used 
in the model can be determined accurately enough that they 
could be used in making projections. CORD TN No. 6 has been 
developed on this topic so there is little need to discuss 
the issues that are covered there in detail. However, there 
are a few points which can be noted without entering into 
lengthy discussion. Firstly, an important determinant of the 
accuracy of coefficients is the size of the sample on which 
a model is based. A second important factor to consider is 
whether the socio-economic variables that are used in a 
model are strong determinants of participation in a given 
activity. This is particularly important since some 
activities that have a low level of participation may also 
only have very weak relationships to socio-economic 
characteristics. Thus, extremely large sample sizes in terms 
of a random sample from the population of Canada or a 
particular area may be required to develop certain models. 

One might suspect that the size of the area for which 
projections are made would be important in relation to the 
accuracy of estimates that can be achieved. However, 
inaccuracy in estimates of participation in relation to the 
size of an area for which estimates are made reflects only 
inherent variability in levels of participation in 
relatively small populations. The fact that very highly 
variable levels of participation for small populations may 
be noted in surveys and expected when projections are made 
need not, and usually would not, reflect a deficiency of the 
modelling approach. It would reflect a reality of dealing 
with small populations. 
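

The point about small populations can be illustrated with 
the standard error of a proportion. The sketch below assumes, 
purely for illustration, that each person participates 
independently with the same probability; the 14 per cent 
rate and the population sizes are hypothetical inputs, not 
survey results, and the function name is invented here. 

```python
import math


def rate_standard_error(p, n):
    """Standard error of an observed participation rate for n persons,
    assuming independent participation with common probability p."""
    return math.sqrt(p * (1.0 - p) / n)


# The same underlying rate looks far more variable in a small population.
for n in (100, 10_000, 1_000_000):
    print(n, round(rate_standard_error(0.14, n), 4))
```

Under this assumption the spread of the observed rate 
shrinks with the square root of the population size, so a 
population one hundred times smaller shows ten times the 
variability even when the model itself is unchanged. 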

The preceding comments have been based on an analysis 
of variance model like the one introduced earlier. But the 
CORD Study work has already established the effects of 
certain interaction variables that were not 
considered in the simple models presented in TN 12. 
Technical Notes have been prepared which show the structural 
problems of models by pointing out (1) the importance of 
interaction effects between socio-economic variables in 
explaining participation in outdoor recreation activities 
(see TN 20) and (2) the effect of the quantity and quality 
of supply of facilities on individuals' frequency of 
participation (see TN 2, 3, 9 and particularly 29; also see 
the concluding remarks in the review of Chapter VII of this 
volume). 

There is reason to suspect that model coefficients are 
changing. Over time, changes in the time budgets of people, 
for example the rise in the number of three-day weekends, 
may affect coefficients of many variables. Also, exogenous 
influences not directly related to socio-economic 
characteristics, such as the energy crisis and innovations 
in outdoor recreation equipment, certainly result in a 
changing environment in which people recreate. 

Finally, research on National Survey data collected in 
Canada has shown that there is a clustering of people 
according to the activities in which they participate (see 
Reference 14 and the Annex to TN 10). This clustering 
reflects the situation that exists because there are large 
numbers of people who participate in very few similar 
activities, and a few people who participate in very many 
activities. The existence of the clustering phenomenon 
suggests that, ultimately, analysis of variance models (and 
other models that pose a basically linear structure as 
explaining participation in an activity based on 
characteristics of individuals in a population rather than 
considering equations for sub-groups of the populations) 
will have very serious limitations in terms of making 
accurate predictions. Limitations relate to problems with 
the structure of the linear model and its inability to 
respond in a correct way to changes in the characteristics 
of the population in terms of the manner in which these 
changes are occurring. This results in inaccuracy in 
estimates of participation (see the Futures section of TN 
32). 

Model structure and the resultant problems that arise 
when a model is not properly structured have been a recurring 
theme in the preceding comments. Some specific discussion 
sections may help the reader see the nature of special 
"accuracy" problems caused by improperly structured models. 
These sections may also help the reader see why certain 
lines of research were pursued in the CORD Study to try to 
overcome structural problems with models. 


Interaction Effects 


Interaction effects have received some limited 
attention in the analysis of recreation behaviour. In the 
Mueller and Gurin report of the ORRRC Study (see Reference 
13) analysis of variance runs for males and females were 
carried out separately because it was recognized that there 
was a male-female interaction that would have invalidated 
the results of an analysis including both males and females. 
There are a few articles that incorporate interaction 
effects into the development of models to explain 
participation in outdoor recreation (see Reference 17 and TN 
27). On the whole, however, there has not been a concerted 
effort to discover which interaction effects usually occur 
in relation to participation in certain activities, or how 
to deal with these effects. 

The essence of the problem is that if, for example, an 
age-education interaction is important in explaining 
participation in hunting (as AID runs with Canadian National 
Survey data have indicated), then part of the variance that 
could have been explained by an appropriately structured 
analysis of variance model is not explained if the model 
derived does not contain interaction effects. Such a 
structurally deficient model not only does not explain all 
the variance that could be explained, it obviously cannot 
provide any insights into a system in which interactions are 
occurring. Thus problems may arise when the model is applied 
using an equation similar to Equation 3. Using 
socio-economic projections of a population for some later 
time, the effects on participation resulting from changes in 
the distribution of people in various expected age-education 
categories will not be accounted for properly. 
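

A small numerical sketch may make the problem concrete. The 
Python fragment below invents "true" participation rates for 
four age-education cells that contain an interaction, fits a 
main-effects-only approximation to the unweighted cell 
means, and then applies both to a shifted population. The 
rates, the counts and the simple fitting rule are all 
assumptions chosen for illustration; this is not the AID or 
analysis of variance procedure used in the CORD Study. 

```python
AGES = ("young", "old")
EDUCATION = ("low", "high")

# Hypothetical true rates: only young, less-educated persons
# participate at a high rate (an age-education interaction).
TRUE_RATE = {
    ("young", "low"): 0.30,
    ("young", "high"): 0.10,
    ("old", "low"): 0.10,
    ("old", "high"): 0.10,
}


def additive_fit(rates):
    """Main-effects-only approximation: age mean + education mean - grand mean."""
    grand = sum(rates.values()) / len(rates)
    age_mean = {a: sum(rates[(a, e)] for e in EDUCATION) / len(EDUCATION) for a in AGES}
    edu_mean = {e: sum(rates[(a, e)] for a in AGES) / len(AGES) for e in EDUCATION}
    return {(a, e): age_mean[a] + edu_mean[e] - grand for a in AGES for e in EDUCATION}


def participants(rates, counts):
    """Expected number of participants given cell rates and cell counts."""
    return sum(rates[cell] * n for cell, n in counts.items())


ADDITIVE_RATE = additive_fit(TRUE_RATE)

# Base population (equal cells) and a later, more educated and older one.
base = {cell: 250 for cell in TRUE_RATE}
future = {
    ("young", "low"): 100,
    ("young", "high"): 500,
    ("old", "low"): 100,
    ("old", "high"): 300,
}

for label, pop in (("base", base), ("future", future)):
    print(label, round(participants(TRUE_RATE, pop), 1),
          round(participants(ADDITIVE_RATE, pop), 1))
```

In this made-up case the additive approximation reproduces 
the base-population total but overstates participation once 
the distribution of people across age-education cells 
changes. 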

If the structure of a model does not reflect the way 
people behave, but is simply molded to fit a set of static 
data, it should not be expected to provide a valid 
representation of some future reality. Specifically, (1) 
models' efficacy in predicting future patterns is lowered 
and (2) questionable trends may be imputed from the data. 
The magnitude of these problems depends on the degree of 
interaction in the portion of the population which is being 
modelled. Generally, what is the extent of interaction? 
Preliminary results obtained from Canadian National Survey 
data indicate that the interaction effects involve a sum of 
squares (a measure of explanative power) as large as the sum 
of squares associated with the first order effects, even 
when analyses are run separately for males and females (see 
TN 20). 


Supply Considerations 


Researchers have known for a long time that the supply 
of facilities for participation in an outdoor recreation 
activity affects the amount of participation that occurs. 
However, the exact nature of this relationship has not been 
established. The work on demand functions for outdoor 
recreation at reservoirs and similar work on demand 
functions for parks has cast some light on the relationship 
between supply and participation. However, most demand 
function models do not treat supply in a way that allows one 
to generate figures for the total amount of participation 
that will arise in a city that is surrounded by a variety of 
units of supply at various distances (for example, see the 
literature on demand function models cited in Reference 5). 

Cicchetti, et al. (see Reference 4) have developed an 
equation that shows the relationship between supply and 
participation in activities in a given origin area. Because 
of the unavailability of data for specific locations where 
people participated, Cicchetti's equation applies only to 
aggregate supply. Thus it was not possible to include any 
locational or distance effects on the supply factor. It 
follows that if one attempts to develop a model similar to 
Equation 3, it should contain a supply component in addition 
to various socio-economic characteristics. This supply 
factor should reflect the relative level of supply of 
facilities for a given activity that is available to the 
relevant population. 

If a model is developed without a supply factor as an 
independent variable then the same problems arise as when a 
model is developed without a measurement of the interaction 
effects. As noted in the last section, part of the variance 
that could be explained by the model is not. This variance 
either (1) turns up being counted as error when it is, in 
fact, systematic, or (2) causes distortion in the 
coefficients of the model. The result will be invalid 
projections when the model is applied to both (a) 
participation estimation for specific areas that deviate 
from some norm such as the average national supply of 
National Parks and (b) projections of participation for five 
or ten years for a given area in which the quantity or 
quality of supply changes. 

In recognition of the considerations noted above, some 
CORD Study analysis work has been on the development of the 
supply factor model (see TN 29). Resources have also been 
devoted to work on modelling considerations that generalize 
the Cesario origin destination flow model (see TN 3I3d). 
Promising results have already been achieved in the supply 
factor modelling effort. The work related to the Cesario 
model is of a much more general nature and offers a promise 
of allowing an understanding of the structure of relations 
involved when a new site is "plugged in" among a group of 
alternative sites and when people modify their behaviour in 
relation to alternative sites which are available. 

But when one refers to alternatives being available, 
two issues arise. If there exists a system of parks and a 
new, similar park is created, then an alternative to the 
existing parks has been created. However, it is recognized 
that there are alternatives to visiting parks in terms of 
outdoor recreation behaviour. Burton (Reference 2) and 
Gillespie (Reference 7) have discussed a methodology for the 
determination of the substitutability of activities by the 
use of a factor analysis procedure. Hendee and Burdge 
(Reference 10) have reviewed a large number of 
considerations related to the substitutability of one 
activity for another (also see Reference 00 for Beaman's 
comments on the Hendee and Burdge article). In discussing 
apparent trade-offs between activities, Beaman and Leicester 
(see TN 37) suggest that one should be careful to recognize 
that there are substitutable activities and complementary 
activities, the latter being activities such that if people 
participate, they tend to participate in both of the 
activities or neither. One need only review the works 
already cited there or by Bishop and Witt (Reference 1), 
which are devoted to operationalizing the Ontario TORPS 
(Travel and Outdoor Recreation Planning Study, Reference 6) 
to see how many academic challenges and practical problems 
relate to understanding substitutability. 

Regarding supply considerations, the cluster analysis 
work (see Reference 14 and TN 10) has implications for 
participation in, and trade-offs between, activities. Some 
of the implications of clustering analysis and problems in 
using factor analysis in determining trade-offs or 
complementarity between activities are noted in TN 32. The 
main point here, however, is that a cluster analysis model 
is drastically different from a linear additive 
model. The way in which people respond to supply when there 
is an intricate interrelationship between supply and a 
collection of activities in which people have participated 
is simply not "explained" by a group of linear models. 
Obviously the supply for any number of activities interacts 
with other participation variables to produce observed 
behaviour and when a number of supply units for a number of 
facilities relate to how decisions are made, there is a 
complex non-linear situation (see the "trade-off" matrix 
specified in Reference 1). 

Though important work is proceeding on the effect of 
supply on participation, the results are not yet definitive. 
It must therefore be recognized that models are still 
deficient in terms of the way that supply is introduced into 
the modelling framework. The result is that it is still 
necessary to "live with" some structural error related to 
supply and behaviour (see the concluding remarks of the 
review of Chapter VII). 


Time Budget and Purpose of Participation Considerations 


Implicit in much of what has been suggested about 
changing patterns of behaviour and the effect of 
socio-economic variables is a concern with time budgets. One 
component of the supply base for any activity is the 
availability of time to participate in that activity. The 
potentially drastic effects on participation in any number 
of activities if there is a shift to a three- or four-day 
week are obvious. It is very likely that activities 
participated in during the week involving little travel may 
be ignored in favour of activities involving more travel. 
But very little demand research has rigorously considered 
expenditures of time and changing supplies of time as these 
affect recreation consumption. Thus, it would be pretentious 
to claim that any recreation research effort has provided a 
really usable way of entering time budget information into a 
supply-demand or supply-participation model (for general 
time budget research, see Reference 15). The amounts of 
discretionary time, the kind of packages in which it is 
available and the availability of resources to use this time 
have fundamental implications for the amount of 
participation in outdoor recreation activities that will 
occur (for a classical discussion, see Reference 8). Until 
better theoretical work is developed and better data sets 
are available for analysis, time budget considerations will 
enter into the participation models in a very inadequate 
way. Thus, the incorporation of time budget information for 
tapping important trends in time use cannot be expected to 
be realized in the near future. 

The only way it now seems possible to structure models 
to explain some of the trends due to the change in time 
budgets is to include gross or aggregate time use 
information in models. In Canada, two time budget data sets 
exist that can be used in model development (Halifax time 
budget data collected by members of the Dalhousie Institute 
of Public Affairs and Vancouver data collected by members of 
the Department of Sociology, University of British Columbia). 


"Spontaneous" Change vs Trends in Model Coefficients 


Understanding trends in the coefficients that define 
structural models is much more complicated than recognizing 
time budgets as sources of variation. The 
discussion above has suggested that models do not adequately 
consider changes in time budgets and supplies of facilities. 
Furthermore, these models have been shown to ignore 
interaction effects and how these change model coefficients 
over time. Implicit in the discussion, however, is the 
notion that in theory all these sources of change in the 
coefficients can be understood and controlled. 

Though this paper endorses seeking an understanding of 
the structure of relations that "explain" behaviour, several 
practical non-structure oriented procedures can be suggested 
for making necessary projections. First, when several 
surveys are available that are large enough that model 
coefficients have small standard deviations compared to 
their magnitude, then trends in regression coefficients over 
time should be considered so that, as well as projecting 
demographic characteristics, relevant projection 
coefficients may be projected. The author considers this a 
case where demographically induced trends, and trends in the 
related coefficients, may be used in a rigorous way for 
making projections. This may be done by using both projected 
coefficients and projected population characteristics in 
Equation 3 or a generalization of this equation to include 
supply factors and interaction effects. 
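

The procedure of projecting coefficients as well as 
population characteristics can be sketched as follows. The 
Python fragment fits a least-squares line through 
coefficient estimates from three surveys, extrapolates each 
coefficient to a target year, and then applies the projected 
coefficients to a projected population in an additive 
equation of the kind used in this paper. The coefficient 
values, the 1977 population counts and all function names 
are hypothetical and chosen only so that the arithmetic is 
easy to follow. 

```python
def linear_trend(years, values):
    """Least-squares line through (year, value) points; returns (slope, intercept)."""
    n = len(years)
    mean_x = sum(years) / n
    mean_y = sum(values) / n
    sxx = sum((x - mean_x) ** 2 for x in years)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, values))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def extrapolate(years, values, target_year):
    """Value of the fitted trend line at target_year."""
    slope, intercept = linear_trend(years, values)
    return intercept + slope * target_year


SURVEY_YEARS = [1967, 1969, 1972]

# Hypothetical urbanization coefficients estimated in three surveys.
COEFFICIENTS = {
    "rural": [0.045, 0.051, 0.060],
    "urban": [-0.030, -0.034, -0.040],
}

GENERAL_MEAN = 0.14  # assumed constant for this sketch

# Hypothetical projected population for the target year.
TARGET_YEAR = 1977
PROJECTED_COUNTS = {"rural": 250, "urban": 750}

projected_effects = {
    cat: extrapolate(SURVEY_YEARS, series, TARGET_YEAR)
    for cat, series in COEFFICIENTS.items()
}

total = sum(PROJECTED_COUNTS.values())
projected_participants = GENERAL_MEAN * total + sum(
    projected_effects[cat] * n for cat, n in PROJECTED_COUNTS.items()
)
print(projected_effects, round(projected_participants, 2))
```

A straight-line trend is only one possible choice; with 
three surveys there is little basis for anything more 
elaborate, and the standard deviations of the coefficients 
would need to be small for such a trend to be meaningful. 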

Another consideration relates to what might be called 
"spontaneous change". Innovation in a given area may result 
in the development of the snowmobile or a device that has 
not yet been marketed. Obviously trends in participation in 
presently marginal activities or activities of the future 
cannot be extrapolated to suggest future participation in 
these activities. It is in these cases that demographically 
induced trends are irrelevant to predicting future levels of 
participation unless participation in one activity is 
traded-off with another. In this case, valid projections may 
be made for aggregates of activities but not for specific 
ones in a collection of activities among which trade-offs 
are occurring. 

In the context of specific established activities, 
looking at demographically induced trends may be relatively 
useful while this approach has little to offer in the 
context of new or unthought of activities. For these 
activities, the use of the Delphi technique or other 
futuristic projection techniques offers the only useful 
alternatives. The boom in snowmobiling was not predictable 
in 1930 in the sense that 45 years ago one could not have 
looked at certain coefficients, age distributions, etc., and 
have predicted the present level of snowmobiling. Further, 
it is almost certain that even with data on snowmobiling in 
1969 and 1972, one cannot come up with realistic projections 
based on current trends and betas of what snowmobiling will 
be in 1990. Snowmobiling in 1969 was in a transition period 
and it is questionable whether some equilibrium had been 
reached only three years later. 


CONCLUSION 


In this paper, two kinds of participation projection 
models have been identified. The one which is of major 
concern is the structural model which explicitly considers 
change in participation as being directly linked to 
socio-economic changes in the populations under 
consideration. This type of change may be described as 
resulting in "demographically induced trends" in 
participation rates. 

When several surveys on participation rates and
demographic trends are available, appropriate structural
parameters can be estimated and trends in these parameters
can be established. Given the expected future values of
these parameters, it becomes possible to project future
participation levels. This is a logical extension of the
assumption that coefficients observed at a given time are
relatively stable, and thus are valid and reliable enough to
apply five, ten or twenty years in the future.
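
The projection step just described can be sketched in a few
lines. This is a minimal illustration only, assuming a simple
structural model in which the overall participation rate is
the sum of group-specific rates weighted by population
shares. The group names, coefficient values and projected
shares are hypothetical and are not taken from CORD Study
data.

```python
# Hypothetical structural coefficients: participation rate within each
# demographic group (illustrative numbers, not CORD estimates).
coefficients = {"under_25": 0.30, "25_to_44": 0.20, "45_and_over": 0.10}

# Hypothetical population shares by group, observed and projected.
shares_now = {"under_25": 0.40, "25_to_44": 0.35, "45_and_over": 0.25}
shares_future = {"under_25": 0.30, "25_to_44": 0.35, "45_and_over": 0.35}


def participation_rate(coeffs, shares):
    """Overall rate = sum over groups of (group rate x group share)."""
    return sum(coeffs[g] * shares[g] for g in coeffs)


# Coefficients are held fixed; only the demographic mix changes.
rate_now = participation_rate(coefficients, shares_now)
rate_future = participation_rate(coefficients, shares_future)
print(f"current rate:   {rate_now:.3f}")
print(f"projected rate: {rate_future:.3f}")
```

Holding the coefficients fixed while the population shares
change is what makes the projected change a
"demographically induced" one in this sketch.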

The author believes that when demographic changes are
the predominant reason for changing participation levels,
there is good reason to argue that accurate projections of
participation in the future will only be achieved by the
procedures endorsed here, rather than by simply projecting
an observed trend in a dependent variable. However, if a
planner is interested in future levels of new activities
(e.g. sky-diving or snowmobiling), or in the future
importance of currently non-existent activities which can be
expected to grow out of future technologies and fads, then
the "demographically induced trends" projection approach is
either too risky or impossible.

Problems in operationalizing the demographic trends 
approach are due to: 


1. accuracy of model coefficients;
2. problems with model structure; and,
3. changing model structure over time.


Therefore, it is necessary to (a) evaluate all
projections on the basis of common sense, (b) check
projections by comparison with simple trend projections,
(c) re-evaluate models as new data, allowing new parameter
estimates, become available, and (d) evaluate and revise
models as new research results become available, allowing
improvement of models with existing data.

Finally, many readers may find it surprising that in
this paper there has been no discussion of causal modelling.
Obviously, one problem with the kind of model discussed
earlier is that the variables do not have the reciprocal
non-causal relation implied by considering all the
independent variables as exogenous. It is really only a
matter of priorities and time that has prevented the use of
path analysis or the econometric method of multiple-stage
least-squares to further clarify how models should be
structured. Certainly the author is interested in seeing the
value of path coefficients for a model like that below:


[Figure: path diagram linking Education, Income, Family and
Size of place of residence to Participation in activity X;
remaining labels and arrows illegible in source.]


Incidentally, in the above the rather strange "role" of
age reflects the fact that age, with cross-sectional data
for a given year, reflects the generation and values of a
generation by its relation to year of birth.

In other words, many research steps remain to be taken
before the general projection procedure discussed here
becomes a tested tool for use by a technician. Projection is
still partially "art" as well as "science". A researcher or
planner should be willing to abandon any "accepted"
projection methods which have more problems than those that
were described for the modelling approach. It is necessary
to continually update and revise the professional's
repertoire of modelling techniques as improvements are made.
Unlike many engineering fields, recreation's "accepted
techniques" are generally far from totally satisfactory for
even "simple" problems.


REVIEW


Taking the two notes in order, one should note that
many of the deficiencies of TN 22 are obvious. There are no
indications of which trends cited in the paper are real and
which are statistical. There should have been a
consideration of reliability - at least of statistical
reliability - so that the reader would know what the
variances are in the various numbers presented. Furthermore,
there should have been a discussion of validity.

Before pursuing these issues further, it is fair to
point out that this paper was put together early in the
analysis stage of the CORD Study. What is more, at one point
estimates of statistical error in various observations were
prepared but they were never incorporated into the paper
because by that time interest had diminished and it was not
considered worthwhile re-writing the paper with statistical
accuracy estimates.

One of the chief reasons for this was the results of
the validity analysis presented in TN 24. In contrast to
reliability, which indicates whether a certain result will
be reproduced in a similar survey, analysis of validity
focuses upon whether or not "what" is supposed to be
measured is being measured. TN 24 has shown that the
results in TN 22 are very questionable in many respects.
Part of the problem relates to definitions. The National
Survey question about the number of times a person had gone
fishing in the past year almost certainly resulted in wives
who went along (but did not fish) replying that they
had, in fact, fished. Other validity problems relate to
sample selection and survey methodology. There are biased
estimates of numbers of participants and amount of
participation, but the magnitudes of the biases are not
known, so the estimates cannot be corrected to give valid
figures for participation rates in, say, hunting and
fishing.

To give the reader some idea of the magnitude of the
validity problems involved, predictions of the amount of use
of National Parks and National Historic Sites based on data
in the 1972 National Survey proved to be more than twice the
actual use. One reason conjectured for this discrepancy is
that by the time some people answered the question about use
of particular National Parks and particular Historic Sites,
they had forgotten they were supposed to be answering about
their use during the past year and, in fact, gave the total
number of times they had ever used them (or at least the
number of times in the past several years). Obviously, the
results obtained cannot be used unless the nature of the
bias resulting from incorrect answers can be fully
documented and corrected for. However, a more obvious way of
making a correction is by seeing that the answer to the
right question is obtained. But this gets into broader
issues of recall and the biases in recall, which have not
been adequately investigated as part of outdoor recreation
research (CORD Study or other). Millions have been spent on
surveys but little on seeing that the results are valid (as
opposed to reliable). Regardless, when one knows that there
are serious questions about the validity of the results of
the National Surveys, there is little point in becoming
overly sophisticated in defining the amount of statistical
error involved in different observations.

Admitting the inaccuracy of many of the responses, one
important point concerning TN 22 deserves mention. Even
though the absolute numbers and percentage participation
rates in different activities may be incorrect, it is
possible that trends determined using these rates are still
correct. The fact is that if various groups that were
sampled incorrectly increased their participation in an
activity at about the same rate, the percentage rate of
increase of participation in the activity for the population
will be estimated correctly even when "absolute"
participation is estimated incorrectly. Even if the rather
stringent assumption just stated for the percentage rates of
increase to be exactly correct is not totally satisfied,
there is every chance that trend information presented in TN
22 is "more valid" than the absolute figures. Thus, there
may be some merit in doing statistical error analyses to
identify trends that are worth examining and those which are
so dominated by statistical error that they cannot possibly
be accepted as indicating anything real.
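
The point about trends surviving a biased level estimate
can be checked with a small numerical sketch. The counts and
the bias factor below are hypothetical; the only assumption
is that the survey misstates participation by the same
proportion in both survey years.

```python
def percent_change(before, after):
    """Percentage change from one survey year to the next."""
    return 100.0 * (after - before) / before


# Hypothetical true participation counts in two survey years.
true_1969, true_1972 = 1000.0, 1200.0

# Assume the survey overstates participation by the same factor in both
# years (e.g. non-fishing companions counted as anglers).
bias = 1.5
observed_1969, observed_1972 = bias * true_1969, bias * true_1972

true_trend = percent_change(true_1969, true_1972)
observed_trend = percent_change(observed_1969, observed_1972)
print(true_trend, observed_trend)
```

The absolute figures differ by the bias factor, yet the
percentage change is the same in both series. If the bias
differed between years, the two trends would no longer
agree, which is the "stringent assumption" referred to
above.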

Turning to TN 13, a proposal to do something that (1)
is hypothetical and (2) contains a critique of what is
proposed is difficult to comment on. It may never be
possible to adequately define the parameters of models and
estimate them with sufficient accuracy to do what is
proposed in the note. In addition, the methodology suggested
is, as indicated in the note, irrelevant (a) where there are
major changes in the facilities or programs in an area or
where policy changes regarding the use of facilities take
place; (b) where innovation and technological change result
in discontinuities in behaviour that cannot be modelled by
the kind of model suggested; and (c) where social upheaval
and/or political change result in changes in (or constraints
on) behaviour that cause discontinuities for which the model
cannot possibly account. The probability that the necessary
data will be available to develop a good model for any
particular activity seems slim, and even if a model can be
developed the chance that the results predicted will not be
invalidated by factor (a), (b) or (c) seems low.

To elaborate on the latter point, in preparing the
model, a lot of assumptions are made about social continuity
and economic stability, etc. With a world energy crisis and
world food crisis on the horizon, models that depend on
stability and continuity do not appear too likely to produce
good results. From another perspective, examination of TN 6
permits one to get some idea of the sample size necessary so
that survey results can be used in developing a good model.
TN 20 has shown that there will be major problems in
determining all the terms that should be included in a model
even if there were 100,000 independent interviews to work
with, and that determining interaction effects is no easy
matter, although they must be considered if a model is to be
valid. Furthermore, TN 29 has shown some of the mammoth
problems involved in incorporating supply into the kind of
model proposed. Yet supply must be considered if it changes
over time.

Given these points, it is really only appropriate to
consider that TN 13 gives a method that should be applied in
conjunction with other less quantitative and/or less
sophisticated projection approaches (say a Delphi). It may
be pretentious to suggest that the method has anything to
offer that is not as adequately handled by simple
extrapolation of current trends. Still, it is intellectually
appealing to suggest that we can correct for demographic
change. However, unless adopting this complicated analysis
procedure brings about a commensurate gain in the actual
quality of the final estimates over simple procedures, the
use of the procedure is not justified. So one must ask, when
making estimates for five (or particularly 10 to 20) years
in the future, whether common sense may be a better guide
than a prima facie sophisticated quantitative tool. The
answer to such a question is not known and it is truly
unfortunate that TN 13 or some other note did not make
specific progress in answering this question for at least
some circumstances.


CHAPTER X
THE UNITY IMPLICIT IN THE CORD STUDY ANALYSIS 


J. Beaman and N.H. Do


CORD STUDY TECHNICAL NOTE 33 


A UNIFIED FRAMEWORK FOR UNDERSTANDING 
CANADIAN OUTDOOR RECREATION DEMAND STUDY RESEARCH 


PURPOSE 


The purpose of this Note is to show the inherent unity
that pervades the Canadian Outdoor Recreation Demand Study
research work by showing how modelling behaviour has
ultimately led to the development of theory and methods that
aid one in appreciating both the nature and complexity of
interrelationships between apparently differing research
areas. There is also some concern with how modelling
relates to the development and/or testing of allocation and
allocation evaluation methodologies.


INTRODUCTION 


Canadian Outdoor Recreation Demand Study research
started out as a collection of relatively disjointed tasks.
As one can see from the chapters in this volume, origin
modelling, destination modelling, the study of attractivity,
etc. were seen as relatively autonomous research areas.
But something rather revolutionary happened when it was
recognized that the Cesario model (TN 4), to be interpreted
in a behavioural way, should be generalized so that an
alternative factor was introduced into Cesario's city
emissiveness (see TN 11). This recognition began to bring
together other disjointed observations to form an increasing
picture of unity between different research projects. When
studying attractivity in a comparative perspective (TN 9) it
was found that to explain the difference between site
specific attractivity factors and general attractivity
factors for the use of a park based on people's behaviour,
there was at least some evidence that a kind of destination
alternative factor should be introduced. Pursuing loose
ends in TN 1 resulted in developing TN 3 on alternative
factors. In TN 3 both origin and destination alternative
factors were eventually considered. It was also noted that
in the 1971 proposal for an overnight use model (TN 30) an
alternative factor had been suggested which turned out to be
essentially the one that was independently introduced into
the generalization of Cesario's model to explain the "supply
generates demand" and other effects (TN 11).

It is in the context just introduced that this paper
was conceived. This paper was to include discussion of
further generalization of the Cesario model (beyond TN 11)
to a non-Cesario model proposed by Beaman and Leicester
(1970). Firstly, a destination alternative factor was to be
introduced into the Cesario model. Then the Beaman and
Leicester model was to be presented so that
substitutability-complementarity could be discussed. But,
after some consideration of the deficiencies of the "twice
generalized" Cesario model, it was concluded that a
potential function argument compatible with the Cesario
model should be presented to bring in considerations which
should play a role in modelling decision making.

Prior to presenting the specific generalizations
finally arrived at, and in the way of further introduction,
it is useful to consider what is already implicit in the
Cesario model and what may constitute a generalization. For
example, one may ask what happens when a particular park or
facility is improved in some respect. Suppose that one
observes that there is an increase in participation. This
change in realized demand can be attributed to five more or
less distinct sources according to Burton (References,
Chapter IV):


1. Existing demand - growth in actual demand;

2. Latent demand - previously frustrated demand;

3. Induced demand - no desire until opportunity
presented itself;

4. Diverted demand - diverted from a similar
facility elsewhere;

5. Substituted demand - substituted from one type
of facility/activity to another.


It is possible to suggest that "demand elements" one to
four above are included to some extent within the Cesario
model. Existing total demand is characterized in the
Cesario formulation through the inherent emissiveness of a
city modified by the "supply generates demand alternative
factor" effect. Latent demand can be associated with time
budget variables which are not presently identified in the
Cesario model but which, according to some Technical Notes,
can and should be considered (e.g. TN 8 and weekend, holiday
and weekday use of destinations). With weekend and holiday
and weekday models, new participants are implied by the
models to see an opportunity that is within their time,
money or other constraints when a time budget change in
holidays takes place. Previously they may have been
"prevented" from participation, but socio-economic changes,
such as a move to a 4 day work week, mean that
weekend-holiday models apply to 3/7 or more of the days in a
period whereas a weekday model is only appropriate for 4/7,
not about 5/7, of the days. It is hard to say whether
existing or latent demand or both relate to the proper
consideration of "time bias" as introduced into the Cesario
model in the appendix to TN 38. If people are willing to pay
a money price, but not willing or able to pay the
time-distance price for use of a site, is the increased use
of a new site that is closer, but for which the users pay
the equivalent of travel costs, a reflection of prior
frustration of existing demand, or of something else?

Finally, "Induced" and "Diverted" demand have clear
interpretations. Induced demand should be identified with
"the supply generated demand effect" since it is an effect
that, by definition, relates increased participation to
extra participation "induced" by increasing supply. For
diverted demand, one need only recognize that attractivity
changes alter people's choice of a destination. So
understanding diverted demand in relation to the Cesario
model involves understanding the formula that indicates how
total trips generated by an origin are prorated to
destinations depending on their locations and
attractivities. In commenting on how the various demands
can be identified in the Cesario model one may have noticed
an undesirable vagueness, particularly regarding the first
two types of demand. And, though there is some vagueness
with respect to all of the first four types of demand, it is
somewhat distressing to notice that substituted demand is
irrelevant in the Cesario model. At least this is the case
until destination alternative factors are introduced in the
next section of this paper. Then substitution at
destinations is considered. In Section 3 of this paper the
Cesario model is elaborated upon, so one also sees how
substituted demand can be identified in an origin based
substitution mechanism. Still, all of the 5 "types of
demand" referred to here are not very clearly defined and do
not relate to one well-defined behaviour of people.
Specifically, in the paragraphs above there was no emphasis
on the types of user groups for which different Cesario
models should be formulated. Demand will be affected
differently by changes in supply for different types of
users. Different models are necessary for different types
of users and for different time budget considerations such
as annual vacation, weekdays after work, weekends, etc.
Having such models is a critical matter in developing a
better understanding of behaviour and particularly a better
ability to predict behaviour.

The importance of considering user groups has been
stressed elsewhere in this volume (see TN 7, 18, 30, 40,
etc.). Having or using different models is only mentioned
here because recognizing the need to develop different 
models, for flows for which attractivity or city 
emissiveness will differ, is as much a "generalization" of 
the Cesario model as showing the possible structure of 
emissiveness (as done in TN 11) or as introducing 
destination alternative effects and origin substitutability 
of activities as is done in this paper. It is, however,
because (1) the need for using different models is well
known and (2) few theoretical insights follow from anything
that is or would be developed in this paper that no more is
included on the matter. Still, this class of
generalizations, if pursued in considering time constraints,
capacity of site constraints, and the influence of exogenous
variables such as weather (short term) or changes in roads
(long term), leads to interesting and extremely complicated
dynamic model formulations which, as indicated in TN 40,
must ultimately be developed if really good solutions to
visitor flow and visitor generation by origin problems are
to be found and used where now only questionable estimates
are made.


GENERALIZATION 1: THE DESTINATION ALTERNATIVE FACTOR 


Consider that a person is contemplating making a trip
with certain activities in mind and that he starts to choose
between destination areas at which he intends to carry out a
main activity. Does he think in terms of attractivities
that are park specific? Though this impression was left in
the way TN 11 was written, and is implied in TN 4, this need
not be the case. It is possible that factors other than
characteristics of a particular park may influence the
attractivities that a person perceives. These other factors
may be such that when the Cesario model is used to estimate
"park attractivities" the attractivities estimated may
indicate not only something about the parks for which they
are estimated but about activities that are available around
them. One may recall that in TN 4 Cesario has stated that
attractivity, which is a property of a destination, should
not be affected by changing a destination's location.
Implicit in this statement is "other things being equal".
But the question is: WHAT other things being equal? One is
left with the impression that the other things that are to
be controlled for are the effect of distance and the effect
of the different city emissivenesses on the use of a
particular site. It is hard to read into Cesario's
statements the notion that the invariance of a park's
attractivity with location holds only when one considers
moving other parks along with the particular park.
Nevertheless, consider the problem that arises when a
municipal authority, a provincial authority and a national
authority along with private developers create a collection
of complementary facilities all of which may be fairly
similar. The user of any one facility may perceive the value
of the opportunity that he has in terms of the value of the
total resource base available to him, including the
possibility of using the federal campground if the
provincial campground is full, etc. If one were trying to
understand the attractivity of each of these parks, carried
out a Cesario type analysis to estimate their attractivity
(TN 4), and based the analysis only on characteristics of
specific sites, one could expect not to explain the
attractivities of the particular sites well. If one
recognized that an attractivity coefficient estimated using
the Cesario model probably varies in relation to
characteristics of a site and the composite collection of
sites around it, and carried out the analysis with
appropriate variables, then one could expect better results.

The preceding paragraph may, however, seem rather
obtuse. So now consider that the Cesario attractivity,
A(d), for a park for a main activity x does depend on what
is in a park, d. This component of attractivity may, for
example, be called the inherent attractiveness, IA(d), for
the park for x as perceived by the certain type of user from
a group g who is being considered because he goes to d for a
certain type of visit (e.g. weekday use for a picnic; on
use classification see e.g. TN 8 and TN 30). It is quite
conceivable that the "Cesario attractivity", A(d), would be
defined somewhat as follows:


(1)  $A(d,x,g) = IA(d,x,g) \cdot S(d,x,g) \cdot EXT(x,g,A)$


WHERE  $S(d,x,g) = \left[ \sum_a IA(a,x,g) \exp(-P(x,g) D(d,a)) \,/\, IA(d,x,g) \right]^{SCX}$


WHERE the sum is over all available locations a for people
to participate in x,


P(x,g) is the effect of distance on competition for
people in group g carrying out the particular
activity,


D(d,a) is the distance from d to available location a,


SCX is the substitutability-complementarity exponent for
activity x which indicates the positive or negative
influence of similar activity being available at other
like nearby facilities, and


EXT(x,g,A) is a function, described immediately
following, which indicates the influence on A(d) of the
availability of activities defined in the activity
vector A, other than x, when these other activities are
available to the users of d outside of the destination
d but relatively near to it.
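

A short computational sketch may make the structure of
Equation 1 easier to follow. It assumes the equation is read
as A = IA x S x EXT, with S equal to the distance-discounted
sum of inherent attractiveness over all locations a (taken
here to include d itself) divided by IA(d), the whole raised
to the power SCX. Since no explicit form of EXT is given, it
is passed in as a plain number. The site names, values and
function names are hypothetical.

```python
import math


def substitution_factor(d, inherent, distance, p, scx):
    """S(d,x,g): distance-discounted sum of inherent attractiveness over
    all locations a (including d itself), relative to IA(d), raised to SCX."""
    total = sum(ia * math.exp(-p * distance[(d, a)])
                for a, ia in inherent.items())
    return (total / inherent[d]) ** scx


def cesario_attractivity(d, inherent, distance, p, scx, ext=1.0):
    """A(d,x,g) = IA(d,x,g) * S(d,x,g) * EXT(x,g,A)."""
    return inherent[d] * substitution_factor(d, inherent, distance, p, scx) * ext


# Hypothetical data: two sites for one activity x and one user group g.
inherent = {"d1": 10.0, "d2": 5.0}
distance = {("d1", "d1"): 0.0, ("d1", "d2"): 20.0,
            ("d2", "d1"): 20.0, ("d2", "d2"): 0.0}

a_d1 = cesario_attractivity("d1", inherent, distance, p=0.1, scx=-1.0)
print(round(a_d1, 3))
```

With a single site in the dictionary the sum reduces to
IA(d) itself, S equals 1 and the function returns IA(d)
unchanged, which corresponds to an "isolated" destination.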


No explicit form of the function EXT(x,g,A) is given
here. One need only understand that this function indicates
how the availability of activities which are not related to
major trip purpose, but which somewhat influence one's
decision about where to go, should be taken into account in
assessing a park's perceived or Cesario attractivity, A(d).
Having some kind of amusement park facilities in the
vicinity of a park may influence certain types of user to go
to a destination d. Also it may influence other types of
users not to go there, so that in the one case (e.g. for
g=1) the value of EXT inflates A(d) and in the other (e.g.
for g=2) depresses it. The critical point is: For some
group of users, g, who are homogeneous in the way that they
react to given circumstances when they are on a trip of a
given type for a given purpose, it is reasonable to claim
that their trip making behaviour can be influenced by what
is near to a park. This may be because of the kinds of
people that the activities available around the park
attract to that park. It is this and similar types of
influences which should be considered to be embodied in the
EXT function.

Before proceeding to comment on the other components of
the right side of Equation 1, it seems important that the
reader should note that the "Cesario attractivity" defined
by A(d,x,g) is the attractivity that would be estimated if
one used the Cesario model to estimate the parameters of a
model for trip distribution flow for the people in g who,
for example, went to the particular site for picnicking on a
weekday during a given season. That is, if emissiveness
"works" the way that it is postulated to, here or in one of
the other related generalizations which are possible, the
very fact that the function (Equation 1) can be written down
shows that the Cesario "attractivity" need not be site
specific but may be influenced by external factors and by
"competing" or "complementary" sites for the activity x.
This is true because an A(d) defined by Equation 1 would be
estimated correctly using the analysis of variance of trip
flows as described in TN 4. IA(d) would not be estimated
unless, as indicated later, d is an "isolated" destination.
The difference alluded to is the difference between the
Cheung and Cesario measures of attractivity which were
studied in TN 9.


The reason that a main activity x is stressed is that
the formulation presented above is not at all appropriate
for camping when the dominant activity on a trip is
something like visiting relatives, Ontario Place, Fortress
Louisbourg, etc. In such cases the site at which one stays
is possibly simply a location to gain access to the main
activity. In such circumstances the model is certainly not
appropriate for assessing destination area attractivity. The
kind of situation just described is the circumstance
associated with camping on Prince Edward Island during the
summer.


Finally, the remaining and the key idea being suggested
in Equation 1 is that if there are a number of facilities at
which an activity x can be carried out by a particular type
of people who are being considered, the inherent
attractivity of one site influences that of another. For
people in group g with a particular trip purpose, one can
picture sites in some sense emitting potentials so that if a
person is thinking about a given location he perceives the
potential for that given activity. (TN 5 and 17 present
methods of measuring potential or pressure.) However, he
does not really perceive it for, say, d alone; he is always
considering (1) the inherent attractiveness of a given site
(2) augmented by the inherent attractivenesses of
alternative sites deflated by some function of distance.
The function of distance can be considered to express how a
person would react to the separation between the particular
destination d which may be thought of at a given moment and
its alternatives. These are reacted to as if one was at d.
So, the "impedance of distance coefficient", P(x,g), is not
the same impedance of distance coefficient that applies to
travel by the particular type of people for their particular
type of trip in terms of them actually going to destination
d. An impedance coefficient of 2.05 which may apply with
respect to travel to d is almost certainly larger than the
P(x,g) of Equation 1. Once people are at the site they might
see distance as offering a much greater impedance, say an
impedance of .13 which shows a rather strong reluctance to
travel more than a few miles.

The ideas suggested above may be more clear if one
notes what Equation 1 implies under certain conditions. If,
for example, all or some sites are "isolated" in that
S(d,x,g) is close to 1.0, and there are no external
influences (so EXT(x,g,A) is also about 1.00), then using
the Cesario model one estimates IA(d), site specific
attractivities, for these sites. Now assume that there is an
isolated site that is divided into four parts which are
close to each other, and assume that all parts serve the
same function and that each part receives 1/4 of the
visitors. The Cesario model will show that each part is 1/4
as attractive as the whole, whereas if one may consider that
SCX=-1:


A(d,x,g) = 1/4 IA(d,x,g)
IA(d,x,g) = 4 A(d,x,g)


So by the computation above, based on D(d,a)=0 between the
four parts, one may see that each part may still be
considered as inherently attractive by its users as the
total park.
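
The arithmetic behind this result can be written out
explicitly. The lines below are a sketch under stated
assumptions: Equation 1 is read as A = IA x S x EXT with S
the bracketed ratio raised to the power SCX, the sum over
locations includes d itself, the four parts have equal
inherent attractiveness, D(d,a) = 0 between them, and
EXT = 1.

```latex
\begin{align*}
S(d,x,g) &= \left[\frac{\sum_{a=1}^{4} IA(a,x,g)\,e^{-P(x,g)\cdot 0}}
                       {IA(d,x,g)}\right]^{-1}
          = \left[\frac{4\,IA(d,x,g)}{IA(d,x,g)}\right]^{-1}
          = \frac{1}{4},\\[4pt]
A(d,x,g) &= IA(d,x,g)\cdot\frac{1}{4}\cdot 1
          = \tfrac{1}{4}\,IA(d,x,g).
\end{align*}
```

Summed over the four parts, the estimated attractivities
add back up to the inherent attractiveness of a single part,
which is the sense in which dividing the site leaves total
use unchanged when SCX = -1.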

From the preceding the reader may already have
concluded that SCX=-1 is a critical value. For SCX=-1 a
situation is defined where dividing a park does not affect
the inherent attractivity of its "adjacent" parts. This
actually makes behavioural sense within limits for certain
types of parks for certain types of users on certain types
of trips. There is no anomaly in the fact that a park with 3
campgrounds, each with its own resource base, has each
campground inherently as attractive as the three considered
as one destination with 3 parts. In other cases an SCX of -2
would be appropriate to show that for some users making a
"wild river" or wilderness park into 4 equally attractive
parks detracts from each so that: IA(d,x,g) = 2 A(d). One
must recall that if 1/4 of the visitors that do come to a
park go to each site, the equation just cited (based on
SCX=-2) implies that each part of the park will only be seen
to be inherently 1/2 as attractive as the whole and thus in
total only 1/4 as many visitors will come if the park is
divided. As well, the opposite could occur with SCX less
than -1. People could perceive the parts as complementary
alternative destinations and increase their use of the
general destination area by going to the "different" parks
for variety. In reality there need be no variety. It is
interesting to conjecture on the use one mountain park would
receive if Banff, Jasper, Yoho and Kootenay were combined.

The preceding paragraph provides good examples on which
to comment on why there is a concern here with both
complementarity and substitutability. When the perceived and
the inherent attractivity of a site is decreased because of
"competition" with, or at least the existence of, other
sites offering one the same experience, it seems fair to say
that the sites are substitutes for each other. But if having
several parks rather than one large one, or having several
destinations near each other where one can go if one area is
full, results in an increase in total use over what the
destination area as a single entity would receive, then one
can refer to the sites as complementary. To refer to these
parks as substitutes would at least be a poor or misleading
use of language. Obviously, in terms of the examples in the
preceding paragraph, SCX greater than minus one defines a
condition of substitutability whereas if SCX is less than
minus one there is complementarity. Nothing more is stated
now because the relation is commented on again later in
discussing analogous origin based
substitutability-complementarity conditions.

One could pursue the intricacies of estimating the
IA(d,x,g) values along with the other unknowns in Equation
1. However, then the kinds of problems discussed in TN 11
would have to be pursued since in the original estimation
exercise if "Cesario attractivities" were estimated for 50
sites, only 49 independent estimates of attractivity would
be available on which to estimate 50 inherent attractiveness
values, an impedance of distance value, a substitutability-
complementarity exponent and some parameters of a function
EXT(x,g,A). Obviously constraints have to be introduced or
parametric forms of the inherent attractiveness function
must be estimated so that model parameters can be found. If
one pursued the alternative of defining the IA(d,x,g) in
terms of site characteristics then one may note that the
estimation procedure would be the logical extension of the
AID analysis of attractivity pursued by Cesario in TN 4.


GENERALIZATION 2: SUBSTITUTABILITY AND THE DECISION TO
PRIMARILY DO ACTIVITY "a" AT A MAIN-DESTINATION
ON A "FAIRLY" SINGLE PURPOSE TRIP


For this generalization it was planned originally to
present a formulation by Beaman and Leicester (see Reference
Chapter VI) which was not developed with the Cesario model
in mind. However, it was then recognized that the important
ideas of concern could be elucidated by presenting a model
for which "phase I" emissiveness and attractiveness values
could be estimated as Cesario has illustrated (TN 4). So,
rather than introducing new modelling concerns, it was
decided to further clarify what Cesario's parameters could
mean when use of alternative facilities is allowed.

To present this final generalization of the Cesario
model it is convenient to introduce it in stages, starting
with very simple assumptions and then introducing other
matters which must be pursued if overly simplistic
assumptions about what emissiveness means are to be relaxed.
So initially accept that there is the kind of aggregate that, as
indicated in the last section, might be determined with the
methods described in TN 10, or a group "g" as described in TN
19, the behaviour of which is being considered. Accept also
that these people are assumed to be "homogeneous" in terms
of their interest in the only main-destination activity, x,
that is available, meaning that all people in the aggregate
have the same probability of participating in the given
activity in similar circumstances (at the same location, on
the same day of the week, under the same time constraints,
etc.).

Now visualize that at a point in time any one of these 
people has a choice of going to a facility to participate in 
x or not going and that this decision is made in terms of 
"the potential" that the person perceives for participating 
in activity x. In the context of TN 5 one may consider that
at every point in space at which this person might be, there 
are potentials to participate at different facilities which 
he perceives and that these facility specific potentials are 
defined by: 


(2) P(o,d,x,g) = A(d,x,g) exp(-r(x,g) D(o,d))


WHERE A(d,x,g), r(x,g) and D(o,d) are as defined in TN 11
except that activity and type of person subscripts have been
added.
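
As a minimal sketch of Equation 2, the following Python computes
facility-specific potentials for one origin. The attractivity
values, the impedance parameter r and the distances are invented
illustrative numbers, and the function name is hypothetical; none
of them comes from TN 11.

```python
import math

def potential(attractivity, r, distance):
    """Equation 2: P(o,d,x,g) = A(d,x,g) * exp(-r(x,g) * D(o,d))."""
    return attractivity * math.exp(-r * distance)

# Hypothetical facilities seen from one origin o, for one activity x
# and one group g. "A" is manifest attractivity, "D" is distance.
facilities = {
    "near_small": {"A": 1.0, "D": 10.0},
    "far_large": {"A": 3.0, "D": 60.0},
}
r = 0.02  # assumed impedance-of-distance parameter r(x,g)

potentials = {name: potential(f["A"], r, f["D"]) for name, f in facilities.items()}
for name, p in potentials.items():
    print(f"{name}: {p:.3f}")
```

With these assumed numbers the larger but more distant facility
still has the higher potential, which shows how attractivity and
distance trade off in the formulation.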


Also, consider that there is an impact on total
participation which depends on the best opportunity in a way
defined by a weighted sum of potentials, an alternative
factor, defined over each of the individual facilities at
which a person might participate:


(3) Total Participation = M(o,x,g) AIU(o,x,g)
TP(o,x,g) = ((P(o,d,x,g)/PP(o,d,x,g))**KSE)**SM
PP(o,x,g) POP(o,x,g) C(o,x,g)
WHERE again the terms are as defined in TN 11.

In specifying the preceding, it must be accepted that
a person (party or some other group) may be considered to
have some rule according to which it is decided which of the
alternative facilities available is to be visited to
participate in x. The rule need not be explicitly defined,
but still trips to the different facilities may be made in
proportion to the
potentials for participation that are "perceived" for each
of the alternative facilities (Equation 2). Then if the
expected total number of trips in a time period is defined
in terms of reaction to "total" supply, as defined in
Equation (3), the amount of use that will be made of a site
can reasonably be taken to be:


(3A) V(o,d) = P(o,d,x,g) TP(o,x,g)


An alternative rule for distributing use to sites is
that people are economically rational in terms of
potentials. In this case it would be assumed that one
always goes to the facility with the highest potential.
This formulation is not compatible with the Cesario model
because zero flows from an origin to all but one destination
by people in a group g imply zero attractivity for the
unused sites. But if some are used from other origins, then
for these origins the first site has zero attractivity.
These different "manifest" attractivities contradict the
Cesario formulation which is, according to "Generalization 1"
of this paper, at least based on the assumption that from
all origins a destination area's attractivity is the same
for comparable users.
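
The contrast between the proportional rule and the "highest
potential" rule can be sketched as follows. The sketch assumes
that trips are shared out according to each facility's share of
the summed potentials, which is only one reading of "in
proportion to the potentials"; the function names, potentials
and trip total are invented for illustration.

```python
def allocate_proportional(potentials, total_trips):
    """Share total trips among facilities in proportion to their potentials."""
    total_potential = sum(potentials.values())
    return {d: total_trips * p / total_potential for d, p in potentials.items()}

def allocate_highest(potentials, total_trips):
    """'Economically rational' rule: every trip goes to the highest potential."""
    best = max(potentials, key=potentials.get)
    return {d: (total_trips if d == best else 0.0) for d in potentials}

# Hypothetical potentials P(o,d,x,g) for the same two sites seen from two origins.
origin_1 = {"site_a": 0.9, "site_b": 0.6}
origin_2 = {"site_a": 0.2, "site_b": 0.8}

for name, pots in (("origin 1", origin_1), ("origin 2", origin_2)):
    print(name, allocate_proportional(pots, 100.0), allocate_highest(pots, 100.0))
```

Under the second rule site_b receives nothing from origin 1 and
site_a receives nothing from origin 2, so no single
origin-independent attractivity could describe both sites, which
is the contradiction noted in the text.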

Regardless, at this point one should note that the
considerations introduced in the last few paragraphs are
very easily related to the matter raised in TN 28: How does
a person relate to the facilities that he knows are around
him and which in some sense he may perceive as offering him
opportunities to participate in an activity? In terms of
Equations 1 to 3 an answer is offered. In a simple
conceptualization one visualizes a person in some location
"impinged on" by potential surfaces. For now, the person is
not concerned about what activity to participate in but only
where to go given the potentials for x which he perceives.
He may also be concerned about how often to go, whether to
go under given weather conditions, etc., but the decision
processes involved in reacting to these factors are not
pursued here (the equations here are really for a "seasonal
average" rather than something more detailed, as based on
TN 8 methodology).

As indicated in TN 11 the kind of socio-economic
variables considered in TN 12 and in other Technical Notes
may be considered to define a general level of participation
to expect for a person with given characteristics. Or, as
pointed out in TN 10, 13, 32 and 37, it might be better to
identify a person with an aggregate of people based on his
activity package. But the matter of importance now is to
consider that a person is never only exposed to the various
opportunities to participate in a single activity. It is
reasonable to ask how one's exposure to multiple
opportunities, A, affects his choice to participate in a
given activity, x. In the last section it has already been
indicated that alternatives for the major activity or to the
major activity that occur around a destination can be
considered to affect "inherent" site attractivity, IA(d).
For that reason the perceived or manifest attractivity,
A(d), was used in the preceding equations to show the
direct influence on judgements of perceived attractivity.
The equations could be written with IA( )'s substituted for
A( )'s but the authors believe this to be inconsistent with
behaviour and with the TN 11 formulation in which the
estimated A( )'s were used in deriving explicit expressions
for origin alternative factors.

With the point about the importance of considering A(d)
in mind, now accept the obvious: that a person perceives the
potential to participate in a number of activities. How do
these different potentials operate in affecting the person's
decision to participate in a certain activity? For people
in the same aggregate, group g, one may visualize the
decision to participate in an activity as based on the
"merit" seen to be associated with each activity in a
collection of different activities, based on where each
person is with respect to various destinations.

For example, it is plausible to suggest that an
individual chooses among activities so that the proportion
of all trips which are made to participate in x can be
expressed for a person in "g" who lives at "o" as follows:


(4) PROP(x,g,o) = ws(x,g) PI C'(x) / SUM ws(a,g) PI C'(a)

WHERE


1) the sum is over all a in the activity vector A.

2) PI is the row vector of potentials P(o,d,x,g)'s defined
by Equation 2 for all a in A.

3) ws(x,g) is the weight that people assign to the
"desirability coefficient of activity x" defined by PI
C'(x) in measuring it against other such coefficients.

4) C(x) is a vector of complementarity weights c(x,y,g)
which indicate how much having unit potential for y
available "complements" negatively or positively
participation in x (c(x,x,g) is by definition 1).
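
Equation 4 can be sketched in Python as below. The sketch
assumes one potential per activity at the origin (how potentials
for several facilities would be combined is not specified here),
and the activities, weights ws and complementarity weights c are
hypothetical numbers chosen only to make the arithmetic visible.

```python
def desirability(x, potentials, c):
    """PI C'(x): potentials weighted by the complementarity weights for x."""
    return sum(potentials[y] * c[x][y] for y in potentials)

def prop(x, potentials, ws, c):
    """Equation 4: share of all trips made to participate in activity x."""
    numerator = ws[x] * desirability(x, potentials, c)
    denominator = sum(ws[a] * desirability(a, potentials, c) for a in potentials)
    return numerator / denominator

# Hypothetical values for one origin o and one group g.
potentials = {"picnic": 2.0, "historic": 1.0}
ws = {"picnic": 1.0, "historic": 0.5}
c = {
    # c[x][x] is 1 by definition; off-diagonal entries are assumed values.
    "picnic": {"picnic": 1.0, "historic": 0.5},    # historic sites complement picnicking
    "historic": {"picnic": 0.0, "historic": 1.0},  # no effect in the other direction
}

shares = {a: prop(a, potentials, ws, c) for a in potentials}
print(shares)
```

By construction the shares sum to one, and with a single
available activity the ratio reduces to 1.0 whatever the weight
or potential.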


The stochastic or proportional distribution
interpretation suggested above might be adopted to account
for the fact that from day to day, activities shift.
Shifts may in fact be a reflection of the utility of changes
in activity patterns. Basically the hypothesis introduced
is that certain weighting vectors are appropriate at any
point in time to assess the merits that are associated with
a particular activity for a person in a given aggregate.
Specifically, one may note that Equation 4 is defined in
such a way that if there is only one activity available to
an origin:

                  ws(x,g) P(o,d,x,g) c(x,x,g)
(5) PROP(x,g,o) = --------------------------- = 1.0
                  ws(x,g) P(o,d,x,g) c(x,x,g)

If there are n activities that do not interrelate (e.g.
c(x,y,g) = 0 for x ≠ y) and which have the same potential
then one can easily derive:


(6) PROP(x,g,o) = ws(x,g) P(o,d,x,g) / SUM ws(a,g) P(o,d,a,g)
WHERE the sum is over all a in A.


Equation (6) implies that a person in g distributes his
activity to the various possibilities in proportion to their
substitutability weighted potentials. This of course
involves the implicit assumption that (1) the activities are
equally substitutable in a "time slot" where participation
in any activity can occur. This involves the activities
being equally substitutable under different weather
conditions etc. Or it means that (2) in the aggregate
participation frequencies average out so that Equation (6)
can be used to explain what happens in the long run. It can
show average substitutability based on the different size
"time slots" being available, under different weather
conditions etc.
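
The special case in Equation (6) can be sketched on its own.
The activity names, weights and the common potential below are
hypothetical; the sketch only assumes that activities do not
interact, so each share is a weighted potential divided by the
sum of weighted potentials.

```python
def prop_no_interaction(x, potentials, ws):
    """Equation 6: shares when c(x,y,g) = 0 for all x != y."""
    weighted = {a: ws[a] * potentials[a] for a in potentials}
    return weighted[x] / sum(weighted.values())

# Hypothetical: three activities with the same potential at origin o.
potentials = {"camping": 1.5, "fishing": 1.5, "hiking": 1.5}
ws = {"camping": 2.0, "fishing": 1.0, "hiking": 1.0}

shares = {a: prop_no_interaction(a, potentials, ws) for a in potentials}
print(shares)
```

When the potentials are equal, as assumed here, the shares
depend only on the weights ws.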

In the context of the ideas presented in the last paragraph
one can see that PROP(x,g,o) is a ratio which can be
considered to reduce the inherent emissiveness, E(o), of a
city for a given activity where alternatives are available.
So, in the context of the generalization of the Cesario
model introduced in TN 11, one is now saying that at least
part of the C(1,o) and C(2,o) constants which were said to
relate to "city uniqueness" in TN 11 may not really reflect
city uniqueness. In fact if people in group g in city 1 were
compared with similar people in g in city 2 it is
possible that, say in terms of main-destination day-use of
parks, C(1,1).NE.C(2,1) or C(1,2).NE.C(2,2) merely indicates
that these constants (defined in TN 11 only) equal the
appropriate PROP(x,g,o) for the two origins and
PROP(x,g,1).NE.PROP(x,g,2). In other words, it is
reasonable to suggest that Equations (3B) and (3C) of TN 11
should be rewritten if it is to be clear that the
C( )'s do not "include" substitutability factors but only
"truly" reflect city differences other than supply
differences. One might for given x and g write:


(7) C(1,o,x,g) = PROP(x,g,o) CUNIQ(1,o,x,g)


(8) C(2,o,x,g) = PROP(x,g,o) CUNIQ(2,o,x,g)


WHERE now it is considered that the CUNIQ( )'s constants
really reflect city uniqueness.
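
Taking Equations (7) and (8) to state that each estimated
constant is the product of the appropriate PROP and a city
uniqueness constant, the uniqueness part can be backed out by
division. The sketch below uses invented numbers and a
hypothetical function name; it is not an estimation procedure
from TN 11.

```python
def city_uniqueness(c_estimated, prop_share):
    """Back out CUNIQ from an estimated constant C = PROP * CUNIQ."""
    if prop_share <= 0.0:
        raise ValueError("PROP must be positive to separate out CUNIQ")
    return c_estimated / prop_share

# Hypothetical estimates for two cities that differ only in PROP.
c_city_1, prop_city_1 = 0.40, 0.50
c_city_2, prop_city_2 = 0.64, 0.80

print(city_uniqueness(c_city_1, prop_city_1))
print(city_uniqueness(c_city_2, prop_city_2))
```

In this invented case the two estimated constants differ, yet
the implied uniqueness constants are essentially equal, so the
whole difference between the cities comes from the availability
of alternatives.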


Incidentally, the fact that Equations (7) and (8) are
simply a new way of writing emissivenesses which can be
estimated as discussed in TN 11 is proof that origin
substitutability-complementarity may influence the values of
E( )'s estimated using Cesario's "phase I" estimation.
Given that this is true it also follows that if truly good
models of behaviour are to be built, empirical work on
assessing or measuring substitutability should proceed.
However, given that results presented in TN 29 show how
difficult and potentially costly it is to assess response to
just one type of supply, it must be recognized that work on
substitutability-complementarity should be carefully
designed and proceed slowly from one success to another
rather than being undertaken in a grand experiment. Any
grand project will have little chance of success given the
problems to be solved in "origin emissiveness" studies,
their relation to visitor flow analysis problems to be
solved, and the relation of these problems to other problems
as described in the next section of this paper.

Before turning to the next section some practical
insights regarding "city uniqueness" and complementarity-
substitutability may make the preceding formulation more
understandable. Firstly, regarding substitution, one may
note that near some cities, Ottawa for example, there are
abundant opportunities for cottaging whereas in other
locations this is not the case. In some provinces there are
abundant crown lands that can be used for camping, hunting
or fishing with few restrictions. In others one or all of
these activities are rigidly controlled. Now both cottaging
(not in a park) and use of non-park crown land for camping
"compete" with using parks for anything. When people
allocate their finite time and money to participating in
activities they make allowance for substitutes. So here one
may note that formulae presented earlier should and do allow
the effect of having such substitutes available to be
accounted for in estimating participation in activities.
Complementarity is another matter, as Beaman and Leicester
(1970) have stressed, and it has often not been dealt with
adequately by those analysing behaviour or proposing
analysis strategies (see TN 37 and comments in TN 10 and TN
32). The opportunity to go shopping in Alma townsite
outside Fundy National Park or to drive for pleasure to the
horse races and shop and eat in various places in PEI
complements a visit to PEI National Park for a certain type
of visitor, but in this paper such influences should be
related to Generalization 1 and destination alternative
factors affecting a destination's inherent attractiveness.
Rather, having a high potential at an origin to visit
historic sites may not really relate to high emissiveness to
picnic grounds because the attractiveness of particular
picnic grounds is enhanced by historic sites being near
them. People may picnic when or as long as the weather is
good and then "retreat" to a complementary activity when the
weather is bad. Another case is where a person would not go
to a park unless the trip offered the complementary
experience of a drive for pleasure. The general potential
for driving for pleasure in going to parks from Ottawa may
"enhance" the "raw" potential for visiting parks whereas
traffic problems in Toronto or lack of scenery in
Saskatchewan near Regina may reduce "raw" potential for park
visits (c(x,y,g)'s can be negative or positive).

Finally, to focus again on an important concern:
implicit in the idea that the person who is being considered
is a certain type of person (the issue taken up in TN 10,
32, 37 and to some degree in notes like TN 8, 14, 37, 30) is
the following. If a person is an outdoor orientated person
with an orientation to exurban rather than urban outdoor
activities then at a given point in time, let us say after
work on a given day with good weather, the person may have a
greater tendency than an athletic activities oriented person
to rush outside the city to some area for a brief hike. The
other person alluded to might go to a nearby tennis court
and play tennis or go to an indoor area and play squash.
But, returning to the main concern, if both persons referred
to are confronted with the same geographic configuration of
facilities at which to carry on a number of different
activities (which may also include social and cultural
activities) and have the same time constraints and weather
conditions, each may be viewed as perceiving different
attractivities for different facilities and thus as
assigning different weights to the merits of carrying on
certain activities.

Now, it would be possible to turn to estimation
concerns in relation to Generalization II, but as was the
case with Generalization I little is said. Certainly in
1976 from a modelling perspective it is not feasible to
think that information for calibrating the kind of
generalized model described here is available. Some detail
must be sacrificed. So, one is in all practical studies
dealing with a modelling approach that yields an "aggregate"
solution for "an average individual" in particular
circumstances. Such models can (or really must) certainly be
adequate for most purposes for now. One for now must view
the generalized model as a conceptual guide in research
planning. It can be used in deciding where and how simple
models can be used in certain circumstances. However, based
on results presented in TN 11 one must accept that "Phase I"
attractivity and emissiveness estimates based on the Cesario
model are influenced by factors that are highly significant
but which have not been explained by origin or destination
characteristics. Until more is learned, even our best
models cannot be expected to yield really accurate origin
destination flow estimates. Possibly it is time to use the
TN 19 methods for finding structural problems in models so
as to derive better models as soon as possible, rather than
continuing expenditures on poor ones without even
establishing how poor they are.


DISCUSSION: THE UNITY IN THE CORD STUDY 


Summary Overview 


As suggested earlier in this paper, when the importance
of understanding what the Cesario emissiveness coefficient
meant was realized, a path for discovering the unity in CORD
Study research was opened. As work on TN 11 progressed the
following possible elements of Cesario's emissiveness for an
area were recognized in the formulation developed:


1. Its dependence on the general array of
facilities of a given type available to a
person in the area who wants to make a given
kind of trip -
a) depends on how total trips relate to
the configuration of supply for an area
and
b) depends on how total participation at
all facilities is prorated to specific
facilities;


2. Its dependence on the "aggregate" socio-
economic characteristics of people of the area;


3. Its dependence on the unique character of the
area being considered; its dependence on
either a special geographic character or a
special socio-cultural character.


The study of Cesario's attractivity measures for parks
in an attempt to understand what they meant led to other
insights already alluded to in the article:


4. Cesario's park attractiveness need not be
considered a pure measure of how attractive a
park is but, for certain types of trips to a
park, may depend on what is around the park -
a) in terms of similar competing
facilities and
b) in terms of different competing or
"complementary" facilities;


which may either add to, or detract from, the attractiveness
of the park being considered depending on what users in a
given group there want.

Finally, the "origin based" substitutability
considerations introduced show "how", in the context of the
formulation developed in TN 11:


5. The inherent emissiveness of a city, which
really applies to participation in an activity
where there are no other activities to
consider, can be properly dealt with if one
introduces an effect on inherent emissiveness
for an activity "a" of the opportunity to
participate in other activities. It can be
properly dealt with if one introduces an origin
based substitutability-complementarity factor.


It has been repeated numerous times in commenting on
the general logic behind the points reviewed above that one
must be concerned with (1) attractiveness to whom for what
purpose, (2) substitutes for what type of person under what
type of time budget, weather etc. constraints. If one
cannot identify types of people, types of activities, and
tradeoffs between activities for groups for which they are
meaningful, then there is no point in "identifying" them for
behaviourally meaningless aggregates unless this can be
justified for developing an approximate model. The
exception is if a model is built for policy purposes; then
any structure can be specified. It is perfectly valid to
investigate how people would behave if ... or to
investigate the ultimate consequences of encouraging
behaviour that conforms to a given model. Regardless, the
point here is that (1) to (5) above, in this "summary
overview", should be thought of in terms of the behaviour of
a particular person or a particular type of people, if the
statements are to have the specific behavioural implications
which they appear to have.


UNITY AND "ALTERNATIVE FACTOR MEASUREMENT"


However, enough has been said in general overview. It
may not be clear from the terse notes that most of the
interrelations indicated in Figure 1 can be understood in
terms of the points raised about the Cesario model. For
example, to start to see in detail why the connections between
research concerns depicted do occur, consider firstly how
alternative factors, their definition and measurement,
relate to other research areas shown in Figure 1.
Alternative factors in TN 3 have been the focus of theory
development and "considerations of structural problems with
models" work because of the way they were used in TN 1 and
their proposed use in TN 30. In TN 1 questions were raised
about the significance of the alternative factor, but of more
importance regarding "Calibration and Testing of Models" and
analysis and survey methodology is the fact that questions
raised about the CORD Study Park User Survey and the
possibility of calibrating the models proposed in TN 30
resulted in the cancellation of empirical work on the
"Overnight-Use Model".

In TN 1 and later in TN 11 one has clear indication of
the importance of alternative factors in "Futures Research
and Projections". Obviously, as new facilities are created
and some old facilities are phased out, models must change
to reflect the supply configuration with which people are
confronted. Alternative type factors are obviously key
model elements for reflecting this change. This is because
they reflect very directly that when one is formulating
destination models one should really be (or really is)
involved in "Systems Models Development".

A number of other observations point up how alternative
factors keep being noted as important with respect to other
research areas. An origin alternative factor appeared when
CHAPTER X


TN 33 


the generalized Cesario model was formulated in TN 11. Also
in this paper it has been shown that destination alternative
factors should probably be considered if one is to truly
understand what Cesario's estimated attractivities mean. Of
more importance is that in TN 11 a link between Cesario's
emissiveness values and the parameters of an origin model
was specified. This is because the combination of "supply
generates demand" effects, of a prorating of use effect, and
of an alternative type factor (of TN 11) can be seen to
relate to the "effect of supply" or supply measurement
problem exposed in TN 29.


ORIGIN (EMISSIVENESS) MODEL DEVELOPMENT


Though it is not worthwhile to deal with every heading
in Figure 1 and discuss all possible relationships between
headings, a second example may convince the reader of the
degree to which "Model Definition" research should be
integrated. Material in TN 11 on origin emissiveness has
pointed up that alternatives to a given facility at which
one can participate in an activity may influence origin
emissiveness. Actually TN 29 can be viewed as another paper
on how alternative supply and its measurement must be
considered in measuring origin emissiveness. In TN 29
important statistical considerations, and important analysis and
survey methods concerning efficient data collection, model
calibration and survey design, are raised.

TN 35, the appendix to this volume, and the review for
Chapter 7 concern statistical issues of importance in
emissiveness analysis. Other articles deal with more
behaviourally oriented methodological problems with origin
models. They raise difficulties with the method of measuring
origin emissiveness (TN 6, 13, and 20). Still, TN 13 relates
the TN 12 work on applying origin models to "Futures
Research and Projections" while TN 6 answers questions about
the accuracy of estimates of origin emissiveness using the
methods described in TN 12 and TN 13.

TN 10, TN 13, TN 32, TN 37 and some chapter review
material raise important theory, "Calibration and Testing of
Models of Measurement" and normative considerations in
relation to origin emissiveness. Such concerns as the
equity of distribution of the emissiveness to different
groups in the population relate to theory issues in TN 32
and methodological points in TN 10, TN 13, and TN 37. Also the
issue of how facility complementarity-substitutability
affects origin emissiveness is broached.


OTHER MATTERS 

The preceding has concentrated on origin models,
destination models, attractivity, alternative factors, and the
major effect of supply. One may ask: is there any unity in
the CORD Study with respect to some of the other topics that
were dealt with in technical notes? In Figure 1, it is
possible to delineate three lines of consideration that do
not have much to do specifically with the kind of modelling
already mentioned: allocation modelling; design of data
collection; and theory development and theory testing. The
unity of the study with respect to these areas is fairly
obvious. What one has seen in this article is in fact how
the results of one empirical research project are
interrelated with other research projects leading to theory
development. In particular, with respect to research design,
it should be noted that TN 6 with respect to origin models,
and TN 19 with respect to destination models, have presented
formulas which have utility in determining what sample sizes
are necessary if models are to be redeveloped that meet
certain accuracy criteria. In a similar way one can see the
explicit statements in TN 20 on how to improve origin
models, and in TN 35 on how to improve destination models
(actually, comments on improving destination models occur in
a large number of TNs including 3, 7, 40).

And now, starting from the perspective introduced in a
number of places in this volume, it is consistent to
maintain that allocation evaluation models need not be
consistent with behaviour. Pressures computed in Technical
Notes such as TN 17 need in no way reflect real pressures
felt by people. Rather, what the method may be used to do
is to produce a measure of people's relationships to
resources. If they don't feel deprived at all then it may
still be the case that a policy decision will mean that
actions on their "needs" will be taken. Similarly, the
question as to whether the potential surfaces generated by
the methods described in TN 5 really reflect pressures that
could be documented by a study is not necessarily considered
to be a pointed issue in this volume. Unity in theory with
theory or with empirical work is obviously of concern to
some of the authors, as seen in TN 5.


SPECIAL RELATIONSHIPS THAT DESERVE MENTION 


The preceding has been primarily about model
definition, theory and analysis and survey methodology.
Still there has been some mention of normative concerns and
allocation. It is the perspective, introduced in a number
of places in this volume, that it is consistent to maintain
that allocation evaluation models need not be consistent
with behaviour which results in the loose link between
Chapter 8 Technical Notes and other Notes. This is because
one believes that the Technical Notes in these chapters
either (1) allow one to see many problems to be solved to
improve allocation and evaluation models based on behaviour
or (2) force one to recognize that researchers must do much
work with policy makers, managers and planners to define
objectives, to define goal-achievement standards and to
define acceptable ways of assessing what should be in terms
of policy and political perspectives. Specifically some
researchers believe that the "pressures" computed in
Technical Notes such as TN 5 and 17 should reflect real
pressures felt by people. Some believe consumer surplus as
measured in TN 31 and 38 is a "real" measure comparable with
real dollars and produces a measure of people's
relationships to resources. If people don't feel deprived
or are not willing to pay the price consumer surplus
suggests, policy action will still be taken because the
models are accepted and used.

Similar concerns with the relation of models to
behaviour modelling are also clearly of major concern in TN
23, 25 and 26.

In TN 41 there is a dichotomy. This is expressed in
the question: Was the kind of research done a good use of
resources at the time or should the policy issues have been
clarified to indicate if any empirical research should even
be done? Similarly TN 40 raises questions about the
relevance of the economic impact study approach discussed in
TN 39 and elaborated on in this other note (TN 40). Can
policy be empirically derived is the critical question that
underlies different ways of viewing the notes in Chapter 8
in relation to other notes. A methodological question that
raises issues about modelling and policy is whether models
derived on data when one policy is in effect are even
relevant in predicting behaviour under a quite different
policy. The pessimist will see behaviour based models as
primarily relevant to planning based on the status quo.


CONCLUSION 


This article has documented the progress of the
Canadian Outdoor Recreation Demands Study and a unity in it,
at least in terms of progressing toward a goal of being able
to predict people's behaviour and to relate it to policy
oriented models. However, it is fair to say that the
interrelationships that should be considered in predicting
recreation behaviour are frighteningly complex. In this
regard the model implicit in the "generalized framework" for
modelling which has been described is not a model which is
going to be calibrated in the foreseeable future or even
totally defined in the foreseeable future. It is only a
model which serves as a guide to thinking.

From a broad perspective it must be admitted that the
rather substantial results reviewed in this chapter, when
considered in perspective, do not really show that much
progress has been made in modelling or understanding
people's participation in outdoor activities. This chapter
has concentrated on models which are good for
"main-destination", "fairly single purpose" trips. What part
of recreation trips falls into this category? What planning
or management needs for information can be met by such
models? Then there is the effect of substitution, there are
problems of defining user types for which various models are
useful, etc., etc. And, always, there is the lingering doubt:
where the balance should be between using "quick and dirty"
methods with "due consideration" of what can be achieved by
advertising that is sometimes needed and justified in
dealing with the planning and management problems on which
researchers work but which will never develop if each or
most research projects do not contribute to a growing
knowledge.

But even if research methods do develop, possibly the
most important concern must be with how research on
behaviour should relate to policy evaluation and the
development of policy. This is a problem area as indicated
in Chapter 8 and it is barely explored in this paper or in
this volume. The critical consideration is raised in Volume
I and that is that researchers must not be technicians but
interact with planners and managers to develop an
understanding of policies and policy problems so that they
may make their proper contribution of indicating where and
how research can be used and discourage its misuse.


TN 33 CHAPTER X page 866 


CHAPTER XI 


CORD STUDY RESEARCH - FINDINGS AND CONCLUSIONS:
AN INTERPRETATION


J. Beaman, E. Avedon, T. Stewart


Some readers may have expected the final CORD Study
report to indicate national and provincial recreation
resource requirements for the next decade or so. An
objective such as this could never have been reached for a
multiplicity of reasons. In general, these reasons fall into
three major groups: administrative, epistemological, and
data collection. The three final volumes of The CORD Study
attempt to document these reasons.

Volume I in this series indicates some of the problems
inherent in The Study within an administrative context. The
Volume describes the massiveness of the undertaking, and
management problems which both helped and hindered the
conduct of The Study. Volume I also examines the many Study
issues related to the interaction between policy, planning,
and research.

Volume III describes the empirical issues confronted in
The CORD Study. An attempt has been made in Volume III to
illustrate the many problems inherent in collecting and
processing national data. Issues regarding sample design,
questionnaire construction, data collection, computer
processing, and the like, are described in detail, in order
to offer readers the opportunity to learn from the mistakes
made during conduct of The CORD Study.

Volume II, in contrast, is indeed "the heart" of The
CORD Study. Very early in the conduct of The Study it was
recognized that numerous epistemological issues had to be
solved before much "true" CORD Study analysis could take
place. The "deeper" CORD Study researchers became involved
with epistemological issues, the more they found themselves
on the periphery of demand analysis methodology. They found
that they were "forced" to go beyond what was known in order
to attempt to achieve CORD Study objectives.

Thus, although the original study design did not
envision methodological research as the dominant focus of
The CORD Study, the sheer massiveness of this latter
undertaking (of necessity) overshadowed all other aspects.
As a consequence, CORD Study research (in spite of a range
of severe limitations) has made a major technical
contribution, not only to the nation and to the provinces,
but to recreation research in general. Volume II offers
readers a knowledge of what CORD Study researchers have
learned within an epistemological context.

This chapter, then, is an attempt to summarize
epistemological contributions of The CORD Study (those
which seemed particularly important in the research context
of the authors as of 1976) and indicate some of the
questions that are still to be answered through future
research efforts.


DESTINATION MODELLING 


TN 1 established that a systems model could be defined
to explain the day-use of parks. The model is a systems
model because of the manner in which the alternative factor
is defined. Since almost no variance was associated with the
alternative factor or with the attractivity factor, these
factors had no practical importance. (However, it must be
noted that the regression analysis carried out was not
completely valid, and thus the variance explained by the
entire systems model was spuriously high.) This latter issue
is explained in some detail in TN 19 and 35. Nevertheless,
despite the structural problems in the model, some progress
in this aspect of study has been made. The model used for
the analysis presented in TN 7 relied upon the model derived
in TN 13; consequently, there is nothing that can be learned
about the empirical validity of the model presented in TN 7.
A lack of availability of data and better models prompted
the approach presented in TN 7 and also provides insight
into the research needs alluded to in TN 30 and 40.

The potential for formulating a good enroute-overnight
park-use model was illustrated in TN 18. A concern for the
accuracy of potential estimates is illustrated by the
strategy of developing a model for one set of parks and then
using the model to predict the use of another set of parks.
However, it is an open question as to whether additional
effort should have been made to examine alternative
structures for the model. Availability of adequate data was
a major problem in developing the model in the first place,
and this prompted the formulation and testing of only one
model structure. TN 18 perhaps serves as an object lesson
for researchers who try to do more research than is feasible
with the data they have at hand.

Basic parameters characterizing park-use (as indicated
in TN 4) can be derived with a minimum of assumptions about
how characteristics of origins of park users influence the
emissiveness of the origin, and about how attractiveness of
parks should be defined. Fortunately, the original research
plan for TN 4 was altered, since the analysis proposed was
not feasible with the type of data under consideration. In
this instance, reliable alternative data were obtained;
however, certain key data on visitor types, length of stay,
weekend or weekday use, and the like, were not available,
indicating once again the need to carefully plan data
collection. The results obtained should not be construed as
an indication that a "good" model had been developed. (See
TN 35 and 11.) Much of what one might think can be learned
about participation at various destinations is not to be
found in the attractiveness or emissiveness coefficients
computed. Comments regarding structural shortcomings in this
model are presented in TN 11 and 33. These comments also
foreshadow the difficulties inherent in generalizing a
model.

In order that the reader not be misled about
application of TN 4 results, TN 11 attempts to make clear
that emissiveness values should not be expected to be a
"fundamental" parameter of a city. These parameters almost
certainly depend upon the aggregate demographic
characteristics of the city's population, and to a greater
degree, reflect a city's relationship to its supply of
resources for recreation. TN 9, as well as TN 33, indicate
that there is good reason to believe that attractiveness
measures are deceptive, in that they are not simply a
measure of the attractiveness of a particular park.
Seemingly, other parks around a given park influence the
attractivity of the park in question. Attractiveness and
emissiveness parameters cannot be computed for the use of a
new park using TN 4 methods (see footnote 1). A model for
which parameters can be estimated is presented in TN 11 and
it is predicated upon reasonably plausible behavioural
assumptions. As a consequence, even though both the TN 4
models are structurally compatible, the model presented in
TN 11 can theoretically result in more than parameter
estimates. This is because the theoretical formulation
presented in TN 11 indicates how park-use varies with the
supply for a given model. Extending this approach (as in
TN 33) can permit the development of more structurally sound
models. Unfortunately, although several variations of the
model could have been formulated, this did not occur.

A useful perspective for modelling behaviour is
presented in TN 8. Researchers often employ one model to
explain park visits on both weekdays and weekends. There is
no reason to believe that weekday park-use should be
"explained" by the same model which is used to "explain"
weekend behaviour. This is also true for different types of
park-users. TN 8 offers suggestions for the kinds of
functions to be derived for these different types. (TN 40
comments on this latter point.) Confirmation of parameter
estimates obtained using the "ordinary least-squares
regression approach" is discussed in TN 19. This approach is
not statistically efficient. By using the regression
approach proposed in TN 19, better estimates of parameters
can usually be obtained with only half the data that are
required for the former approach.

1. If a new park will not be close to other park sites, and
will have little influence on total park use by residents of
various cities, then it may be assumed that no city's
emissiveness will be substantially influenced by the
inclusion of the new site in the existing park system.

Many of the insights gained from the analysis presented
in TN 35 are similar to those reported in TN 19. The
simulation approach pursued in TN 35 provides another way of
testing structural adequacy of models - a method which some
researchers prefer over the method discussed in TN 19. The
important point, however, in TN 35 is that accuracy criteria
appropriate to the problem being considered must be
developed in order to derive "good" models. Overall X² and
individual residual X² are extremely important in assessing
model structural adequacy. The need to use X² values to
measure the validity of a model must be considered as a
major, important and unexpected result of the derivation of
a distribution of observed flows. The use of both GLS
regression and "residual testing of model adequacy" is a
very significant advance in modelling. Residuals for
individuals mean little in origin modelling because of the
large variance between observations, and the expected or
probable behaviour is estimated. In origin-destination flow
modelling a given flow may be accurate to plus or minus 1%
if the flow is large.
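
The residual testing described above can be illustrated with a
minimal sketch. It assumes that the X² values referred to are
Pearson chi-square contributions, (observed - predicted)² /
predicted, computed per origin-destination flow and then
summed; the Technical Notes may define the statistic
differently. The function name and the flow values are
hypothetical and are not taken from TN 19 or TN 35.

```python
def chi_square_residuals(observed, predicted):
    """Return (per-flow chi-square contributions, overall chi-square).

    Assumes the Pearson form (O - E)^2 / E for each flow; this is an
    illustrative choice, not necessarily the form used in the Notes.
    """
    contributions = [
        (o - p) ** 2 / p for o, p in zip(observed, predicted)
    ]
    return contributions, sum(contributions)


# Hypothetical observed and model-predicted flows (visits) on four
# origin-destination pairs.
observed = [120, 45, 10, 300]
predicted = [110.0, 50.0, 12.0, 310.0]

parts, total = chi_square_residuals(observed, predicted)
for flow, part in enumerate(parts):
    print(f"flow {flow}: residual chi-square = {part:.3f}")
print(f"overall chi-square = {total:.3f}")
```

Inspecting the individual contributions, rather than only the
total, shows which flows a model structure fails to explain.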

The use of arbitrary assumptions to describe (a priori)
how people respond to alternative park sites is discussed in
TN 3. Such assumptions may lead to statements about
behaviour which are illogical. TN 3 (originally prepared to
further theoretical development of the model presented in
TN 1) makes clear the importance of alternative factors in
assessing how people respond to the supply of parks. It
relates nicely to a variety of behavioural considerations
which must be considered to construct "good" behavioural
models which are discussed in TN 33. Still, one should note
that elaborate models that tap aspects of behaviour, such as
substituting one activity for another, are not empirically
validated at this time.

In a number of the CORD Study efforts, data problems
caused cancellation of planned research. TN 30 reports one
of these cancelled efforts. As a consequence of formulating
a work plan, it became apparent that there was little
likelihood that the study objectives could be achieved. TN
30 is an example of good research practice, in that a
research design should be developed to the point at which an
assessment of potential success can be made. If this
assessment reveals that study objectives are unachievable,
then research should be terminated. In addition to this
"lesson", TN 30 also serves another function in that it
brought together a number of inventive proposals for model
development. These proposals have implications and
interesting parallels with the formulations presented in TN
11. The proposals presented in TN 11 were independent of TN
30 - they were based on considerations raised by research
questions reported in TN 4.

Thus, a number of substantive methodological research
projects in the CORD Study have been concerned with
destination modelling. Many of these indicate that much of
the current work on destination modelling results in models
which do not explain data as one might expect. One major
reason is the fact that often these models are developed
from unreliable data. Another major factor is the lack of
adequate consideration of the disaggregation of user groups
into groups for which meaningful models can be developed. A
serious question which must also be considered is that when
use estimates are made for a given point in time, how can
these estimates be extrapolated for future policy and
planning? In reality, considering the various methodological
problems and the special difficulties associated with the
use of destination models in planning and policy analysis,
it is apparent that progress made in this area is quite
minimal.


ATTRACTIVITY ANALYSIS 


How people respond psychologically to a landscape
obviously does not have a simple relationship to estimating
the number of visits that people will make to a particular
park. The matter of park attractivity, and wild river site
attractivity, have been examined as separate issues.

A consumer preference approach can be used to derive
park attractiveness measures (TN 2). However, there are a
number of methodological problems associated with this
approach as a consequence of limited and poor data. From a
planning perspective, the ordinal attractivity scale has
only limited merit in making use estimates for parks. TN 9
examines the question of what really is measured by
different types of attractiveness measures, and which of
three types under consideration are compatible. One finding
indicates that users respond not only to a park, but also to
what is around it. TN 28 is concerned with another matter,
namely, testing the "goodness" of attractiveness measures.
There is controversy about this Note, however (see Chapter
III, Review), because it was indicated that when two
different measures, based upon different assumptions, are
put into the same model, comparable explanations should not
be expected.

Turning to site quality, the assessment of the quality
of a site on a wild river cannot be explained by a linear
combination of factors (TN 27). From a practical point of
view, park design models which only specify the relative
importance of variables, and then just sum or multiply, are
quantitatively and conceptually unsound. As indicated in TN
27, people usually consider several resources at one time.
The available "computer attractiveness evaluation packages"
do not allow for multiple-consideration of resource
variables. Furthermore, even if computer programmes
included such an approach, there is still need for a valid,
empirically based model. Effective use of computers to
generate maps of people's reactions to resources requires
extremely sophisticated research work - work which has not
been done at this time.

Thus, examining the more general questions concerning
attractiveness, such as how people respond to trails, or
lakes, or scenic byways, and the like (see TN 22), has
obvious research implications. Such complexities as a
response at a given time with respect to a given experience,
or how the repetition of the experience may diminish the
value of individual site qualities, creating a different
response at different times, or how a variety of experiences
may enhance total experience and thus alter response, are
all research questions to be pursued.


SUPPLY ANALYSIS 


The use of multiple regression to calculate
"consumption" of recreation which is conditioned by
socio-economic characteristics is illustrated in TN 12. TN 6
demonstrates how the TN 12 calculation formula for making
estimates is of such form that accuracy of estimates can be
derived. The merits of using the TN 12 methodology to make
predictions must be viewed in light of the accuracy
considerations offered in TN 6.

However, in this volume, it is also pointed out that
the conceptual structural problems indicated in TN 13, and
the model structural problems pursued in TN 20, must be
recognized.

TN 20 indicates that the variance in participation data
is not adequately explained by use of the TN 12 model. TN 20
was designed to determine interaction effects which could be
introduced to improve the TN 12 model. It was demonstrated
that some significant effects could be incorporated into
ANOVA models. However, when an AID analysis was compared to
an ANOVA analysis, it was found that an ANOVA model only
explained about half of the variance it should have. Thus
including interaction effects does not improve simple ANOVA
models. The preceding means that accuracy estimation methods
used in TN 6 can be used for models, but when interaction
terms or supply factors are not properly considered or are
not included in models, TN 6 formulae (at best) seriously
underestimate the error to be expected.

A number of considerations are important in defining
the quantity of supply. Supply measurement is discussed in
TN 16, where the concern is with the measurement of how much
supply is actually at each of a number of locations, and not
with perceived supply. It must be recognized that the manner
in which supply measurements are taken in one political
jurisdiction may not be germane in another political
jurisdiction. In addition, the real supply may not have a
simple direct effect on use levels. It was pointed out in
TN 29 as well as TN 34 that there have been studies in which
researchers specified supply factors, and introduced these
as additional independent variables when deriving an origin
model. However, it has long been recognized that response to
supply is non-linear (see TN 11).

One important innovation, nevertheless, presented in TN
29, was that the effects of supply in each of the origin
areas could be defined by what people do in these areas.
The problem, however, is in determining how inventoried
supply relates to the behaviourally based supply
coefficients. There is a need to consider programming of
facilities, cultural factors, and other factors to relate
model parameters to real supply and how it is used.
Deriving an equation relating real supply to perceived
supply is not a straightforward matter (see TN 10 and 37).
To do this there is a need to conduct supply research using
very large data sets for model development.


METHODOLOGICAL CONSIDERATIONS 


Methodological considerations not covered elsewhere
under other headings are discussed in this section. To
provide an illustration of how an analysis of variance
(ANOVA) model can be used, TN 15 was prepared. It was thought
that some readers might find the example helpful in
examining the use of ANOVA model parameters in a number of
the Technical Notes. Another area of methodological concern
was data weighting. The procedures necessary to define
appropriate weights for data collection involve a number of
issues. TN 21 explores many of these issues; however, there
is considerable room for development of new and more
innovative weighting strategies. The material presented in
TN 21 should not be considered optimal.

The processing of geographical information was yet
another major area of methodological consideration. Many
people frequently misuse computer systems in handling
geographic information. There are numerous alternative
methods of processing geographical information. However,
often vast amounts of information are coded without regard
to the ultimate research objective. Another major factor
often ignored in computerized geographic data processing
relates to the actual cost of manipulating information. In
addition, there is a tendency, because of the range of
computer assistance available today, for a number of
researchers to code and process much more data than is
actually necessary - just because the computer resources are
available. Indeed this is a poor use of resources.

The issue of data compatibility is discussed in TN 24.
It was learned that certain data which should be compatible
with other data were not. This issue should have been
pursued more fully in the CORD Study. The Note raises
serious questions about the validity and reliability of
data.

In a fairly unbiased manner, TN 10 explores a rather
interesting methodological consideration. Using the same
data concerning participation in outdoor recreation
activities, researchers conducted an R-mode Factor Analysis
of the data, and a Cluster Analysis. Unfortunately, results
of these two techniques should not be (but often are)
compared. "R" Factor Analysis groups activities, and Cluster
Analysis groups like individuals. A Q-mode Factor Analysis
could have been used for this comparison, but wasn't, since
the point was to demonstrate why R-mode Factor Analysis and
Cluster Analysis differ. Similar methodological concerns are
examined in TN 13, 32, and 37.
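
The distinction drawn above can be made concrete with a small
sketch. It does not reproduce the TN 10 analysis; it only shows
that an R-mode analysis starts from relationships between
activities (the columns of the data), while a clustering of
individuals starts from relationships between respondents (the
rows). The participation matrix, the activity labels, and the
function names are hypothetical.

```python
from math import sqrt

# Hypothetical participation matrix: one row per individual, one
# column per activity (camping, hiking, swimming); 1 = participates.
participation = [
    [1, 1, 0],
    [1, 1, 0],
    [0, 0, 1],
    [0, 1, 1],
]


def pearson(u, v):
    """Pearson correlation between two equal-length sequences."""
    mean_u, mean_v = sum(u) / len(u), sum(v) / len(v)
    cov = sum((a - mean_u) * (b - mean_v) for a, b in zip(u, v))
    sd_u = sqrt(sum((a - mean_u) ** 2 for a in u))
    sd_v = sqrt(sum((b - mean_v) ** 2 for b in v))
    return cov / (sd_u * sd_v)


def euclidean(u, v):
    """Euclidean distance between two equal-length sequences."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


# R-mode input: activity-by-activity correlations (columns compared).
activities = [list(col) for col in zip(*participation)]
activity_corr = [[pearson(a, b) for b in activities] for a in activities]

# Cluster-analysis input: person-by-person distances (rows compared).
person_dist = [
    [euclidean(p, q) for q in participation] for p in participation
]
```

The first matrix is 3 x 3 (activities) and the second is 4 x 4
(individuals), so the two techniques answer different questions
even when fed the same data.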

Thus, The CORD Study has explored a number of
methodological considerations which are not only of concern
to outdoor recreation research, but are germane to all
social and behavioural research. Findings indicate that it
is important that similar methodological research be
continued and considerable information exchange be
encouraged. This latter point is particularly true with
respect to the many fields of study which have the same
methodological concerns.


ALLOCATION MODELLING AND ALLOCATION EVALUATION 


Prior to 1974, there was little available information
on the economic value of park visits. There is need for
researchers to develop this kind of information in a form
acceptable to many economists, and also in a form that is
meaningful for policy makers. TN 31 makes a unique and
potentially controversial contribution by implying that a
systems model (such as the one defined in TN 1) can be used
to define a demand function for non-isolated sites. This is
the type of research required so that economists and policy
makers might have some understanding of the economic effect
of changing a system, i.e., what happens to demand under
new supply conditions.

Another issue considered in TN 31 is the estimation of
per capita consumer surplus value. However, a special
"easy, fast, and cheap" means is also described in TN 38.
Many may argue that this approach in TN 38 is not
defensible. Special attention should be paid to the listing
of consumer surplus values presented in the Appendix to TN
38, since this listing is the most extensive and diverse
listing for Canada which is now available.

The concept of potential surfaces (both a sociologic
and geographic concept) may be used to define measures for
evaluating the distribution of opportunities to participate
in outdoor recreation activities. Two approaches to
measurement with respect to this concept are presented in TN
5, and in some respects, one can be said to be better than
the other. The question of what distance function to use
when certain gravity functions are specified remains
unanswered. An interesting subsequent project would involve
relating potential measures to some statement about
perceived supply or to compare views about needs derived by
the "Potential Surface" method with people's perception of
their needs.
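
As an illustration only, a potential measure of the gravity
type can be sketched as the sum, over supply points, of supply
divided by distance raised to a power. Neither of the two TN 5
formulations is reproduced here. The inverse-power distance
function and its exponent are assumptions (the text notes that
the choice of distance function remains unanswered), and the
function name and data are hypothetical.

```python
def potential(supplies, distances, exponent=2.0):
    """Gravity-type potential at one location.

    supplies  -- amount of opportunity at each supply point
    distances -- distance from the location to each supply point
    exponent  -- assumed distance-decay power (illustrative only)
    """
    return sum(s / d ** exponent for s, d in zip(supplies, distances))


# Hypothetical location with two supply points: 100 units at
# distance 10 and 50 units at distance 5.
supplies = [100.0, 50.0]
distances = [10.0, 5.0]

steep = potential(supplies, distances, exponent=2.0)    # 1.0 + 2.0
gentle = potential(supplies, distances, exponent=1.0)   # 10.0 + 10.0
print(steep, gentle)
```

Evaluating the same location under two exponents shows how
strongly the resulting surface depends on the distance
function chosen.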

TN 17 offers an excellent example of the kind of issues
alluded to in the comments on TN 5. If the policy decision
is to provide opportunities for all people to have about the
same per capita access to picnic sites within two hours'
drive of their homes, then there is no need to know how
people behave, but rather there is need to have an inventory
of the available opportunities for picnics. Such
opportunities must be related to the population that will
use them and to the distance from opportunities to the
users. TN 17 provides a way of testing the goodness of any
allocation. If certain access criteria are accepted, then TN
7 indicates how allocation evaluation measures can be
defined by supply information. With the information produced
from an "evaluation run" and information on expected
population growth, facility development plans may be
proposed based upon a knowledge of the "consequences" of
changes in supply. This is accomplished by repeated "runs"
of the model. Thus the model may be used to develop a
"capital plan" for increasing supply - a plan that is
consistent with budget forecasts over a period of time.
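
A minimal sketch of an "evaluation run" of this kind follows.
It is not the TN 17 procedure, which is not given in this
chapter. It assumes a simple two-step ratio: each site's
supply is divided by the population able to reach it within
the travel limit, and each origin is then credited with the
ratios of the sites it can reach. All names and figures are
hypothetical.

```python
def per_capita_access(populations, supplies, travel_hours, max_hours=2.0):
    """Per capita access to supply for each origin.

    populations  -- population of each origin
    supplies     -- picnic sites (units of supply) at each location
    travel_hours -- travel_hours[i][j] = hours from origin i to site j
    max_hours    -- access criterion (two hours' drive in the text)
    """
    # Step 1: supply per person among those able to reach each site.
    ratios = []
    for j, supply in enumerate(supplies):
        reach = sum(
            pop
            for i, pop in enumerate(populations)
            if travel_hours[i][j] <= max_hours
        )
        ratios.append(supply / reach if reach else 0.0)
    # Step 2: each origin accumulates the ratios of reachable sites.
    return [
        sum(r for j, r in enumerate(ratios) if travel_hours[i][j] <= max_hours)
        for i in range(len(populations))
    ]


# Hypothetical system: two origins, two picnic areas.
populations = [1000, 3000]
supplies = [40, 60]
travel_hours = [
    [1.0, 3.0],  # origin 0 reaches only site 0 within two hours
    [1.5, 0.5],  # origin 1 reaches both sites
]

access = per_capita_access(populations, supplies, travel_hours)
print(access)  # unequal values flag an uneven allocation
```

Repeating the run with proposed additions to supplies, or with
projected populations, gives the kind of "consequences"
comparison described above.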

Concerns about the optimal use of land are examined in
TN 23. One approach to defining optimal use is linear
programming. However, the question is never really "should
there be a park or timber production" - creation of parks
rarely has a simple or single objective. For example, if the
concern is employment, the computer might still select a
capital-intensive solution. A solution to the land
allocation problem might in fact take agricultural land out
of production when it might be more valuable at a later time
for agriculture. Current automatic evaluation and allocation
procedures fail to take into account the multi-faceted
aspects of determining what may be "optimal" in a given
socio-political situation.

Another issue concerning allocation models deals with
behavioural data. TN 26 offers a method for using
behavioural data in a disaggregated manner, for example,
examining different age groups, or differences between males
and females, rather than looking at crude participation
rates for a "population". However, if a policy maker doesn't
plan to make use of such information in planning or
management action, such analyses are meaningless, as the
Note stresses.

In addition, sometimes data used for policy decisions
are not quantitative. TN 25 indicates that it is possible to
use intuition and judgement to scale qualitative data. The
Note indicates that both quantitative and qualitative data
should be used; however, controls should be built in to
prevent qualitative variables from dominating and thus
making PPBS or similar management systems meaningless.

A focused review of traditional economic impact
literature appears in TN 39. The Note leaves much to be
desired because the literature is so extensive, and the
issues are quite subtle. The Note should be considered as
introductory to "the state of the art" as of the late
1960's. In contrast, TN 40 offers a perspective on what
influences the development of a park. The Note points out
problems of taking into account that developments almost
never proceed according to a proposed "master plan",
particularly with respect to park acquisition negotiations.
The question of impact assessment requires examination of a
variety of exogenous and endogenous variables. Finally,
in the matter of the success of campgrounds, TN 41 must be
considered a "first" guide for program managers and
planners. The economics of campground development must
always be viewed within the context of the objectives of the
proposed management.

Thus, it becomes obvious in this section that further
research in the area of allocation modelling and allocation
evaluation is extremely necessary.


QUO VADIS ? 


If any conclusion can be drawn from the myriad avenues
of research that have been reported in this volume, it is
that future research to improve models must not be pursued
in a random or haphazard manner. Researchers must not accept
only the priorities of policy and planning personnel, but
must establish their own priorities. Some work must be done
ahead if results are to be produced at the time when
requests may be expected. Furthermore, policy and planning
personnel must understand that marginally relevant research
can sometimes pay high dividends, and thus must permit
researchers the time and the latitude to pursue the more
esoteric lines of inquiry opened by the CORD Study.

While many readers of this volume may feel that some of
the material presented is quite relevant to their concerns,
they do not understand how to use or adopt the methods
presented to meet their needs. Interpretative material on
how to use various methods developed by the CORD Study needs
to be issued for practitioners. Practitioners have need of
methodological guidelines and exemplar materials. Manuals
for carrying out research and for deriving the implication
of research findings for management must be conceptualized
as "useable tools", if they are to be anything other than
academic exercises. There are a limited number of
researchers who have the background required to make use of
the material presented in this volume in its present form.
In fact, there is a limited number of source persons who
have drawn together the various contents of the Technical
Notes. Still, this volume now may serve as text to increase
the knowledge base of future outdoor recreation researchers.


APPENDIX A


REGRESSION ANALYSIS WITH DICHOTOMOUS DEPENDENT VARIABLES 


V.K. Smith and C. Cicchetti


INTRODUCTION 


An increasing volume of social science survey research
has required the use of dichotomous dependent variables to
estimate linear probability models. Linear probability
functions may also be estimated from aggregate time series
data where the dependent variables are not dichotomous (see
Reference 10). As a consequence there has been considerable
interest generated in the interpretation and the properties
of statistics estimated with these models (see Reference 12,
13, 18). Those texts which discuss estimators for such
models (see Reference 7, 8, 17) present three types of
estimation procedures. The first category includes the
probit-like techniques (e.g., probit, logit, tobit). While
there is general agreement, on theoretical grounds, that
these techniques are most desirable, they are infrequently
used in practice. Cost considerations have been cited as a
primary reason. A number of researchers (see Reference 2,
6) have pointed to the computational costs of these methods
with large data sets as primary obstacles to their
application. In the context of the CORD Study, cost and
other considerations are commented on in several Notes as
well as in the review of Chapter VII. The second approach
requires the use of a mathematical programming technique.
It is derived by selecting parameter estimates so as to
minimize the sum of squared residuals subject to inequality
constraints upon the model's predictions. Judge and Takayama
(see Reference 9) have noted that minimization of the sum of
squared residuals subject to 0 ≤ X(i) ≤ 1 becomes a
quadratic programming problem where the covariance matrix of
coefficient estimates is not clear. Once again these
techniques have not been employed in many applications (e.g.,
see Reference 10). One reason stems from a general
inability to derive a covariance matrix for the estimated
coefficients (see Reference 9).

The other set of techniques involves applying ordinary 
least squares (OLS) and generalized least squares (GLS) 
procedures directly to the data. These estimators (OLS and 
GLS) have been used in the majority of applied studies with 
dichotomous dependent variables. While their asymptotic 
properties are known, it is probably fair to say that there 
has been no evaluation of their sampling performance. 
Moreover, there is very little information available to 
discriminate between OLS and GLS procedures. 

The purpose of this paper is to present the results of 
a series of Monte Carlo experiments with models containing 
dichotomous dependent variables. This study is concerned 
only with OLS and GLS estimators. The evaluation of their 
performance involves consideration of both estimated 
coefficients and their estimated covariance matrix. In 
addition the power of Student t-tests is discussed. 

There are several reasons for the restriction in scope 
of the paper. However, the primary reason stems from the 
desire to evaluate methods which are used in practice for 
research with dichotomous variables and thereby serve to 
improve the process of selection between them. Moreover, 
the popularity of these techniques without explicit 
knowledge of their small sample properties (see Reference 
13) reinforces this consideration. The recent work by Cragg 
examines some of these issues (see Reference 4). 


MODEL 


Goldberger provides a clearcut statement of the 
dichotomous dependent variable linear probability model in 
his text. Equation 1 is a statement of a linear probability 
function of the kind of concern here. The values of Y(i) 
are restricted to zero or unity. B is a K by 1 parameter 
vector. X(i), the i-th row of X, is a 1 by K vector of 
exogenous variables, and U is the error vector with 
elements U(i). 


(1)   Y = XB + U 


In order to satisfy one assumption of the general 
linear model, E(U(i)) = 0, Goldberger derived a prerequisite 
distribution for U(i). He indicated that U(i) be a 
Bernoulli random variable for which the distribution is: 


(2)   U(i)'s VALUE      PROBABILITY OF THAT VALUE 

      1-X(i)B           X(i)B 
      -X(i)B            1-X(i)B 


This specification implies that U is heteroscedastic. 
Accordingly, Aitken estimation of the model has been 
suggested. The variance of U(i) is given in Equation 3: 


(3)   E(U(i)^2) = X(i)B (1-X(i)B) = E(Y(i)) (1-E(Y(i))) 
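Equations 2 and 3 can be checked numerically. The following is a minimal sketch, not part of the original study: the function name is hypothetical, and p stands for the value of X(i)B at a single observation.

```python
def disturbance_moments(p):
    """Mean and variance of U(i) when X(i)B = p.

    U(i) takes the value 1 - p with probability p and the value -p
    with probability 1 - p, as laid out in Equation 2.
    """
    values = [1.0 - p, -p]
    probs = [p, 1.0 - p]
    mean = sum(v * q for v, q in zip(values, probs))
    second_moment = sum(v * v * q for v, q in zip(values, probs))
    return mean, second_moment - mean * mean


# The mean is zero and the variance equals p(1 - p) for every p,
# so the variance changes with X(i)B (heteroscedasticity).
for p in (0.1, 0.5, 0.9):
    mean, variance = disturbance_moments(p)
    print(p, round(mean, 12), round(variance, 12), round(p * (1.0 - p), 12))
```

The variance is largest at p = .5 and falls toward zero as p approaches either end of the unit interval.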


McGillivray (see Reference 11) has recently 
demonstrated that EVAR(i) = y(i)(1-y(i)), where y(i) is the 
OLS prediction, provides a consistent estimator of E(Y(i)) 
(1-E(Y(i))). Moreover the Aitken estimates which result are 
best asymptotically normal (BAN). This estimate of the 
disturbance covariance matrix required for GLS estimation 
uses y(i)(1-y(i)) as the diagonal elements. There is, 
however, a significant problem with this procedure, as 
Goldberger has noted: "...there may be x(i)'s for which y(i) 
is outside the unit interval." In such cases, the estimates 
of the variance EVAR(i) are inadmissible and there is no 
clearly defined procedure for the application of the GLS 
technique. 
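The two-step procedure can be sketched as follows. This is an illustration under stated assumptions rather than the authors' program: the function names and the small data set are hypothetical, and the sketch simply stops when an OLS prediction falls outside the unit interval, since no rule for that case has been fixed at this point.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def weighted_ls(rows, y, w):
    """Minimise the weighted sum of squared residuals via the normal equations."""
    k = len(rows[0])
    xtwx = [[sum(wi * r[a] * r[c] for r, wi in zip(rows, w)) for c in range(k)]
            for a in range(k)]
    xtwy = [sum(wi * r[a] * yi for r, yi, wi in zip(rows, y, w))
            for a in range(k)]
    return solve(xtwx, xtwy)


def two_step_gls(rows, y):
    """Step 1: OLS. Step 2: Aitken estimation with weights 1 / EVAR(i)."""
    b_ols = weighted_ls(rows, y, [1.0] * len(y))
    fitted = [sum(bj * xj for bj, xj in zip(b_ols, r)) for r in rows]
    if any(f <= 0.0 or f >= 1.0 for f in fitted):
        raise ValueError("inadmissible EVAR(i): OLS prediction outside (0, 1)")
    weights = [1.0 / (f * (1.0 - f)) for f in fitted]
    return b_ols, weighted_ls(rows, y, weights)


# Hypothetical data: an intercept and one dichotomous regressor.
rows = [[1.0, 0.0]] * 4 + [[1.0, 1.0]] * 4
y = [0, 0, 0, 1, 0, 1, 0, 1]
b_ols, b_gls = two_step_gls(rows, y)
print(b_ols, b_gls)
```

With a single dichotomous regressor the fitted values are the two group proportions, so OLS and GLS coincide. Differences between the estimators can arise only when the model is not saturated in this way, for example when a continuous regressor is present.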

Accordingly, three schemes for dealing with sample 
points which result in such inadmissible estimates are 
examined. Figure 1 shows a plot of the weighting scheme 
EVAR(i) for sample points for which predictions of their 
Y(i) lie within the unit interval. The primary question of 
concern here, however, is what weight to assign to 
observations for which the y(i)'s are outside the interval. One 
weighting technique gives outliers a weight which 
corresponds to the smallest weight given to the observations 
within the 0 to 1 interval (this weight corresponds to the 
weight given to y(i) = .5). This estimator is designated 
GLS1. This weighting function corresponds to line segments 
AB and CD in Figure 1 and implies a weight of 1/4 for those 
observations for which y(i) falls outside the unit interval. 
This scheme implies that the information embodied in these 
observations is small. Alternatively, outliers (see 
Reference 5) may be weighted in a manner (GLS2) which 
reduces the information content of the observations as a 
function of the extent to which the estimated y(i) is 
outside the 0 to 1 interval. One method which accomplishes 
this is indicated by the weight curves EF and GH in Figure 
1. These curves are 1 divided by the absolute value of 
y(i)*(1-y(i)). Finally, observations outside the unit 
interval might be given larger weights in order to improve 
the likelihood that the (Aitken) estimated probability 
function will not predict with sample points outside the 0 
to 1 interval. As representative of such cases a weight was 
chosen corresponding to y(i)=.98, whenever y(i) was actually 
less than zero or greater than one, and designated GLS3 
weights. In Figure 1, the weighting function corresponding 
to GLS3 is shown by IJ and KL. 
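The three schemes can be written as one function of the OLS prediction. This is a sketch under stated assumptions: the function names are hypothetical, and the "weight of 1/4" given for GLS1 is read here as the variance estimate EVAR(i) = .5(1-.5), so that the inverse-variance weight is 4, the smallest weight found inside the interval. The treatment of predictions exactly equal to 0 or 1 is not stated in the text; the sketch treats them as outside the interval.

```python
def evar(y_hat, scheme):
    """Variance estimate assigned to an observation with OLS prediction y_hat."""
    if 0.0 < y_hat < 1.0:
        return y_hat * (1.0 - y_hat)           # admissible case, all schemes
    if scheme == "GLS1":
        return 0.25                            # value of EVAR at y = .5
    if scheme == "GLS2":
        return abs(y_hat * (1.0 - y_hat))      # grows as y moves further outside
    if scheme == "GLS3":
        return 0.98 * (1.0 - 0.98)             # value of EVAR at y = .98
    raise ValueError("unknown scheme: " + scheme)


def weight(y_hat, scheme):
    """Inverse-variance weight used in the second (Aitken) step."""
    # Assumption: GLS2 is undefined at y_hat exactly 0 or 1 (division by zero).
    return 1.0 / evar(y_hat, scheme)


for y_hat in (0.5, 0.9, 1.2, -0.3):
    print(y_hat, [round(weight(y_hat, s), 3) for s in ("GLS1", "GLS2", "GLS3")])
```

Inside the unit interval the three schemes agree. Outside it, GLS1 gives a constant small weight, GLS2 a weight that declines with distance from the interval, and GLS3 a constant large weight.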

In summary, the three weighting functions and their 
corresponding generalized least squares estimators are 
presented in Table 1. 


DESIGN OF THE EXPERIMENTS 


A total of twelve experiments each with fifty samples 
have been conducted to compare the performances of OLS and 
the three GLS estimators described above. There are three 
factors which distinguish these experiments: model, 
conditional distributions of U(i), and sample size. Three 
conditional distributions for U(i) are used with two 
different models. Each experiment is conducted with samples 
containing twenty and one hundred observations. The general 
form of the model used to generate samples is defined by 
Equation 5: 


(5)   Y(i) = B(0) + B(1)X(1,i) + B(2)X(2,i) + U(i) 


TABLE 1: ALTERNATIVE GLS ESTIMATORS FUNCTIONS AS GIVEN IN FIGURE 1 

          For y(i) LT 0    For 0 LT y(i) LT 1    For y(i) GT 1 

GLS1      AB               1/EVAR(i)             CD 
GLS2      EF               1/EVAR(i)             GH 
GLS3      IJ               1/EVAR(i)             KL 


TABLE 2: SUMMARY OF EXPERIMENT'S COEFFICIENT STRUCTURES 

Model and         Approx. Proportions      Max P(Y=1) 
Experiment        of Y's Equal to 1 

This estimate is derived recognizing that the probability Y(i) is 
equal to one is equal to the sum of B(j)X(j,i), where the sum is for 
j = zero to two. Substituting the mean values for independent variables 
one obtains the approximate proportion as reported above. 


The two models have different specifications for the 
regressors, X(1,i) and X(2,i). In Model A there is a 
dichotomous independent variable, X(1,i), and a continuous 
independent variable X(2,i). In Model B both independent 
variables are dichotomous. The values of the independent 
variables (for each sample size) were not changed across or 
within the different experiments for each model. 

They were obtained by generating twenty random numbers 
for each variable in the case of samples of size twenty and 
one hundred for the larger samples. The variable X(1,i) came 
from a discrete distribution and had values of zero and 
unity with P(X(1,i)=1)=.5. For model A, X(2,i) came from a 
normal population, N(2;1i°eQ). For model B, X(2,i) was 
discrete with values of zero and unity, with P(X(2,i)=1)=.2. 
The variables for both models were independently generated. 
In Table 2, the coefficient structures used for each 
experiment are presented, and the effect upon the 
distribution of the dependent variable, Y(i), is also 
"described" in each case. 

The distribution of U(i) was altered by changing the 
intercept of the equations for each model. This change, 
with the same values for the independent variables, 
respecifies both the magnitude of the errors and their 
probability distributions. 
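The sampling design for Model B can be sketched as follows. The coefficient values, the seed, and the function names are hypothetical and are not those of Table 2. The probabilities .5 and .2 for the two dichotomous regressors are taken as assumptions here. Model A is not sketched, because the parameters of its normal regressor are not legible in this copy.

```python
import random


def draw_regressors(n, rng):
    """Fixed design for Model B: two independent 0/1 regressors."""
    x1 = [1 if rng.random() < 0.5 else 0 for _ in range(n)]
    x2 = [1 if rng.random() < 0.2 else 0 for _ in range(n)]
    return x1, x2


def draw_sample(x1, x2, beta, rng):
    """Draw Y(i) as 0/1 with P(Y(i)=1) = B(0) + B(1)X(1,i) + B(2)X(2,i)."""
    ys = []
    for a, b in zip(x1, x2):
        p = beta[0] + beta[1] * a + beta[2] * b
        if not 0.0 <= p <= 1.0:
            raise ValueError("coefficients imply a probability outside [0, 1]")
        ys.append(1 if rng.random() < p else 0)
    return ys


rng = random.Random(1975)            # arbitrary seed, for repeatability only
beta = (0.2, 0.3, 0.4)               # hypothetical B(0), B(1), B(2)
x1, x2 = draw_regressors(20, rng)    # regressors are drawn once and held fixed
samples = [draw_sample(x1, x2, beta, rng) for _ in range(50)]
print(sum(sum(s) for s in samples) / (50 * 20.0))   # overall proportion of ones
```

Changing only beta[0] while keeping x1 and x2 fixed shifts every P(Y(i)=1), and with it the two values and the probabilities of each U(i), which is the sense in which the intercept respecifies the error distribution.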


PERFORMANCE OF THE ESTIMATORS 
IN OBTAINING COEFFICIENT ESTIMATES 


Two types of measures of performance for OLS, GLS1, 
GLS2 and GLS3 estimators are reported for each model. They 
are parametric and non-parametric statistics. In the actual 
analysis, results were produced for each coefficient in each 
experiment for both models. Sample mean, root mean square 
error (RMSE), bias, and variance for coefficients were 
developed and examined for each coefficient-experiment-model 
combination. A number of patterns were apparent. Using 
Model A with sample sizes of 20, in terms of RMSE and 
variance explained, GLS3 appears to be better than OLS and 
the other GLS techniques. In Model B, with samples of 20, 
OLS comes somewhat closer to the GLS estimators using these 
same criteria. 
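The parametric measures can be computed from the fifty estimates of one coefficient in one experiment. The following is a minimal sketch with hypothetical names and made-up numbers. It assumes a divisor equal to the number of samples, so that mean squared error equals variance plus squared bias exactly; the divisor actually used in the study is not stated.

```python
import math


def summarize(estimates, true_value):
    """Sample mean, bias, variance and RMSE of a set of coefficient estimates."""
    n = len(estimates)
    mean = sum(estimates) / n
    bias = mean - true_value
    variance = sum((e - mean) ** 2 for e in estimates) / n
    mse = sum((e - true_value) ** 2 for e in estimates) / n
    return {"mean": mean, "bias": bias, "variance": variance,
            "rmse": math.sqrt(mse)}


# Illustrative values only, not results from the experiments.
stats = summarize([0.1, 0.3, 0.2, 0.4], true_value=0.2)
print(stats)
```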

Increasing sample size to 100 observations was seen to 
have three effects. (1) Each technique's performance under 
all four criteria improves as would be expected with 
increased sample size. Moreover, the parametric statistics 
(e.g., bias, RMSE and variance) for each estimator are, with 
but few exceptions, one-half their former levels. (2) The 
perceived discrepancy between the performance of the three 
GLS techniques is virtually eliminated. Thus the procedure 
used to accommodate inadmissible variance estimates does not 
appear important with larger samples. (3) Finally, the 
observed discrepancy between the GLS techniques and OLS with 
samples of twenty (20) observations is practically 
eliminated when the sample size is increased to one hundred 
(100) observations. 

Another aspect of the results that was considered 
concerns decentralization. Quandt (see Reference 14) 
introduced the decentralization measure when the primary 
concern was for the frequency of extreme errors. It is 
given by the percent of samples within which the estimated 
coefficients had the wrong sign (relative to the total 
number of samples). Table 3 presents this percentage for 
the OLS and GLS estimators. There is no difference in 
decentralization between GLS1, GLS2 and GLS3; therefore, the 
decentralization for these estimators is reported in Table 3 
under the general heading GLS. In nearly all cases, 
increased sample size greatly reduces or eliminates the 
decentralization that is present in samples of twenty 
observations. Furthermore, the pattern does not appear to 
differ appreciably across the two estimating techniques. 
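The decentralization measure as described reduces to a count of sign reversals. The following is a minimal sketch with a hypothetical function name and made-up numbers. An estimate of exactly zero is counted as not having the wrong sign, which is an assumption.

```python
def decentralization(estimates, true_value):
    """Percent of samples whose estimate has the opposite sign to the true value."""
    if true_value == 0:
        raise ValueError("the sign of a zero coefficient is undefined")
    wrong = sum(1 for e in estimates if e * true_value < 0)
    return 100.0 * wrong / len(estimates)


# Illustrative values only: two of five estimates are negative.
print(decentralization([0.2, -0.1, 0.3, -0.4, 0.5], true_value=0.3))
```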

Also in Table 3 there is a Kendall's coefficient of 
concordance for each structural coefficient across all 
experiments. This nonparametric statistic measures the 
agreement in ranking (according to absolute proximity to the 
true value) of OLS and each GLS estimator across the fifty 
samples of each experiment. This measure is calculated by 
comparing each estimator's rank sum with what it would have 
been if there was perfect agreement. Accordingly, the 
coefficient may be loosely interpreted as the percent of 
perfect agreement in these rankings. 
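Kendall's coefficient of concordance can be computed from the absolute errors of the estimators in each sample. The sketch below uses the usual textbook form W = 12S / (m^2 (n^3 - n)), with m samples and n estimators, where S is the sum of squared deviations of the rank sums from their mean. Tied values receive average ranks, but no tie correction is applied to the denominator. Both choices, and the function names, are assumptions rather than details taken from the study.

```python
def midranks(values):
    """Rank from smallest (rank 1) upward; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        average = (pos + end) / 2.0 + 1.0
        for k in range(pos, end + 1):
            ranks[order[k]] = average
        pos = end + 1
    return ranks


def kendalls_w(abs_errors):
    """abs_errors[s][j] is |estimate - true value| for estimator j in sample s."""
    m = len(abs_errors)
    n = len(abs_errors[0])
    rank_sums = [0.0] * n
    for row in abs_errors:
        for j, r in enumerate(midranks(row)):
            rank_sums[j] += r
    mean_sum = m * (n + 1) / 2.0
    s = sum((r - mean_sum) ** 2 for r in rank_sums)
    return 12.0 * s / (m ** 2 * (n ** 3 - n))


# Illustrative values only: four estimators ranked identically in three samples.
print(kendalls_w([[0.1, 0.2, 0.3, 0.4]] * 3))
```

A value of 1 means every sample ranks the estimators in the same order; a value near 0 means the rank sums are nearly equal, so no estimator is consistently closest to the true value.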

Overall the estimator rankings are not highly 
consistent in either model for any of the GLS techniques. 
It appears that the consistency in the ranking declines with 
increases in sample size. While there are some exceptions, 
this conclusion reinforces the results found for the 
parametric statistics. That is, the techniques do not 
exhibit appreciable difference in performance patterns with 
larger samples. 


ESTIMATES OF THE COEFFICIENT VARIANCES 


An additional aspect of the performance of each 
estimating technique is its estimate of the estimated 
coefficients' covariance matrix. The calculation of these 
variance-covariance matrices is carried out using the 
standard formula referred to later. The OLS estimates of 
the variance-covariance matrix for the estimated 
coefficients are known to be biased in the presence of 
heteroscedastic disturbances (see References 1, 7, 8, 17). 
Still GLS, in principle, should provide asymptotically 
unbiased estimates. Table 4 presents the mean of the fifty 
estimates of the estimated coefficients' variances for 
models A and B. In addition the mean squared errors (MSE) 
of the individual coefficients' estimates are presented for 
comparison. The difference should provide some indication 
of the extent of the bias in the respective estimators. 

Some overall observations are possible. (1) OLS and 
all GLS estimates of the variance in the estimated 
coefficients are biased. The magnitude of the bias 
diminishes with increased sample size. (2) The distinction 
between the three GLS estimators of the variance virtually 
disappears when the sample is increased to a size of one 
hundred. 


TABLE 5: STUDENT'S T-TEST PERFORMANCE 
Percent of Samples for which the Null Hypothesis was Rejected 

Model and         Range in 
Experiment        GLS1-3        GLS2        GLS3 

As a final step we consider the suitability of 
Student's t-tests for hypothesis-testing with linear 
probability models for the dichotomous dependent variables 
studied here. Table 5 shows the percent of the samples in 
which the null hypothesis (coefficients equal zero) was 
rejected at the .05 level. These results indicate that the 
power of this test is quite limited for small samples (20 or 
less). Furthermore, there does not appear to be any 
difference in the performance of the four estimators which 
were considered. However, with increased sample size (100) 
the t-test becomes a much more powerful test and it becomes 
appropriate to test hypotheses with the estimated 
coefficients. 
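The rejection percentage can be sketched as follows. The function name and the numbers are hypothetical. The text does not say whether the test was one- or two-sided; the sketch assumes a two-sided test, for which the tabled .05 critical value with 17 degrees of freedom (twenty observations, three coefficients) is 2.110 and with 97 degrees of freedom is roughly 1.985.

```python
def rejection_rate(estimates, std_errors, critical_value):
    """Percent of samples in which |b / se| exceeds the critical t value."""
    rejected = sum(1 for b, se in zip(estimates, std_errors)
                   if abs(b / se) > critical_value)
    return 100.0 * rejected / len(estimates)


# Illustrative values only: t ratios of 2.5, 0.5, -3.0 and 1.0.
rate = rejection_rate([0.5, 0.1, -0.6, 0.2], [0.2, 0.2, 0.2, 0.2], 2.110)
print(rate)
```

When the true coefficient is not zero, this percentage is an estimate of the power of the test for the given sample size and estimator.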


CONCLUSIONS 


These experiments provide a number of interesting 
findings. First, under small sample conditions, a GLS 
estimator which gives a large weight to observations 
outside the 0 to 1 interval seems to be marginally better 
than OLS when the regressors include a continuous variable. 
However, this ranking is weakened if the regressors are both 
dichotomous variables. Moreover, when the sample size is 
increased (to one hundred observations) there are no 
apparent gains from Aitken (GLS) estimation. The results 
based upon both parametric and nonparametric statistics for 
the estimated coefficients reinforce this finding. 

Second, both the OLS and GLS estimates of the estimated 
coefficients' variances are biased. While the magnitude of 
this bias diminishes with increased sample size, there is no 
apparent difference across estimators. 

Third, the power of the Student's t-test for testing 
hypotheses with the estimated coefficients is very limited 
with small samples irrespective of the estimator selected. 
An increase in the sample size results in a more acceptable 
performance for this test. However, there is still no 
discernible advantage gained by using Aitken (GLS) 
estimation with larger samples. 

Probably of most practical importance is that much 
survey research using dichotomous dependent variables for 
the estimation of linear probability functions utilizes 
fairly large data sets (in excess of one hundred 
observations) for statistical estimation. As a consequence, 
"probit-like" estimators impose substantial computational 
costs. Therefore, OLS or GLS procedures are frequently 
chosen. While our results cannot discern the loss in 
efficiency if an OLS estimator is utilized, they do indicate 
that there is little or no gain from using Aitken estimation 
for such large data sets. Moreover, traditional hypothesis 
testing with the Student's t-test for the coefficients of 
such models can probably be expected to yield reasonably 
powerful tests with large samples. 

A note of caution in closing. It must be emphasized 
that the above conclusions are based upon experimental 
models which assume that specification errors are not 
present in the model. Further research should address 
models of this kind when specification errors are present 
(see TN 6, 20, 29, and 36). 